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HUMAN GENES AND GENE EXPRESSION PRODUCTS 

Cross-Reference t o Related Application 

[0001] Ths application claims the benefit of earlier-filed U.S. provisional application serial no. 
60/226,326 filed August 16, 2000, which application is incorporated herein by reference in its 

entirety. 

Field of the Invention 

[0002] The present invention relates to polynucleotides of human origin, particularly in human 
colon, breast, prostate, and/or lung tissue, and the encoded gene products. 
Background of the Invention 

[0003] Identification of novel polynucleotides, particularly those that encode an expressed gene 
product, is important in the advancement of drug discovery, diagnostic technologies, and the 
understanding of the progression and nature of complex diseases such as cancer. Identification of 
genes expressed in different cell types isolated from sources that differ in disease state or stage, 
developmental stage, exposure to various environmental factors, the tissue of origin, the species 
from which the tissue was isolated, and the like is key to identifying the genetic factors that are 
responsible for the phenotypes associated with these various differences. 

[0004] This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel polynucleotides. 
Summary of the Invention 

[0005] This invention relates to novel human polynucleotides and variants thereof, their 
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides and to 
proteins expressed by the genes. The invention also relates to diagnostics and therapeutics 
comprising such novel human polynucleotides, their corresponding genes or gene products, 
including probes, antisense nucleotides, and antibodies. The polynucleotides of the invention 
correspond to a polynucleotide comprising the sequence information of at least one of SEQ ID 
NOS:1-6010. 

[0006] Various aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 

Detailed D escription of the Invention 
[0007] Before the present invention is described, it is to be understood that this invention is not 
limited to particular embodiments described, as such may, of course, vary. It is also to be 
understood that the terminology used herein is for the purpose of describing particular embodiments 
only, and is not intended to be limiting. 
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groups such as fluororibose and thioate, and nucleotide branches. The sequence of nucleotides may be 
interrupted by non-nucleotide components. A polynucleotide may be further modified after 
polymerization, such as by conjugation with a labeling component. Other types of modifications 
included in this definition are caps, substitution of one or more of the naturally occurring nucleotides 
with an analog, and introduction of means for attaching the polynucleotide to proteins, metal ions 
labeling components, other polynucleotides, or a solid support. 

10013] The terms "polypeptide" and "protein," used interchangebly herein, refer to a polymeric 
form of armno acids of any length, which can include coded and non-coded amino acids, chemically or 
biochemically modified or derivatized amino acids, and polypeptides having modified peptide 
backbones. The term includes fusion proteins, including, but not limited to, fusion proteins with a 
heterologous amino acid sequence, fusions with heterologous and homologous leader sequences with 
or without N-terminal methionine residues; immunologically tagged proteins; and the like. 
[0014] "Diagnosis" as used herein generally includes determination of a subject's susceptibility to 
a disease or disorder, determination as to whether a subject is presently affected by a disease or 
disorder, prognosis of a subject affected by a disease or disorder (e.g., identification of pre-metastatic 
or metastatic cancerous states, stages of cancer, or responsiveness of cancer to therapy), and 
therametrics (e.g., monitoring a subject's condition to provide information as to the effect or efficacy of 
therapy). 

[0015] "Sample" or "biological sample" as used herein encompasses a variety of sample types 
and are generally meant to refer to samples of biological fluids or tissues, particularly samples obtamed 
from tissues, especially from cells of the type associated with a disease or condition for which a 
diagnostic application is designed (e.g., ductal adenocarcinoma), and the like. "Sample" or "biological 
sample" are meant to encompass blood and other liquid samples of biological origin, solid tissue 
samples, such as a biopsy specimen or tissue cultures or cells derived therefrom and the progeny 
thereof. These terms encompass samples that have been manipulated in any way after their 
procurement as well as derivatives and fractions of samples, where the samples may be maniuplated 
by, for example, treatment with reagents, solubilization, or enrichment for certain components The 
terms also encompass clinical samples, and also includes cells in cell culture, cell supematants cell 
lysates, serum, plasma, biological fluids, and tissue samples. Where the sample is solid tissue, the cells 
of the tissue can be dissociated or tissue sections can be analyzed. 

[0016] The terms "treatment," "treating," "treat" and the like are used herein to generally refer to 
obtaining a desired pharmacologic and/or physiologic effect. The effect may be prophylactic in terms 
of completely or partially preventing a disease or symptom thereof and/or may be therapeutic in terms 
of a partial or complete stabilization or cure for a disease and/or adverse effect attributable to the 
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disease. "Treatment" as used herein covers any treatment of a disease in a mammal, particularly a 
human, and includes: (a) preventing the disease or symptom from occurring in a subject which may be 
predisposed to the disease or symptom but has not yet been diagnosed as having it; (b) inhibiting the 
disease symptom, i.e., arresting its development; or relieving the disease symptom, i.e., causing 
regression of the disease or symptom. 

[0017] The terms "individual," "subject," "host," and "patient," used interchangeably herein and 
refer to any mammalian subject for whom diagnosis, treatment, or therapy is desired, particularly 
humans. Other subjects may include cattle, dogs, cats, guinea pigs, rabbits, rats, mice, horses, and so 
on. 

[0018] As used herein the term "isolated" refers to a polynucleotide, a polypeptide, an antibody, 
or a host cell that is in an environment different from that in which the polynucleotide, the polypeptide, 
the antibody, or the host cell naturally occurs. A polynucleotide, a polypeptide, an antibody, or a host 
cell which is isolated is generally substantially purified. As used herein, the term "substantially 
purified" refers to a compound (e.g., either a polynucleotide or a polypeptide or an antibody) that is 
removed from its natural environment and is at least 60% free, preferably 75% free, and most 
preferably 90% free from other components with which it is naturally associated. Thus, for example, a 
composition containing A is "substantially free of* B when at least 85% by weight of the total A+B in 
the composition is A. Preferably, A comprises at least about 90% by weight of the total of A+B in the 
composition, more preferably at least about 95% or even 99% by weight. 

[0019] A "host cell," as used herein, refers to a microorganism or a eukaryotic cell or cell line 
cultured as a unicellular entity which can be, or has been, used as a recipient for a recombinant vector 
or other transfer polynucleotides, and include the progeny of the original cell which has been 
transfected. It is understood that the progeny of a single cell may not necessarily be completely 
identical in morphology or in genomic or total DNA complement as the original parent, due to natural, 
accidental, or deliberate mutation. 

[0020] The terms "cancer," "neoplasm," "tumor," and "carcinoma," are used interchangeably 
herein to refer to cells which exhibit relatively autonomous growth, so that they exhibit an aberrant 
growth phenotype characterized by a significant loss of control of cell proliferation. In general, cells of 
interest for detection or treatment in the present application include precancerous (e.g. , benign), 
malignant, metastatic, and non-metastatic cells. Detection of cancerous cell is of particular interest. 
[0021] The use of "e", as in 10e-3, indicates that the number to the left of "e" is raised to the 
power of the number to the right of "e" (thus, 10e-3 is 10" 3 ). 

[0022] The term "heterologous" as used herein in the context of, for example, heterologous 
nucleic acid or amino acid sequences, heterologous polypeptides, or heterologous nucleic acid, is meant 
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to refer to material that originates from a source different from that with which it is joined or 
associated. For example, two DNA sequences are heterologous to one another if the sequences are 
from different genes or from different species. A recombinant host cell containing a sequence that is 
heterologous to the host cell can be, for example, a bacterial cell containing a sequence encoding a 
human polypeptide. 

[0023] The invention relates to polynucleotides comprising the disclosed nucleotide sequences to 
full length cDNA, mRNA, genomic sequences, and genes corresponding to these sequences and 
degenerate variants thereof, and to polypeptides encoded by the polynucleotides of the invention and 
polypeptide variants. The followmg detailed description describes the polynucleotide compositions 
encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding a full-length 
gene product, expression of these polynucleotides and genes, identification of structural motifs of the 
polynucleotides and genes, identification of the function of a gene product encoded by a gene 
corresponding to a polynucleotide of the invention, use of the provided polynucleotides as probes and 
in mapping and in tissue profiling, use of the corresponding polypeptides and other gene products to 
ra 1S e antibodies, and use of the polynucleotides and their encoded gene products for therapeutic and 
diagnostic purposes. 

Polynucleo tide Comp ngitmng 
[0024] The scope of the invention with respect to polynucleotide compositions includes, but is not 
necessarily lunited to, polynucleotides having a sequence set forth in any one of SEQ ID NOS 1-6010 
polynucleotides obtained from the biological materials described herein or other biological sources 
(particularly human sources) by hybridization under stringent conditions (particularly conditions of 
high stringency); genes corresponding to the provided polynucleotides; variants of the provided 
polynucleotides and their corresponding genes, particularly those variants that retain a biological 
activity of the encoded gene product (e.g., a biological activity ascribed to a gene product 
corresponding to the provided polynucleotides as a result of the assignment of the gene product to a 
protein family(ies) and/or identification of a functional domain present in the gene product) Other 
nucleic acid compositions contemplated by and within the scope of the present invention will be readily 
apparent to one of ordinary skill in the art when provided with the disclosure here. "Polynucleotide" 
and "nucleic acid" as used herein with reference to nucleic acids of the composition is not intended to 
be limiting as to the length or structure of the nucleic acid unless specifically indicated. 
[0025] The invention features polynucleotides that are expressed in human tissue, especially 
human colon, prostate, breast, lung and/or endothelial tissue. Novel nucleic acid compositions of the 
invention of particular interest comprise a sequence set forth in any one of SEQ ID NOS - 1-6010 or an 
identifying sequence thereof. An "identifying sequence" is a contiguous sequence of residues at least 
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about 10 nt to about 20 nt in length, usually at least about 50 nt to about 100 nt in length, that uniquely 
identifies a polynucleotide sequence, e.g. , exhibits less than 90%, usually less than about 80% to about 
85% sequence identity to any contiguous nucleotide sequence of more than about 20 nt. Thus, the 
subject novel nucleic acid compositions include full length cDNAs or mRNAs that encompass an 
identifying sequence of contiguous nucleotides from any one of SEQ ID NOS: 1-6010. 
[0026] The polynucleotides of the invention also include polynucleotides having sequence 
similarity or sequence identity. Nucleic acids having sequence similarity are detected by hybridization 
under low stringency conditions, for example, at 50°C and 10XSSC (0.9 M saline/0.09 M sodium 
citrate) and remain bound when subjected to washing at 55°C in 1XSSC. Sequence identity can be 
determined by hybridization under stringent conditions, for example, at 50°C or higher and 0. 1XSSC 
(9 mM saline/0.9 mM sodium citrate). Hybridization methods and conditions are well known in the 
art, see, e.g., USPN 5,707,829. Nucleic acids that are substantially identical to the provided 
polynucleotide sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the 
provided polynucleotide sequences ( SEQ ID NOS:1-6010) under stringent hybridization conditions. 
By using probes, particularly labeled probes of DNA sequences, one can isolate homologous or related 
genes. The source of homologous genes can be any species, e.g. primate species, particularly human; 
rodents, such as rats and mice; canines, felines, bovines, ovines, equines, yeast, nematodes, etc. 
[0027] Preferably, hybridization is performed using at least 15 contiguous nucleotides (nt) of at 
least one of SEQ ID NOS: 1-6010. That is, when at least 15 contiguous nt of one of the disclosed SEQ 
ID NOS. is used as a probe, the probe will preferentially hybridize with a nucleic acid comprising the 
complementary sequence, allowing the identification and retrieval of the nucleic acids that uniquely 
hybridize to the selected probe. Probes from more than one SEQ ID NO. can hybridize with the same 
nucleic acid if the cDNA from which they were derived corresponds to one mRNA. Probes of more 
than 15 nt can be used, e.g., probes of from about 18 nt to about 100 nt, but 15 nt represents sufficient 
sequence for unique identification. 

[0028] The polynucleotides of the invention also include naturally occurring variants of the 
nucleotide sequences {e.g., degenerate variants, allelic variants, etc.). Variants of the polynucleotides 
of the invention are identified by hybridization of putative variants with nucleotide sequences disclosed 
herein, preferably by hybridization under stringent conditions. For example, by using appropriate wash 
conditions, variants of the polynucleotides of the invention can be identified where the allelic variant 
exhibits at most about 25-30% base pair (bp) mismatches relative to the selected polynucleotide probe. 
In general, allelic variants contain 15-25% bp mismatches, and can contain as little as even 5-15%, or 
2-5%, or 1-2% bp mismatches, as well as a single bp mismatch. 
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[0029J The invention also encompasses homologs corresponding to the polynucleotides of SEQ 
ID NOS: 1-6010, where the source of homologous genes can be any mammalian species, e.g., primate 
species, particularly human; rodents, such as rats; canines, felines, bovines, ovines, equines, yeast, 
nematodes, etc. Between mammalian species, e.g., human and mouse, homologs generally nave 
substantial sequence similarity, e.g., at least 75% sequence identity, usually at least 90%, more usually 
at least 95% between nucleotide sequences. Sequence similarity is calculated based on a reference 
sequence, which may be a subset of a larger sequence, such as a conserved motif, coding region, 
flanking region, etc. A reference sequence will usually be at least about 18 contiguous nt long, more 
usually at least about 30 nt long, and may extend to the complete sequence that is being compared. 
Algorithms for sequence analysis are known in the art, such as gapped BLAST, described in Altschul, 
et al. Nucleic Acids Res. (1997) 25:3389-3402. 

[0030] In general, variants of the invention have a sequence identity greater than at least about 
65%, preferably at least about 75%, more preferably at least about 85%, and can be greater than at 
least about 90% or more as determined by the Smith-Waterman homology search algorithm as 
implemented in MPSRCH program (Oxford Molecular). For the purposes of this invention, a preferred 
method of calculating percent identity is the Smith-Waterman algorithm, using the following. Global 
DNA sequence identity must be greater than 65% as determined by the Smith-Waterman homology 
search algorithm as implemented in MPSRCH program (Oxford Molecular) using an affine gap search 
with the following search parameters: gap open penalty, 12; and gap extension penalty, 1 . 
[0031] The subject nucleic acids can be cDNAs or genomic DNAs, as weil as fragments thereof, 
particularly fragments that encode a biologically active gene product and/or are useful in the methods' 
disclosed herein {e.g., in diagnosis, as a unique identifier of a differentially expressed gene of interest, 
etc.). The term "cDNA" as used herein is intended to include all nucleic acids that share the 
arrangement of sequence elements found in native mature mRNA species, where sequence elements are 
exons and 3' and 5' non-coding regions. Normally mRNA species have contiguous exons, with the 
intervening introns, when present, being removed by nuclear RNA splicing, to create a continuous open 
reading frame encoding a polypeptide of the invention. 

[0032] A genomic sequence of interest comprises the nucleic acid present between the initiation 
codon and the stop codon, as defined in the listed sequences, including all of the introns that are 
normally present in a native chromosome. It can further include the 3 ' and 5 ' untranslated regions 
found in the mature mRNA. It can further include specific transcriptional and translational regulatory 
sequences, such as promoters, enhancers, etc., including about 1 kb, but possibly more, of flanking 
genomic DNA at either the 5' and 3' end of the transcribed region. The genomic DNA can be isolated 
as a fragment of 100 kbp or smaller; and substantially free of flanking chromosomal sequence. The 
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genomic DNA flanking the coding region, either 3 ' and 5', or internal regulatory sequences as 
sometimes found in introns, contains sequences required for proper tissue, stage-specific, or disease- 
state specific expression. 

[0033] The nucleic acid compositions of the subject invention can encode all or a part of the 
subject polypeptides. Double or single stranded fragments can be obtained from the DNA sequence by 
chemically synthesizing oligonucleotides in accordance with conventional methods, by restriction 
enzyme digestion, by PCR amplification, etc. Isolated polynucleotides and polynucleotide fragments 
of the invention comprise at least about 10, about 15, about 20, about 35, about 50, about 100, about 
150 to about 200, about 250 to about 300, or about 350 contiguous nt selected from the polynucleotide 
sequences as shown in SEQ ID NOS: 1-6010. For the most part, fragments will be of at least 15 nt, 
usually at least 18 nt or 25 nt, and up to at least about 50 contiguous nt in length or more. In a 
preferred embodiment, the polynucleotide molecules comprise a contiguous sequence of at least 12 nt 
selected from the group consisting of the polynucleotides shown in SEQ ID NOS: 1-6010. 
[0034] Probes specific to the polynucleotides of the invention can be generated using the 
polynucleotide sequences disclosed in SEQ ID NOS: 1-6010. The probes are preferably at least about 
12, 15, 16, 18, 20, 22, 24, or 25 nt fragment of a corresponding contiguous sequence of SEQ ID 
NOS: 1-6010, and can be less than 2, 1, 0.5, 0. 1, or 0.05 kb in length. The probes can be synthesized 
chemically or can be generated from longer polynucleotides using restriction enzymes. The probes can 
be labeled, for example, with a radioactive, biotinylated, or fluorescent tag. Preferably, probes are 
designed based upon an identifying sequence of a polynucleotide of one of SEQ ID NOS:1-6010. 
More preferably, probes are designed based on a contiguous sequence of one of the subject 
polynucleotides that remain unmasked following application of a masking program for masking low 
complexity (e.g.,XBLAST) to the sequence., i.e., one would select an unmasked region, as indicated by 
the polynucleotides outside the poly-n stretches of the masked sequence produced by the masking 
program. 

[0035] The polynucleotides of the subject invention are isolated and obtained in substantial 
purity, generally as other than an intact chromosome. Usually, the polynucleotides, either as DNA or 
RNA, will be obtained substantially free of other naturally-occurring nucleic acid sequences, generally 
being at least about 50%, usually at least about 90% pure and are typically "recombinant," e.g., 
flanked by one or more nucleotides with which it is not normally associated on a naturally occurring 
chromosome. 

[0036] The polynucleotides of the invention can be provided as a linear molecule or within a 
circular molecule, and can be provided within autonomously replicating molecules (vectors) or within 
molecules without replication sequences. Expression of the polynucleotides can be regulated by their 
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own or by other regulatory sequences known in the art. The polynucleotides of the invention can be 
introduced into suitable host cells using a variety of techniques available in the art, such as transferrin 
polycation-mediated DNA transfer, transfection with naked or encapsulated nucleic acids, liposome- 
mediated DNA transfer, intracellular transportation of DNA-coated latex beads, protoplast fusion, viral 
infection, electroporation, gene gun, calcium phosphate-mediated transfection, and the like. 
[0037] The subject nucleic acid compositions can be used to, for example, produce polypeptides 
as probes for the detection of mRNA of the invention in biological samples (e.g., extracts of human ' 
cells) to generate additional copies of the polynucleotides, to generate ribozymes or antisense 
oligonucleotides, and as single stranded DNA probes or as triple-strand forming oligonucleotides. The 
probes described herein can be used to, for example, determine the presence or absence of the 
polynucleotide sequences as shown in SEQ ID NOS: 1-6010 or variants thereof in a sample. These 
and other uses are described in more detail below. 

Use of Polynucleotides to Obtain Full - T .eneth cDNA P^ne. and Promote 
[0038] In one embodiment, the polynucleotides are useful as starting materials to construct larger 
molecules. In one example, the polynucleotides of the invention are used to construct polynucleotides 
that encode a larger polypeptide {e.g. , up to the full-length native polypeptide as well as fusion proteins 
comprising all or a portion of the native polypeptide) or may be used to produce haptens of the 
polypeptide {e.g., polypeptides useful to generate antibodies). 

[0039] In one particular example, the polynucleotides of the invention are used to make or isolate 
cDNA molecules encoding all or portion of a naturally-occuring polypeptide. Full-length cDNA 
molecules comprising the disclosed polynucleotides are obtained as follows. A polynucleotide having a 
sequence of one of SEQ ID NOS: 1-60 10, or a portion thereof comprising at least 12, 15, 18, or 20 nt, 
is used as a hybridization probe to detect hybridizing members of a cDNA library using probe design 
methods, cloning methods, and clone selection techniques such as those described in USPN 5,654,173. 
Libraries of cDNA are made from selected tissues, such as normal or tumor tissue, or from tissues of a 
mammal treated with, for example, a pharmaceutical agent. Preferably, the tissue is the same as the 
tissue from which the polynucleotides of the invention were isolated, as both the polynucleotides 
described herein and the cDNA represent expressed genes. Most preferably, the cDNA library is made 
from the biological material described herein in the Examples. The choice of cell type for library 
construction can be made after the identity of the protein encoded by the gene corresponding to the 
polynucleotide of the invention is known. This will indicate which tissue and cell types are likely to 
express the related gene, and thus represent a suitable source for the mRNA for generating the cDNA. 
Where the provided polynucleotides are isolated from cDNA libraries, the libraries are prepared from 
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mRNA of human colon cells, more preferably, human colon cancer cells, even more preferably from a 
highly metastatic colon cell, Kml2L4-A. 

[0040J Techniques for producing and probing nucleic acid sequence libraries are described for 
example, m Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring 
Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using primers based on 
polynucleot,descom P rising aS equenceofSEQIDNOS:l-6010. In one embodiment thecDNA 
library can be made from only poly-adenylated mRNA. Thus, poIy-T primers can be used to prepare 
cDNA from the mRNA. 

[0041] Members of the library that are larger than the provided polynucleotides, and preferably 
that encompass the complete coding sequence of the native message, are obtained. In order to confirm 
that the entire cDNA has been obtained, RNA protection experiments are performed as follows 
Hybnchzation of a full-length cDNA to an mRNA will protect the RNA from RNase degradation If 
the cDNA is not full length, then the portions of the mRNA that are not hybridized will be subject to 
RNase degradation. This is assayed, as is known in the art, by changes in electrophoretic mobility on 
polyacrylamide gels, or by detection of released monoribonucleotides. Sambrook et al., Molecular 
Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY 
In order to obtain additional sequences 5' to the end of a partial cDNA, 5' RACE (PCR Protocols- A 
Guide to Methods and Applications, (1990) Academic Press, Inc.) can be performed. 
[0042] Genomic DNA is isolated using the provided polynucleotides in a manner similar to the 
isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions thereof, are used as 
probes to libraries of genomic DNA. Preferably, the library is obtained from the cell type that was 
used to generate the polynucleotides of the invention, but this is not essential. Most preferably the 
genomic DNA is obtained from the biological material described herein in the Examples. Such 
libraries can be in vectors suitable for carrying large segments of a genome, such as P 1 or YAC as 
described in detail in Sambrook et al., supra, 9.4-9.30. In addition, genomic sequences can be isolated 
from human BAC libraries, which are commercially available from Research Genetics, Inc., Huntsville 
Alabama, USA, for example. In order to obtain additional 5' or 3' sequences, chromosome walking is ' 
performed, as described in Sambrook et al., such that adjacent and overlapping fragments of genomic 
DNA are 1S olated. These are mapped and pieced together, as is known in the art, using restriction 
digestion enzymes and DNA ligase. 

[0043] Using the polynucleotide sequences of the invention, corresponding full-length genes can 
be elated using both classical and PCR methods to construct and probe cDNA libraries. Using either 
method, Northern blots, preferably, are performed on a number of cell types to determine which cell 
Imes express the gene of interest at the highest level. Classical methods of constructing cDNA libraries 
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are taught in Sambrook et al., supra. With these methods, cDNA can be produced from mRNA and 
mserted mto viral or expression vectors. Typically, libraries of mRNA comprising P oly(A) tails can be 
produced with poly(T) primers. Similarly, cDNA libraries can be produced using the instant sequences 
as primers. 

[0044J PGR methods are used to amplify the members of a cDNA library that comprise the 
desired insert. In this case, the desired insert will contain sequence from the full length cDNA that 
corresponds to the instant polynucleotides. Such PGR methods include gene trapping and RACE 
methods. GenetrappmgentailsmsertingamemberofacDNAlibraryintoavector. The vector then is 
denatured to produce single stranded molecule, Next, a substrate-bound probe, such as a biotinylated 
ohgo, ,s used to trap cDNA inserts of interest. Biotinylated probes can be linked to an avidin-bound 
sohd substrate. PGR methods can be used to amplify the trapped cDNA. To trap sequences 
corresponding to the full length genes, the labeled probe sequence is based on the polynucleotide 
sequences of the invention. Random primers or primers specific to the library vector can be used to 
amphfy the trapped cDNA. Such gene trapping techniques are described in Gruber et al WO 
95/04745 and Gruber et al., USPN 5,500,356. Kits are commercially available to perform gene 
trappmg experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA. 
[0045] "Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying cDNAs 
from a number of different RNAs. The cDNAs are ligated to an oligonucleotide linker, and amplified 
by PCR using two primers. One primer is based on sequence from the instant polynucleotides for 
which full length sequence is desired, and a second primer comprises sequence that hybridizes to the 
oligonucleotide linker to amplify the cDNA. A description of this method is reported in WO 
97/191 10. In preferred embodiments of RACE, a common primer is designed to anneal to an arbitrary 
adaptor sequence ligated to cDNA ends (Apte and Siebert, Biotechniques (1993) 15:890-893- Edwards 
et al., Nuc. Acids Res. (1991) 19:5227-5232). When a single gene-specific RACE primer is paired 
wth the common primer, preferential amplification of sequences between the single gene specific 
primer and the common primer occurs. Commercial cDNA pools modified for use in RACE are 
available. 

[0046] Another PCR-based method generates full-length cDNA library with anchored ends 
without needing specific knowledge of the cDNA sequence. The method uses lock-docking primers (I- 
VI), where one primer, poly TV (I-III) locks over the polyA tail of eukaryotic mRNA producing first 
strand synthesis and a second primer, polyGH (IV-VI) locks onto the polyC tail added by terminal 
deoxynucleotidyl transferase (TdT)(see, e.g., WO 96/40998). 

[0047] The promoter region of a gene generally is located 5' to the initiation site for RNA 
polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as TATTA or 
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TATAA, which is sensitive to mutations. The promoter region can be obtained by performing 5 ' 
RACE using a primer from the coding region of the gene. Alternatively, the cDNA can be used as a 
probe for the genomic sequence, and the region 5' to the coding region is identified by "walking up " If 
the gene is highly expressed or differentially expressed, the promoter from the gene can be of use in a 
regulatory construct for a heterologous gene. 

[0048] Once the full-length cDNA or gene is obtained, DNA encoding variants can be prepared 
by s,te-directed mutagenesis, described in detail in Sambrook et al., 15.3-15.63. The choice of codon or 
nucleotide to be replaced can be based on disclosure herein on optional changes in amino acids to 
achieve altered protein structure and/or function. 

[0049] As an alternative method to obtaining DNA or RNA from a biological material, nucleic 
acid comprising nucleotides having the sequence of one or more polynucleotides of the invention can 
be synthesized. Thus, the invention encompasses nucleic acid molecules ranging in length from 15 nt 
(corresponding to at least 15 contiguous nt of one of SEQ ID NOS: 1-6010) up to a maximum length 
suitable for one or more biological manipulations, including replication and expression, of the nucleic 
acid molecule. The invention includes but is not limited to (a) nucleic acid having the size of a full 
gene, and comprising at least one of SEQ ID NOS:1-6010; (b) the nucleic acid of (a) also comprising 
at least one additional gene, operably linked to permit expression of a fusion protein; (c) an expression 
vector comprising (a) or (b); (d) a plasmid comprising (a) or (b); and (e) a recombinant viral particle 
comprising (a) or (b). Once provided with the polynucleotides disclosed herein, construction or 
preparation of (a) - (e) are well within the skill in the art. 

[0050] The sequence of a nucleic acid comprising at least 15 contiguous nt of at least any one of 
SEQ ID NOS: 1-6010, preferably the entire sequence of at least any one of SEQ ID NOS. 1-6010, is 
not limited and can be any sequence of A, T, G, and/or C (for DNA) and A, U, G, and/or C (for RNA) 
or modified bases thereof, including inosine and pseudouridine. The choice of sequence will depend on 
the desired function and can be dictated by coding regions desired, the intron-like regions desired, and 
the regulatory regions desired. Where the entire sequence of any one of SEQ ID NOS: 1-6010 is within 
the nucleic acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the 
sequence of any one of SEQ ID NOS : 1 -60 1 0. 

Expression of Polypeptide Encode h y Full-Lenp th r. DNA or F..11-T 
[0051] The provided polynucleotides (e.g., a polynucleotide having a sequence of one of SEQ ID 
NOS: 1-6010), the corresponding cDNA, or the full-length gene is used to express a partial or complete 
gene product. Constructs of polynucleotides having sequences of SEQ ID NOS: 1 -60 1 0 can also be 
generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid from large 
numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et al., Gene (Amsterdam) (1995) 
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164(l):49-53. In this method, assembly PGR (the synthesis of long DNA sequences from large 
numbers of oligodeoxyribonucleotides (oligos)) is described. The method is derived from DNA 
shufmng(Stemmer,Nature(1994)370:389-391) ! and does not rely on DNA ligase, but instead relies 
on DNA polymerase to build increasingly longer DNA fragments during the assembly process 
[0052] Appropriate polynucleotide constructs are purified using standard recombinant DNA 
toques as described in, for example, Sambrook et al, Molecular Clonin g : A Laboratory Manual 

descnbed in United States Dept. of HHS, National Institute of Health (NIH) Guidelines for 
Recombinant DNA Research. The gene product encoded by a polynucleotide of the invention is 
expressed in any expression system, including, for example, bacterial, yeast, insect, amphibian and 
mammalian systems. Vectors, host ells and methods for obtaining expression in same are well known 
in the art. Suitable vectors and host cells are described in USPN 5,654,173. 
[0053] Polynucleotide molecules comprising a polynucleotide sequent provided herein are 
generally propagated by placing the molecule in a vector. Viral and non-viral vectors are used 
mcluding plasmids. The choic* of plasmid will depend on the type of cel. in which propagation is 
desired and the purpose of propagation. Certain vectors are useful for amplifying and making large 
amounts of the desired DNA sequence. Other vectors are suitable for expression in cells in culture 
Still other vectors are suitable for transfer and expression in ells in a whole animal or person The 
choics of appropriate vector is well within the skill of the art. Many such vectors are available 
commercially. Methods for preparation of vectors comprising a desired sequence are well know, in the 
art. 

[0054] The polynucleotides set forth in SEQ ID NOS: 1-6010 or their corresponding full-length 
polynucleotides are linked to regulatory sequences as appropriate to obtain the desired expression 
properties. These can include promoters (attached either at the 5' end of the sense strand or at the 3' 
end of the antisense strand), enhancers, terminators, operators, repressors, and inducers The 
promoters can be regulated or constitutive. In some situations it may be desirable to use conditionally 
active promoters, such as tissue-specific or developmental stage-specific promoters. These are linked 
to the desired nucleotide sequence using the techniques described above for linkage to vectors. Any 
techniques known in the art can be used. 

[0055] When any of the above host cells, or other appropriate host cslls or organisms, are used to 
replicate and/or express the polynucleotides or nucleic acids of the invention, the resulting replicated 
nucleic acd, RNA, expressed protein or polypeptide, is within the scope of the invention as a product 
of the host cell or organism. The product is recovered by any appropriate means known in the art 
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[0056] Once the gene cc^resp^ding ,„ . ^ polynncIeotMe is , 

teregrfatedm to cento which the gene is native Fore,.™.. . ., 

, . . S °"™ c - f or example, an endogenous gene of a cell can be 

regulated by an exogenous regulatory sequence as disclosed in USPN 5,641,670. 

Identification of FinttA-l ^ stn,^,„i 
[0057) T™™™*^**^^^ 

= be ahgned with indivdua, fcnown sequences. ShnilaHty with individual sciences ^ 
7»^^vi*„ftop.,ypepudes^ 

^sequence, exhibiting similan* witi, « than c™ individual sequence can exhibit activities to. 
are characteristic of either or both individual sequences. 

neighbors can be used as probes and primers to identify and isol*e the fid, lengd, sequence 

polynucleotides. 

[0059J Typically, , selected polymiCeotide is tr^iated - all six frames ,„ determtae to best 
ahgnmen, wtth to u^dua, sequences. The sequences disced herein m to Sequence Listing are in 
a to 3 onentation and translation in three frames . be sufficient (with a few specific exceptions as 

which „,„ be ahgued with to individual sequences. Dafcbases with individual sequences are 

(1 96, 266, Dooht«,e, Academic Press, he., a division of Harcourt Brace * Co., Sau Diego 
Cahforma, USA. Databases include GenBauk, EMBL, ^ DNA Database of Japan (DDBfl 
[0060] Queo, and individual sequences can be aligned using to methods and computet 

N.„o M , Center fo, Biotech»o,„gy motion, which is supported by to National Lib^ of 
^^^^^Healu, SeealsoAltschul, eta,. Nucleic Acids Res. (,997) 

inc. Otor techniques for alignment are described in Doolittie, supra. Preferably, an aligranen, 

o^of.lg^ti^tpermiugapsm^alignn.cn.s. S eeMeti,.Mol. Bio,. (1997, 70 ,73- 
87. A,so, to GAP prog™, using to Needleman and Wuiach aligumcn, method can be u,i, iz ed to 
align sequences. An alternative search strategy uses MPSRCH software, which rure on a MASPAR 
computer. MPSRCH uses a Smith-Waterman a, gorithm ,„ scone sequences on a massively parane, 
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computer. This approach improves ability to identify sequences that are distantly related matches and 

"^^^m.^m^^ m ^^^^^ 

teprovKMpoiynue^fe^^^^^^^^^^^ ^ 

by any of the DNA or protein sequence databases, including the paten, databases (,*., GeneSeq) 
Also mcotporated by reference are those sequences that have been submitted «, these databases as of 
the f,l»g date of the present application but no, madepubhc until after the filmg date of the present 

application. 

[00M, Results of individual and quay sequence aligmuettts can be divided into three categories- 
htgh stnultrty, weak similarity, and no similarity. Individutu ahgmuen. results ranging fen, high 
stmtlanty to weak simiiarity provide a basts for determining polypeptide activity and/or structure 
Parameter for categorizing individual resu,ts indude: percentage of the aligmnen. region length where 
■he stages, aligrcnen, is found, percent sequence .dentity, and p value. The percentage of the 
ahgnmen, region length !s alailaKd by ^ ^ q[ ^ ^ 

found m me region of strongest aligrnnent, ,g„ con^ region of to ^vidua. ^ 
contains the greatest number of residues mat are idemtc.1 to the residues of the corresponding regton of 
the ahgned query sequence. This number is divided by the total residue ,engm of the query sequence to 
emulate a percentage. For example, a „ sequence of 20 amino acid residues might be aligm*, with 
a 20 ammo acid region of an mdividutu sequence. The indtvidual sequence might be identic^ to amino 
actd residues 5, 9.15, ,md 17-19„fthe quety sequence. The region of strongest alignment is thus the 
regton stretehtag f™ residue 9-19, an 1, amino acid streteh. The percentage of the ahgmnen, region 
length,*: 11 (^of tore^of m^di^)^^^^^^^^ 
55%. 

[0062] P ercent sequence identity is Mated by ^ rf ^ ^ 

between the query and individual sequence and dividing total number of matches by the number of 
rescues of the individual sequence found in the region of strongest alignment. Thus, the percent 
.denfcty m the example above would be 10 matches divided by 1 1 amino acids, or approximately, 

90.9% 

[0063] P value is the probability that the alignment was produced by chance. For a single 
alignment, the p value can be calculated according to Karlin et al., Proa Natl. Acad Sci (1990) 
87:2264 and Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The p value of multiple alignments using 
the same query sequence can be calculated using an heuristic approach described in Altschul et al Nat 
Genet. (1994) 6:1 19. Alignment programs such as BLAST program can calculate the p value. See 
also Altschul et al., Nucleic Acids Res. (1997) 25:3389-3402. 
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align can lie determined according to Doolittle, supra; BLAST 2.0 (see eg Lchul etalTT 
Acids Res. (1997) 25 3389-34021 or fast . ^ "'^ 

^highest. ' " <~ « «» - whe« sequence 

inore typically, at least about 58%; even more typically; at least about 60% of the total residue length 
* *. oueiy sequence. Usua% , percent lengdi of the aligmnen, region can be .s much as about 62^ 

more usually, as much as about 86%. 

100661 Thepv^ueisusedinconjunc.onwidid^me^. * high similar* is found, the 
the queiy sequence to be considered high similarity. 

ZLomnT™' ^^^^'^'-^^eofweaksimi^, 

™ acrf residues m lengd,; more »pic.,,, . ta about 20; _ _ . 
annno residues u,,engm.Usual,y,,c„ gU ,o fmeal ^ m ^ on ^ teasmucIiss 
*mno acid residues; more usuany, as much as about 40; even more usually, as much as about 60 

45 /.sequence identity. Us „ aUy , ^ sequence idenu* can be as much as about 50%- more 

usually, as much as about 55%; even more usually, .s much as about 60% 

[006»] ^^^^^^^^^ 

P^ilesequencewhenthepvalueis usually less ^ or equal to about 10e-2; mo« usua U y, ,1 ta 
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*™*anab.„. u**.-.^^^^^ lv XJ^ 
-M, or a w sequence M _ Mvidual seq _ md ^ ^ ;*<° 

sequence ^ 

^ sequence * ^ „ te , 80 ^ ^ ^ ^ , east90residuesin 

tag* -ore prefer,b,y, 12 0 residues m ^ even more preferably, ,50 amino acid residues in 

' 0 ! ,7 ° , , Atafflataa! ^^ Trans,auc^ of the provided 

rao^f A1 11 CaI [ ^ a '' 8IW< ' am " n0 profiles that define either protein families or^ common 
mo 0 f,A,so,« 0 »s 0 f Ih e prOTMpoWWdescaobealisiralio 

products (e.g., polypeptides, enccOed by the provide po^lecides or ccrespondmg cDNA or 
genes. For example, sequences mat show an identity „ r similarity . ehemokine proiHe or MSA 

can exhibit chemokine activities. 

(»07i, ^-hdM^-^^a,,,,.^.^^,,..,,^ 

™m„ ao,d setn«nce of members tha, belong «, the fami,y and (2) constincting a statistical 
represen^onofmeatignment. Such memods are described, for example, in Brmeye. d Nucl Acid 

For example, the Genome Sequencing Center a. th„ Washington University School of Medicine ' 
provtdes a web se, (Pfara) which provides MSAs of 547 different famihes and motifs. These MSAs 
^ desenbed also in Sonnhammer e, a,., Proteins (,997, 2 8: 405-420. Other sources over the world 

9(3) 249-251. Techntques for building profiles from MSAs are described in Sonrutammer e. al supra- 
Enzymology (,996) 266, Doo,ittle, Academic Press, ,ne„ San Diego, Cahfonua, USA 
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t»mly or mot,f Typ,cally, , program sllch M Searchwise is used to compare me query 

cab™ n^r* * Fe ° 8e ' " • " MoL EvoL <1987) 25:351 - « 

also^m^ n!rc *^cd align the query sequence with the members of a family or motif 

W mclude PileUp, whic h creaks a mtrtiple science anient, and ls cJLd „ P^l 

° APiSb — ofsequenees. Ami^ 

Igonthm of Srmth e, a,., Adv. App,. Math. C .98„ 2: 482. h genera,, the fo„o»™, 8 facrs arel W 

conserved residues found in the que^ sequence, (^percentage „f conserved residues found u, the 
que* sequence, (3, number of frameshifts, and (4, spacing between conserved residues 
[0074] Some ahgnmen. programs tha, bod, transhtte „d align sequer^ c„ make any number of 
^^^^^^^^^^ *"* 

»dpo fiIe MSAs. ^example.awea.stai^resul^.omnoWshasc.beabl 
^cau™of^v, PyOT s^ hmo f, qtt ^ s ^ )taa ^ 

fewer frames,.*; even more pretably, one or fewer frameshifls; even more preferably „„ 
frameshifts are found in an alignment of query md profile or MSAs. 

fOOTS, Cor^edresite^mose^^dsfoundatapardcmarpos.Uc.manors.neof 
conserved whenaCass or ^ isfound inat leastabc., 70o/„„f memembersof afamtly 
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or motif; more usually, at least about 80%; even more usually, at least about 90%; even more usually, 
at least about 95%. 

[0077] A residue is considered conserved when three unrelated amino acids are found at a 
particular position in some or all of the members; more usually, two unrelated amino acids. These 
residues are conserved when the unrelated amino acids are found at particular positions in at least 
about 40% of all class member; more typically, at least about 50%; even more typically, at least about 
60% of the members. Usually, a residue is conserved when a class or single amino acid is found in at 
least about 70% of the members of a family or motif; more usually, at least about 80%; even more 
usually, at least about 90%; even more usually, at least about 95%. 

[0078] A query sequence has similarity to a profile or MSA when the query sequence comprises 
at least about 25% of the conserved residues of the profile or MSA; more usually, at least about 30%; 
even more usually; at least about 40%. Typically, the query sequence has a stronger similarity to a 
profile sequence or MSA when the query sequence comprises at least about 45% of the conserved 
residues of the profile or MSA; more typically, at least about 50%; even more typically, at least about 
55%. 

[0079] Identification of Secreted & Membrane-Bo, mH Poly p^H^ P ~ th , rrrrtrf1 m<] 
membrane-bound polypeptides of the present invention are of particular interest. For example, levels 
of secreted polypeptides can be assayed in body fluids that are convenient, such as blood, plasma, 
serum, and other body fluids such as urine, prostatic fluid and semen. Membrane-bound polypeptides 
are useful for constructing vaccine antigens or inducing an immune response. Such antigens would 
comprise all or part of the extracellular region of the membrane-bound polypeptides. Because both 
secreted and membrane-bound polypeptides comprise a fragment of contiguous hydrophobic amino 
acids, hydrophobicity predicting algorithms can be used to identify such polypeptides. 
[0080] A signal sequence is usually encoded by both secreted and membrane-bound polypeptide 
genes to direct a polypeptide to the surface of the cell. The signal sequence usually comprises a stretch 
of hydrophobic residues. Such signal sequences can fold into helical structures. Membrane-bound 
polypeptides typically comprise at least one transmembrane region that possesses a stretch of 
hydrophobic amino acids that can transverse the membrane. Some transmembrane regions also exhibit 
a helical structure. Hydrophobic fragments within a polypeptide can be identified by using computer 
algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981) 78:3824- 
3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157: 105-132; and RAOAR algorithm, Degli Esposti et 
al., Eur. J. Biochem. (1990) 190: 207-219. 

[0081] Another method of identifying secreted and membrane-bound polypeptides is to translate 
the polynucleotides of the invention in all six frames and determine if at least 8 contiguous 
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hydrophobic amino acids are present. Those translated polypeptides with at least 8; more typically, 10; 
even more typically, 12 contiguous hydrophobic amino acids are considered to be either a putative 
secreted or membrane bound polypeptide. Hydrophobic amino acids include alanine, glycine, histidine, 
isoleucine, leucine, lysine, methionine, phenylalanine, proline, threonine, tryptophan, tyrosine, and 
valine 

Identification of the Function of an Expression Product of a Full-Length Gene 

[0082] Ribozymes, antisense constructs, and dominant negative mutants can be used to determine 
function of the expression product of a gene corresponding to a polynucleotide provided herein. These 
methods and compositions are particularly useful where the provided novel polynucleotide exhibits no 
significant or substantial homology to a sequence encoding a gene of known function. Antisense 
molecules and ribozymes can be constructed from synthetic polynucleotides. Typically, the 
phosphoramidite method of oligonucleotide synthesis is used. See Beaucage et al., Tet Lett. (1981) 
22; 1 859 and USPN 4,668,777. Automated devices for synthesis are available to create 
oligonucleotides using this chemistry. Examples of such devices include Biosearch 8600, Models 392 
and 394 by Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California, USA; and 
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, phosphate 
analog oligonucleotides, and chemically derivatized oligonucleotides can also be produced, and can be 
covalently attached to other molecules. RNA oligonucleotides can be synthesized, for example, using 
RNA phosphoramidites. This method can be performed on an automated synthesizer, such as Applied 
Biosystems, Models 392 and 394, Foster City, California, USA. 

[0083] Phosphorothioate oligonucleotides can also be synthesized for antisense construction. A 
sulfurizing reagent, such as tetraethylthiruam disulfide (TETD) in acetonitrile can be used to convert 
the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 minutes at room 
temperature. TETD replaces the iodine reagent, while all other reagents used for standard 
phosphoramidite chemistry remain the same. Such a synthesis method can be automated using Models 
392 and 394 by Applied Biosystems, for example. 

[0084] Oligonucleotides of up to 200 nt can be synthesized, more typically, 100 nt; more typically 
50 nt; even more typically, 30 to 40 nt. These synthetic fragments can be annealed and ligated together 
to construct larger fragments. See, for example, Sambrook et al., supra. Trans-cleaving catalytic 
RNAs (ribozymes) are RNA molecules possessing endoribonuclease activity. Ribozymes are 
specifically designed for a particular target, and the target message must contain a specific nucleotide 
sequence. They are engineered to cleave any RNA species site-specifically in the background of 
cellular RNA. The cleavage event renders the mRNA unstable and prevents protein expression. 
Importantly, ribozymes can be used to inhibit expression of a gene of unknown function for the 



20 




purpose of determining its function in an in vitro or in vivo context, by detecting the phenotypic effect. 
One commonly used ribozyme motif is the hammerhead, for which the substrate sequence 
requirements are minimal Design of the hammerhead ribozyme, as well as therapeutic uses of 
ribozymes, are disclosed in Usman et al, Current Opin. Struct. Biol. (1996) 6:527. Methods for 
production of ribozymes, including hairpin structure ribozyme fragments, methods of increasing 
ribozyme specificity, and the like are known in the art. 

[0085] The hybridizing region of the ribozyme can be modified or can be prepared as a branched 
structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 17:6959. The basic structure of 
the ribozymes can also be chemically altered in ways familiar to those skilled in the art, and chemically 
synthesized ribozymes can be administered as synthetic oligonucleotide derivatives modified by 
monomelic units. In a therapeutic context, liposome mediated delivery of ribozymes improves cellular 
uptake, as described in Birikh et al., Eur. J. Biochem. (1997) 245: 1. 

[0086] Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 
polynucleotide sequence can interfere with expression of the corresponding gene. Antisense 
polynucleotides are typically generated within the cell by expression from antisense constructs that 
contain the antisense strand as the transcribed strand. Antisense polynucleotides based on the 
disclosed polynucleotides will bind and/or interfere with the translation of mRNA comprising a 
sequence complementary to the antisense polynucleotide. The expression products of control cells and 
cells treated with the antisense construct are compared to detect the protein product of the gene 
corresponding to the polynucleotide upon which the antisense construct is based. The protein is 
isolated and identified using routine biochemical methods. 

[0087] Given the extensive background literature and clinical experience in antisense therapy, one 
skilled in the art can use selected polynucleotides of the invention as additional potential therapeutics. 
The choice of polynucleotide can be narrowed by first testing them for binding to "hot spot" regions of 
the genome of cancerous cells. If a polynucleotide is identified as binding to a "hot spot," testing the 
polynucleotide as an antisense compound in the corresponding cancer cells is warranted. 
[0088] As an alternative method for identifying function of the gene corresponding to a 
polynucleotide disclosed herein, dominant negative mutations are readily generated for corresponding 
proteins that are active as homomultimers. A mutant polypeptide will interact with wild-type 
polypeptides (made from the other allele) and form a non-functional multimer. Thus, a mutation is in a 
substrate-binding domain, a catalytic domain, or a cellular localization domain. Preferably, the mutant 
polypeptide will be overproduced. Point mutations are made that have such an effect. In addition, 
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fusion of different polypeptides of various lengths to the terminus of a protein can yield dominant 
negative mutants. General strategies are available for making dominant negative mutants (see, e.g., 
Herskowitz, Nature (1987) 329:2 19). Such techniques can be used to create loss of function 
mutations, which are useful for determining protein function. 

Polypeptides and Variants Thereof 
[0089] The polypeptides of the invention include those encoded by the disclosed polynucleotides, 
as well as nucleic acids that, by virtue of the degeneracy of the genetic code, are not identical in 
sequence to the disclosed polynucleotides. Thus, the invention includes within its scope a polypeptide 
encoded by a polynucleotide having the sequence of any one of SEQ ID NOS.1-6010 or a variant 



thereof. 



[0090] In general, the term "polypeptide" as used herein refers to both the full length polypeptide 
encoded by the recited polynucleotide, the polypeptide encoded by the gene represented by the recited 
polynucleotide, as well as portions or fragments thereof. "Polypeptides" also includes variants of the 
naturally occurring proteins, where such variants are homologous or substantially similar to the 
naturally occurring protein, and can be of an origin of the same or different species as the naturally 
occurring protein (e.g., human, murine, or some other species that naturally expresses the recited 
polypeptide, usually a mammalian species). In general, variant polypeptides have a sequence that has 
at least about 80%, usually at least about 90%, and more usually at least about 98% sequence identity 
with a differentially expressed polypeptide of the invention, as measured by BLAST 2.0 using the 
parameters described above. The variant polypeptides can be naturally or non-naturally glycosylated, 
i.e., the polypeptide has a glycosylation pattern that differs from the glycosylation pattern found in the 
corresponding naturally occurring protein. 

[0091] The invention also encompasses homologs of the disclosed polypeptides (or fragments 
thereof) where the homologs are isolated from other species, i.e. other animal or plant species, where 
such homologs, usually mammalian species, e.g. rodents, such as mice, rats; domestic animals, e.g., 
horse, cow, dog, cat; and humans. By "homolog" is meant a polypeptide having at least about 35%, 
usually at least about 40% and more usually at least about 60% amino acid sequence identity to a 
particular differentially expressed protein as identified above, where sequence identity is determined 
using the BLAST 2.0 algorithm, with the parameters described supra. 

[0092] In general, the polypeptides of the subject invention are provided in a non-naturally 
occurring environment, e.g. are separated from their naturally occurring environment. In certain 
embodiments, the subject protein is present in a composition that is enriched for the protein as 
compared to a control. As such, purified polypeptide is provided, where by purified is meant that the 
protein is present in a composition that is substantially free of non-differentially expressed 
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polypeptides, where by substantially free is meant that less than 90%, usually less than 60% and more 
usually less than 50% of the composition is made up of non-differentially expressed polypeptides. 
[0093] Also within the scope of the invention are variants; variants of polypeptides include 
mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or deletions. 
The amino acid substitutions can be conservative amino acid substitutions or substitutions to eliminate 
non-essential amino acids, such as to alter a glycosylation site, a phosphorylation site or an acetylation 
site, or to minimize misfolding by substitution or deletion of one or more cysteine residues that are not 
necessary for function. Conservative amino acid substitutions are those that preserve the general 
charge, hydrophobicity/ hydrophilicity, and/or steric bulk of the amino acid substituted. Variants can 
be designed so as to retain or have enhanced biological activity of a particular region of the protein 
(e.g., a functional domain and/or, where the polypeptide is a member of a protein family, a region 
associated with a consensus sequence). Selection of amino acid alterations for production of variants 
can be based upon the accessibility (interior vs. exterior) of the amino acid (see, e.g., Go et al, Int. J. 
Peptide Protein Res. (1980) 15:211), the thermostability of the variant polypeptide (see, e.g., Querol et 
al, Prot. Eng. (1996) 9:265), desired glycosylation sites (see, e.g., Olsen and Thomsen, J. Gen. 
Microbiol. (1991) 137:579), desired disulfide bridges (see, e.g., Clarke et al., Biochemistry (1993) 
32:4322; and Wakarchuk et al., Protein Eng. (1994) 7: 1379), desired metal binding sites (see, e.g., 
Toma et al, Biochemistry (1991) 30:97, and Haezerbrouck et al., Protein Eng. (1993) 6:643), and 
desired substitutions within proline loops (see, e.g., Masul et al., Appl. Env. Microbiol. (1994) 
60:3579). Cysteine-depleted muteins can be produced as disclosed in USPN 4,959,3 14. 
[0094] Variants also include fragments of the polypeptides disclosed herein, particularly haptens, 
biologically active fragments, and/or fragments corresponding to functional domains. Fragments of 
interest will typically be at least about 10 aa to at least about 15 aa in length, usually at least about 50 
aa in length, and can be as long as 300 aa in length or longer, but will usually not exceed about 1000 aa 
in length, where the fragment will have a stretch of amino acids that is identical to a polypeptide 
encoded by a polynucleotide having a sequence of any SEQ ID NOS: 1-6010, or a homolog thereof 
The protein variants described herein are encoded by polynucleotides that are within the scope of the 
invention. The genetic code can be used to select the appropriate codons to construct the 
corresponding variants. 

Computer-Related Embodiments 
[0095] In general, a library of polynucleotides is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of polynucleotide molecules), 
or m electronic form (e.g., as a collection of polynucleotide sequences stored in a computer-readable 
form, as in a computer system and/or as part of a computer program). The sequence information of the 
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polynucleotides can be used in a variety of ways, e.g., as a resource for gene discovery, as a 
representation of sequences expressed in a selected cell type (e.g., cell type markers), and/or as markers 
of a given disease or disease state. In general, a disease marker is a representation of a gene product 
that is present in all cells affected by disease either at an increased or decreased level relative to a 
normal cell (e.g., a cell of the same or similar type that is not substantially affected by disease). For 
example, a polynucleotide sequence in a library can be a polynucleotide that represents an mRNA, 
polypeptide, or other gene product encoded by the polynucleotide, that is either overexpressed or 
underexpressed in a breast ductal cell affected by cancer relative to a normal (i.e., substantially disease- 
free) breast cell. 

[0096] The nucleotide sequence information of the library can be embodied in any suitable form, 
e.g., electronic or biochemical forms. For example, a library of sequence information embodied in 
electronic form comprises an accessible computer data file (or, in biochemical form, a collection of 
nucleic acid molecules) that contains the representative nucleotide sequences of genes that are 
differentially expressed (e.g., overexpressed or underexpressed) as between, for example, i) a 
cancerous cell and a normal cell; ii) a cancerous cell and a dysplastic cell; iii) a cancerous cell and a cell 
affected by a disease or condition other than cancer; iv) a metastatic cancerous cell and a normal cell 
and/or non-metastatic cancerous cell; v) a malignant cancerous cell and a non-malignant cancerous cell 
(or a normal cell) and/or vi) a dysplastic cell relative to a normal cell. Other combinations and 
comparisons of cells affected by various diseases or stages of disease will be readily apparent to the 
ordinarily skilled artisan. Biochemical embodiments of the library include a collection of nucleic acids 
that have the sequences of the genes in the library, where the nucleic acids can correspond to the entire 
gene in the library or to a fragment thereof, as described in greater detail below. 
[0097] The polynucleotide libraries of the subject invention generally comprise sequence 
information of a plurality of polynucleotide sequences, where at least one of the polynucleotides has a 
sequence of any of SEQ ID NOS: 1-6010. By plurality is meant at least 2, usually at least 3 and can 
include up to all of SEQ ID NOS: 1-6010. The length and number of polynucleotides in the library 
will vary with the nature of the library, e.g., if the library is an oligonucleotide array, a cDNA array, a 
computer database of the sequence information, etc. 

[0098] Where the library is an electronic library, the nucleic acid sequence information can be 
present in a variety of media. "Media" refers to a manufacture, other than an isolated nucleic acid 
molecule, that contains the sequence information of the present invention. Such a manufacture 
provides the genome sequence or a subset thereof in a form that can be examined by means not directly 
applicable to the sequence as it exists in a nucleic acid. For example, the nucleotide sequence of the 
present invention, e.g. the nucleic acid sequences of any of the polynucleotides of SEQ ID NOS: 1- 
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6010, can be recorded on computer readable media, e.g. any medium that can be read and accessed 
directly by a computer. Such media include, but are not limited to: magnetic storage media, such as a 
floppy disc, a hard disc storage medium, and a magnetic tape; optical storage media such as CD-ROM; 
electrical storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. One of skill in the art can readily appreciate how any of the presently 
known computer readable mediums can be used to create a manufacture comprising a recording of the 
present sequence information. "Recorded" refers to a process for storing information on computer 
readable medium, using any such methods as known in the art. Any convenient data storage structure 
can be chosen, based on the means used to access the stored information. A variety of data processor 
programs and formats can be used for storage, e.g. word processing text file, database format, etc. In 
addition to the sequence information, electronic versions of the libraries of the invention can be 
provided in conjunction or connection with other computer-readable information and/or other types of 
computer-readable files (e.g., searchable files, executable files, etc, including, but not limited to, for 
example, search program software, etc.). 

[0099] By providing the nucleotide sequence in computer readable form, the information can be 
accessed for a variety of purposes. Computer software to access sequence information is publicly 
available. For example, the gapped BLAST (Altschul et al. Nucleic Acids Res. (1997) 25:3389-3402) 
and BLAZE (Brutlag et al. Comp. Chem. (1993) 17:203) search algorithms on a Sybase system can be 
used to identify open reading frames (ORFs) within the genome that contain homology to ORFs from 
other organisms. 

[00100] As used herein, "a computer-based system" refers to the hardware means, software means, 
and data storage means used to analyze the nucleotide sequence information of the present invention. ' 
The minimum hardware of the computer-based systems of the present invention comprises a central 
processing unit (CPU), input means, output means, and data storage means. A skilled artisan can 
readily appreciate that any one of the currently available computer-based system are suitable for use in 
the present invention. The data storage means can comprise any manufacture comprising a recording 
of the present sequence information as described above, or a memory access means that can access 
such a manufacture. 

[00101] "Search means" refers to one or more programs implemented on the computer-based 
system, to compare a target sequence or target structural motif, or expression levels of a polynucleotide 
in a sample, with the stored sequence information. Search means can be used to identify fragments or 
regions of the genome that match a particular target sequence or target motif. A variety of known 
algorithms are publicly known and commercially available, e.g. MacPattern (EMBL), BLASTN and 
BLASTX (NCBI). A "target sequence" can be any polynucleotide or amino acid sequence of six or 
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more contiguous nucleotides or two or more amino acids, preferably from about 10 to 100 amino acids 
or from about 30 to 300 nt A variety of comparing means can be used to accomplish comparison of 
sequence information from a sample (e.g., to analyze target sequences, target motifs, or relative 
expression levels) with the data storage means. A skilled artisan can readily recognize that any one of 
the publicly available homology search programs can be used as the search means for the computer 
based systems of the present invention to accomplish comparison of target sequences and motifs. 
Computer programs to analyze expression levels in a sample and in controls are also known in the art. 
[00102] A "target structural motif," or "target motif," refers to any rationally selected sequence or 
combination of sequences in which the sequence(s) are chosen based on a three-dimensional 
configuration that is formed upon the folding of the target motif, or on consensus sequences of 
regulatory or active sites. There are a variety of target motifs known in the art. Protein target motifs 
include, but arc not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs 
include, but are not limited to, hairpin structures, promoter sequences and other expression elements 
such as binding sites for transcription factors. 

[00103] A variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. One format for an 
output means ranks the relative expression levels of different polynucleotides. Such presentation 
provides a skilled artisan with a ranking of relative expression levels to determine a gene expression 
profile. 

[00104] As discussed above, the "library" of the invention also encompasses biochemical libraries 
of the polynucleotides of SEQ ID NOS.1-6010 , e.g., collections of nucleic acids representing the 
provided polynucleotides. The biochemical libraries can take a variety of forms, e.g., a solution of 
cDNAs, a pattern of probe nucleic acids stably associated with a surface of a solid support (i.e., an 
array) and the like. Of particular interest are nucleic acid arrays in which one or more of SEQ ID 
NOS:1-6010 is represented on the array. By array is meant a an article of manufacture that has at least 
a substrate with at least two distinct nucleic acid targets on one of its surfaces, where the number of 
distinct nucleic acids can be considerably higher, typically being at least 10, usually at least 20, and 
often at least 25 distinct nucleic acid molecules. A variety of different array formats have been 
developed and are known to those of skill in the art. The arrays of the subject invention find use in a 
variety of applications, including gene expression analysis, drug screening, mutation analysis and the 
like, as disclosed in the above-listed exemplary patent documents. 

[00105] In addition to the above nucleic acid libraries, analogous libraries of polypeptides are also 
provided, where the polypeptides of the library will represent at least a portion of the polypeptides 
encoded by a gene corresponding to one or more of SEQ ID NOS: 1-6010. 



26 



• 



Utilities 



a 



[00106] The polynucleotides of the invention are useful in a variety of applications. Exemplary 
utihes of the polynucleotides of the invention are described below. 

1001071 Co **ruction **** Molecules- Recombinant DNA, ^ xw l eic AHH 1Ulllt; _ r , m 
one embodiment of particular interest, the polynucleotides described herein as useful as the building 
blocks for larger molecules. In one example, the polynucleotide is a component of a larger cDNA 
molecule which in turn can be adapted for expression in a host cell (e.g. , a bacterial or eukaryotic (. g. 
yeast or mammalian) host cell). The cDNA can include, in addition to the polypeptide encoded by me ' 
starting material polynucleotide (/.,, a polynucleotide described herein), an amino acid sequence that is 
heterologous to the polypeptide encoded by the polynucleotide described herein as in a sequence 
encoding a fusion protein). In some embodiments, the polynucleotides described herein is used as 
starting material polynucleotide for synthesizing all or a portion of the gene to which the described 
polynucleotide corresponds. For example, a DNA molecule encoding a full-length human polypeptide 
can be constructed using a polynucleotide described herein as starting material. 
[00108] In another embodiment, the polynucleotides of the invention are used in nucleic acid 
multimers . Nucleic acid multimers can be linear or branched polymers of the same repeating single- 
stranded oligonucleotide unit or different single-stranded oligonucleotide units. Where the molecules 
are branched, the multimers are generally described as either "fork" or "comb" structures. The 
oligonucleotide units of the multimer may be composed of RNA, DNA, modified nucleotides or 
combinations thereof. At least one of the units has a sequence, length, and composition that permits it 
to bind specifically to a first single-stranded nucleotide sequence of interest, typically analyte or an 
oligonucleotide bound to the analyte. In order to achieve such specificity and stability, this unit will 
normally be 15 to 50 nt, preferably 15 to 30 nt, in length and have a GC content in the range of 40% to 
60%. In addition to such unit(s), the multimer includes a multiplicity of units that are capable of 
hybridizing specifically and stably to a second single-stranded nucleotide of interest, typically a labeled 
oligonucleotide or another multimer. These units will also normally be 15 to 50 nt, preferably 15 to 30 
nt, in length and have a GC content in the range of 40% to 60%. When a multimer is designed to be 
hybridized to another multimer, the first and second oligonucleotide units are heterogeneous (different) 
One or more of the polynucleotides described herein, or a portion of a polynucleotide described herein, 
can be used as a repeating unit of such nucleic acid multimers. 

[00109] The total number of oligonucleotide units in the multimer will usually be in the range of 3 
to 50, more usually 10 to 20. In multimers in which the unit that hybridizes to the nucleotide sequence 
of interest ,s different from the unit that hybridizes to the labeled oligonucleotide, the number ratio of 
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the latter to the former will usually be 2:1 to 30:1, more usually 5:1 to 20:1, and-preferably 10:1 to 



15:1 



is* 
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[00110] The oligonucleotide units of the multimer may be covalently linked directly to each other 
through phosphodiester bonds or through interposed linking agents such as nucleic acid, amino acid, 
carbohydrate or polyol bridges, or through other cross-linking agents that are capable of cross-linkmg 
nucleic acid or modified nucleic acid strands. The site(s) of linkage may be at the ends of the unit (in 
either normal 3,-5' orientation or randomly oriented) and/or at one or more internal nucleotides in the 
strand In linear multimers the individual units are linked end-to-end to form a linear polymer. In one 
type of branched multimer three or more oligonucleotide units emanate from a point of origin to form a 
branched structure. The point of origin may be another oligonucleotide unit or a multifunctional 
molecule to which at least three units can be covalently bound. In another type, there is an 
oligonucleotide unit backbone with one or more pendant oligonucleotide units. These latter-type 
multimers are "fork-like", "comb-like" or combination "fork-" and "comb-like" in structure. The 
pendant units will normally depend from a modified nucleotide or other organic moiety having 
appropriate functional groups to which oligonucleotides may be conjugated or otherwise attached. The 
multimer may be totally linear, totally branched, or a combination of linear and branched portions. 
Preferably there will be at least two branch points in the multimer, more preferably at least 3, 
preferably5to 10. The multimer may include one or more segments of double-stranded sequences. 
[001 1 1] Multimeric nucleic acid molecules are useful in amplifying the signal that results from 
hybridization of one the first sequence of the multimeric molecule to a target sequence. The 
amplification is theoretically proportional to the number of iterations of the second segment. 
[001 12] Without being held to theory, forked structures of greater than about eight branches 
exhibited steric hindrance which inhibited binding of labeled probes to the multimer. On the other 
hand, comb structures exhibit little or no steric problems and are thus a preferred type of branched 
multimer. For a description of branched nucleic acid multimers of both the fork and comb types, as 
well as methods of use and synthesis, see, e.g., U.S. Pat. Nos. 5,124,246 (fork-type structures); 
5,710,264 (synthesis of comb structures); and 5,849,481. 

1001131 ^e of Polynucleotide Probes in Mapping, and in TW P™fiii„„ Polynucleotide probes, 
generally comprising at least 12 contiguous nt of a polynucleotide as shown in the Sequence Listing, 
are used for a variety of purposes, such as chromosome mapping of the polynucleotide and detection of 
transcription levels. Additional disclosure about preferred regions of the disclosed polynucleotide 
sequences is found in the Examples. A probe that hybridizes specifically to a polynucleotide disclosed 
herein should provide a detection signal at least 5-, 10-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 
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1001141 Paction of Exp ression T rvH« Nucleotide probes are used to detect expression of a gene 
corresponding to the provided polynucleotide. In Northern blots, mRNA is separated 
electrophoretically and contacted with a probe. A probe is detected as hybridizing to an mRNA species 
of a particular size. The amount of hybridization is quantitated to determine relative amounts of 
expression, for example under a particular condition. Probes are used for in situ hybridization to cells 
to detect expression. Probes can also be used in vivo for diagnostic detection of hybridizing sequences. 
Probes are typically labeled with a radioactive isotope. Other types of detectable labels can be used 
such as chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are 
described in WO92/02526 and USPN 5,124,246. 

[00115] Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting small 
amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol. (1987) 155:335; USPN 
4,683,195; and USPN 4,683,202). Two primer polynucleotides nucleotides that hybridize with the 
target nucleic acids are used to prime the reaction. The primers can be composed of sequence within or 
3' and 5' to the polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to 
these polynucleotides, they need not hybridize to them or the complements. After amplification of the 
target with a thermostable polymerase, the amplified target nucleic acids can be detected by methods 
known in the art, e.g., Southern blot. mRNA or cDNA can also be detected by traditional blotting 
techniques (e.g., Southern blot, Northern blot, etc.) described in Sambrook et al., "Molecular Cloning: 
A Laboratory Manual" (New York, Cold Spring Harbor Laboratory, 1989) (e.g., without PCR 
amplification). In general, mRNA or cDNA generated from mRNA using a polymerase enzyme can be 
purified and separated using gel electrophoresis, and transferred to a solid support, such as 

nitrocellulose. The solid support is exposed to a labeled probe, washed to remove any unhybridized 
probe, and duplexes containing the labeled probe are detected. 

[00116] Mapping. Polynucleotides of the present invention can be used to identify a chromosome 
on which the corresponding gene resides. Such mapping can be useful in identifying the function of the 
polynucleotide-related gene by its proximity to other genes with known function. Function can also be 
assigned to the polynucleotide-related gene when particular syndromes or diseases map to the same 
chromosome. For example, use of polynucleotide probes in identification and quantification of nucleic 
acid sequence aberrations is described in USPN 5,783,387. An exemplary mapping method is 
fluorescence in situ hybridization (FISH), which facilitates comparative genomic hybridization to allow 
total genome assessment of changes in relative copy number of DNA sequences (see, e.g., Valdes et 
al., Methods in Molecular Biology (1997) 68: 1). Polynucleotides can also be mapped to particular 
chromosomes using, for example, radiation hybrids or chromosome-specific hybrid panels. See Leach 
et al., Advances in Genetics, (1995) 33:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and 
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Goodfellow, Trends in Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from 
Research Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are 
available via the world wide web at sites supported by the Stanford Human Genome Center (Stanford 
University) and the Whitehead Institute for Biomedical Research/MIT Center for Genome Research 
The statistical program RHMAP can be used to construct a map based on the data from radiation 
hybridization with a measure of the relative likelihood of one order versus another. RHMAP is 
available via the world wide web at a site supported by the University of Michigan. In addition, 
commercial programs are available for identifying regions of chromosomes commonly associated with 
disease, such as cancer. 

[ ° 01171 T ^"e Typing or Profiling Expression of specific mRNA corresponding to the provided 
polynucleotides can vary in different cell types and can be tissue-specific. This variation ofmRNA 
levels in different cell types can be exploited with nucleic acid probe assays to determine tissue types. 
For example, PCR, branched DNA probe assays, or blotting techniques utilizing nucleic acid probes 
substantially identical or complementary to polynucleotides listed in the Sequence Listing can 
determine the presence or absence of the corresponding cDNA or mRNA. 
[00118] Tissue typing can be used to identify the developmental organ or tissue source of a 
metastatic lesion by identifying the expression of a particular marker of that organ or tissue. If a 
polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to express 
that polynucleotide, then the developmental source of the lesion has been identified. Expression of a 
particular polynucleotide can be assayed by detection of either the corresponding mRNA or the protein 
product. As would be readily apparent to any forensic scientist, the sequences disclosed herein are 
useful in differentiating human tissue from non-human tissue. In particular, these sequences are useful 
to differentiate human tissue from bird, reptile, and amphibian tissue, for example. 
[ ° 01191 Use of Polymorphisms . A polynucleotide of the invention can be used in forensics, 
genetic analysis, mapping, and diagnostic applications where the corresponding region of a gene is 
polymorphic in the human population. Any means for detecting a polymorphism in a gene can be used, 
including, but not limited to electrophoresis of protein polymorphic variants, differential sensitivity to ' 
restriction enzyme cleavage, and hybridization to allele-specific probes. 

[00120] Antibody Production Expression products of a polynucleotide of the invention, as well as 
the corresponding mRNA, cDNA, or complete gene, can be prepared and used for raising antibodies 
for experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a corresponding 
gene has not been assigned, this provides an additional method of identifying the corresponding gene. 
The polynucleotide or related cDNA is expressed as described above, and antibodies are prepared. 
These antibodies are specific to an epitope on the polypeptide encoded by the polynucleotide, and can 
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precipitate or bind to the corresponding native protein in a cell or tissue preparation or in a cell-free 

extract of an in vitro expression system. 

[00121, Methods for production of antibodies that specifically bind a selected antigen are well 
known mthe art. Immunogens for raising antibodies can be prepared by nixing a polypeptide encoded 
by a polynucleotide of the invention with an adjuvant, and/or by making fusion proteins whh larger 
monogenic proteins. Polypeptides can also be covalently linked to other larger immunogenic 
protems, such as keyhole limpet hemocyanin. Immunogens are typically administered intradermally 
subcutaneously, or intramuscularly to experimental animals such as rabbits, sheep, and mice to ' 
generate antibodies. Monoclonal antibodies can be generated by isolating spleen cells and fusing 
myeloma cells to form hybridomas. Alternatively, the selected polynucleotide is administered directly 
such as by mtramuscular injection, and expressed in vivo. The expressed protein generates a variety of 
protem-specific immune responses, including production of antibodies, comparable to achninistration 
of the protein. 

[00122] Preparations of polyclonal and monoclonal antibodies specific for polypeptides encoded 
by a selected polynucleotide are made using standard methods known in the art. The antibodies 
specially bind to epitopes present in the polypeptides encoded by polynucleotides disclosed in the 
Sequence Listing. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form an 
eprtope. EP*opesthatmvolvenon-^^ ^ ^ 

least 15, 25, or 50 amino acids. Antibodies that specifically bind to human polypeptides encoded by 
the provded polypeptides should provide a detection signal at least 5-, 10-, or 20-fold higher than a 
detect™ signal provided with other proteins when used in Western blots or other immunochemical 
assays. Preferably, antibodies that specifically bind polypeptides contemplated by the invention do not 
bmd to other proteins in immunochemical assays at detectable levels and can immunoprecipitate the 
specific polypeptide from solution. 

[00123] The invention also contemplates naturally occurring antibodies specific for a polypeptide 
of the mvention. For example, serum antibodies to a polypeptide of the invention in a human 
population can be purified by methods well known in the art, e.g., by passing antiserum over a column 
to winch the corresponding selected polypeptide or fusion protein is bound. The bound antibodies can 
then be eluted from the column, for example, using a buffer with a high salt concentration 
[00124] In addition to the antibodies discussed above, the invention also contemplates genetically 
engmeered antibodies antibodies (e.g., chimeric antibodies, humanized antibodies, human antibodies 
produced by a transgenic animal (e.g., a transgenic mouse such as the XenomousTM), antibody 
denvatives (e.g., single chain antibodies, antibody fragments (e.g., Fab, etc.)), accordmg to methods 
well known in the art. 
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Polynucleotides or Arrays for Diagnostics. 
[00125] Polynucleotide arrays provide a high throughput technique that can assay a large number 
of polynucleotides in a sample. This technology can be used as a diagnostic and as tool to test for 
differential expression expression, e.g., to determine function of an encoded protein. A variety of 
methods of producing arrays, as well as variations of these methods, are known in the art and 
contemplated for use in the invention. For example, arrays can be created by spotting polynucleotide 
probes onto a substrate (e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or aiTay having 
bound probes. The probes can be bound to the substrate by either covalent bonds or by non-specific 
interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably labeled 
(e.g., using radioactive or fluorescent labels) and then hybridized to the probes. Double stranded 
polynucleotides, comprising the labeled sample polynucleotides bound to probe polynucleotides, can be 
detected once the unbound portion of the sample is washed away. Alternatively, the polynucleotides of 
the test sample can be immobilized on the array, and the probes detectably labeled. Techniques for 
constructing arrays and methods of using these arrays are described in, for example, Schena et al. 
(1996) Proc Natl Acad Sci USA. 93(20): 10614-9; Schena et al. (1995) Science 270(5235):467-70; 
Shalon et al. (1996) Genome Res. 6(7):639-45, USPN 5,807,522, EP 799 897; WO 97/29212; WO 
97/27317; EP 785 280; WO 97/02357; USPN 5,593,839; USPN 5,578,832; EP 728 520; 
USPN 5,599,695; EP 721 016; USPN 5,556,752; WO 95/22058; and USPN 5,631,734. 
[00126] Arrays can be used to, for example, examine differential expression of genes and can be 
used to determine gene function. For example, arrays can be used to detect differential expression of a 
gene corresponding to a polynucleotide of the invention, where expression is compared between a test 
cell and control cell (e.g., cancer cells and normal cells). For example, high expression of a particular 
message in a cancer cell, which is not observed in a corresponding normal cell, can indicate a cancer 
specific gene product. Exemplary uses of arrays are further described in, for example, Pappalarado et 
al., Sem. Radiation Oncol. (1998) 8:217; and Ramsay Nature Biotechnol. (1998) 16:40. Furthermore, 
many variations on methods of detection using arrays are well within the skill in the art and within the 
scope of the present invention. For example, rather than immobilizing the probe to a solid support, the 
test sample can be immobilized on a solid support which is then contacted with the probe. 

Differential Expression in Diag nosis 
[00127] The polynucleotides of the invention can also be used to detect differences in expression 
levels between two cells, e.g., as a method to identify abnormal or diseased tissue in a human. For 
polynucleotides corresponding to profiles of protein families, the choice of tissue can be selected 
according to the putative biological function. In general, the expression of a gene corresponding to a 
specific polynucleotide is compared between a first tissue that is suspected of being diseased and a 
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second, normal tissue of the human. The tissue suspected of being abnormal or diseased can be 
derived from a different tissue type of the human, but preferably it is derived from the same tissue type; 
for example, an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal tissue of the 
patient, especially those that express the polynucleotide-related gene of interest (e.g., brain, thymus, 
testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, pancreas, and the mucosal 
lining of the colon). A difference between the polynucleotide-related gene, mRNA, or protein in the 
two tissues which are compared, for example, in molecular weight, amino acid or nucleotide sequence, 
or relative abundance, indicates a change in the gene, or a gene which regulates it, in the tissue of the 
human that was suspected of being diseased. Examples of detection of differential expression and its 
use in diagnosis of cancer are described in USPNs 5,688,641 and 5,677,125. 
[00128] A genetic predisposition to disease in a human can also be detected by comparing 
expression levels of an mRNA or protein corresponding to a polynucleotide of the invention in a fetal 
tissue with levels associated in normal fetal tissue. Fetal tissues that are used for this purpose include, 
but are not limited to, amniotic fluid, chorionic villi, blood, and the blastomere of an in vitro-fertilized 
embryo. The comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA 
or protein is obtained from a normal tissue of a human in which the polynucleotide-related gene is 
expressed. Differences such as alterations in the nucleotide sequence or size of the same product of the 
fetal polynucleotide-related gene or mRNA, or alterations in the molecular weight, amino acid 
sequence, or relative abundance of fetal protein, can indicate a germline mutation in the polynucleotide- 
related gene of the fetus, which indicates a genetic predisposition to disease. In general, diagnostic, 
prognostic, and other methods of the invention based on differential expression involve detection of a 
level or amount of a gene product, particularly a differentially expressed gene product, in a test sample 
obtained from a patient suspected of having or being susceptible to a disease (e.g., breast cancer, lung 
cancer, colon cancer and/or metastatic forms thereof), and comparing the detected levels to those levels 
found in normal cells (e.g., cells substantially unaffected by cancer) and/or other control cells (e.g., to 
differentiate a cancerous cell from a cell affected by dysplasia). Furthermore, the severity of the 
disease can be assessed by comparing the detected levels of a differentially expressed gene product 
with those levels detected in samples representing the levels of differentially expressed gene product 
associated with varying degrees of severity of disease. It should be noted that use of the term 
"diagnostic" herein is not necessarily meant to exclude "prognostic" or "prognosis," but rather is used 
as a matter of convenience. 

[00129] The term "differentially expressed gene" is generally intended to encompass a 
polynucleotide that can, for example, include an open reading frame encoding a gene product (e.g., a 
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polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding nucleotide sequences 
involved in the regulation of expression, up to about 20 kb beyond the coding region, but possibly 
further in either direction. The gene can be introduced into an appropriate vector for 
extrachromosomal maintenance or for integration into a host genome. In general, a difference in 
expression level associated with a decrease in expression level of at least about 25%, usually at least 
about 50% to 75%, more usually at least about 90% or more is indicative of a differentially expressed 
gene of interest, i.e., a gene that is underexpressed or down-regulated in the test sample relative to a 
control sample. Furthermore, a difference in expression level associated with an increase in expression 
of at least about 25%, usually at least about 50% to 75%, more usually at least about 90% and can be 
at least about 1 '/.-fold, usually at least about 2-fold to about 10-fold, and can be about 100-fold to 
about 1,000-fold increase relative to a control sample is indicative of a differentially expressed gene of 
interest, i.e., an overexpressed or up-regulated gene. 

[00130] "Differentially expressed polynucleotide" as used herein means a nucleic acid molecule 
(RNA or DNA) comprising a sequence that represents a differentially expressed gene, e.g., the 
differentially expressed polynucleotide comprises a sequence (e.g., an open reading frame encoding a 
gene product) that uniquely identifies a differentially expressed gene so that detection of the 
differentially expressed polynucleotide in a sample is correlated with the presence of a differentially 
expressed gene in a sample. "Differentially expressed polynucleotide" is also meant to encompass 
fragments of the disclosed polynucleotides, e.g., fragments retaining biological activity, as well as 
nucleic acids homologous, substantially similar, or substantially identical (e.g., having about 90% 
sequence identity) to the disclosed polynucleotides. 

[00131] Methods of the subject invention useful in diagnosis or prognosis typically involve 
comparison of the abundance of a selected differentially expressed gene product in a sample of interest 
with that of a control to determine any relative differences in the expression of the gene product, where 
the difference can be measured qualitatively and/or quantitatively. Quantitation can be accomplished, 
for example, by comparing the level of expression product detected in the sample with the amounts of 
product present in a standard curve. A comparison can be made visually; by using a technique such as 
densitometry, with or without computerized assistance; by preparing a representative library of cDNA 
clones of mRNA isolated from a test sample, sequencing the clones in the library to determine that 
number of cDNA clones corresponding to the same gene product, and analyzing the number of clones 
corresponding to that same gene product relative to the number of clones of the same gene product in a 
control sample; or by using an array to detect relative levels of hybridization to a selected sequence or 
set of sequences, and comparing the hybridization pattern to that of a control. The differences in 
expression are then correlated with the presence or absence of an abnormal expression pattern. A 
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variety of different methods for determining the nucleic acid abundance in a sample are known to those 
of skill in the art (see, e.g., WO 97/273 17). 

[00132] In general, diagnostic assays of the invention involve detection of a gene product of a 
polynucleotide sequence (e.g., mRNA or polypeptide) that corresponds to a sequence of SEQ ID 
NOS: 1-6010. The patient from whom the sample is obtained can be apparently healthy, susceptible to 
disease (e.g., as determined by family history or exposure to certain environmental factors), or can 
already be identified as having a condition in which altered expression of a gene product of the 
invention is implicated. 

[00133] Diagnosis can be determined based on detected gene product expression levels of a gene 
product encoded by at least one, preferably at least two or more, at least 3 or more, or at least 4 or more 
of the polynucleotides having a sequence set forth in SEQ IDNOS:1-6010, and can involve detection 
of expression of genes corresponding to all of SEQ ID NOS.1-6010 and/or additional sequences that 
can serve as additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably involves 
detection of a gene product encoded by a gene corresponding to a polynucleotide that is differentially 
expressed in cancer. Examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their relative 
expression levels provided herein, assays using such polynucleotides and detection of their expression 
levels in diagnosis and prognosis will be readily apparent to the ordinarily skilled artisan. 
[00134] Any of a variety of detectable labels can be used in connection with the various 
embodiments of the diagnostic methods of the invention. Suitable detectable labels include 
fluorochromes,(e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, 
allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7'-dimethoxy-4',5'-dichloro-6- 
carboxyfluorescein, 6-carboxy-X-rhodamine (ROX), 6-carboxy-2',4',7',4,7-h e xachlorofluorescein 
(HEX), 5-carboxyfluorescein (5-FAM) or N,N,N^N^tetramemyl-6-carboxvrhodamine (TAMRA)), 
radioactive labels, (e.g. 32P, 35S, 3H, etc.), and the like. The detectable label can involve a two stage 
systems (e.g., biotin-avidin, hapten-anti-hapten antibody, etc.). 

[00135] Reagents specific for the polynucleotides and polypeptides of the invention, such as 
antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an expression 
product in a biological sample. The kit can also contain buffers or labeling components, as well as 
instructions for using the reagents to detect and quantify expression products in the biological sample. 
Exemplary embodiments of the diagnostic methods of the invention are described below in more detail. 
[001361 Polypeptide detection in diapnosis In one embodiment, the test sample is assayed for the 
level of a differentially expressed polypeptide. Diagnosis can be accomplished using any of a number 
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of methods to determine the absence or presence or altered amounts of the differentially expressed 
polypeptide in the test sample. For example, detection can utilize staining of cells or histological 
sections with labeled antibodies, performed in accordance with conventional methods. Cells can be 
permeabilized to stain cytoplasmic molecules. In general, antibodies that specifically bind a 
differentially expressed polypeptide of the invention are added to a sample, and incubated for a period 
of time sufficient to allow binding to the epitope, usually at least about 10 minutes. The antibody can 
be detectably labeled for direct detection (e.g., using radioisotopes, enzymes, fluorescers, 
chemiluminescers, and the like), or can be used in conjunction with a second stage antibody or reagent 
to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a secondary antibody 
conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, Texas red, etc.). The absence or 
presence of antibody binding can be determined by various methods, including flow cytometry of 
dissociated cells, microscopy, radiography, scintillation counting, etc. Any suitable alternative 
methods of qualitative or quantitative detection of levels or amounts of differentially expressed 
polypeptide can be used, for example, ELISA, western blot, immunoprecipitation, radioimmunoassay, 
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[00137] mRNA detection. The diagnostic methods of the invention can also or alternatively 
involve detection of mRNA encoded by a gene corresponding to a differentially expressed 
polynucleotide of the invention. Any suitable qualitative or quantitative methods known in the art for 
detecting specific mRNAs can be used. mRNA can be detected by, for example, in situ hybridization 
in tissue sections, by reverse transcriptase-PCR, or in Northern blots containing poly A+ mRNA. One 
of skill in the art can readily use these methods to determine differences in the size or amount of 
mRNA transcripts between two samples. mRNA expression levels in a sample can also be determined 
by generation of a library of expressed sequence tags (ESTs) from the sample, where the EST library is 
representative of sequences present in the sample (Adams, et al., (1991) Science 252:1651). 
Enumeration of the relative representation of ESTs within the library can be used to approximate the 
relative representation of the gene transcript within the starting sample. The results of EST analysis of 
a test sample can then be compared to EST analysis of a reference sample to determine the relative 
expression levels of a selected polynucleotide, particularly a polynucleotide corresponding to one or 
more of the differentially expressed genes described herein. Alternatively, gene expression in a test 
sample can be performed using serial analysis of gene expression (SAGE) methodology (e.g., 
Velculescu et al., Science (1995) 270:484) or differential display (DD) methodology (see, e.g., USPN 
5,776,683 and USPN 5,807,680). 

[00138] Alternatively, gene expression can be analyzed using hybridization analysis. 
Oligonucleotides or cDNA can be used to selectively identify or capture DNA or RNA of specific 
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sequence composition, and the amount of RNA or cDNA hybridized to a known capture sequence 
determined qualitatively or quantitatively, to provide information about the relative representation of a 
particular message within the pool of cellular messages in a sample. Hybridization analysis can be 
designed to allow for concurrent screening of the relative expression of hundreds to thousands of genes 
by using, for example, array-based technologies having high density formats, including filters, 
microscope slides, or microchips, or solution-based technologies that use spectroscopic analysis (e.g., 
mass spectrometry). One exemplary use of arrays in the diagnostic methods of the invention is 
described below in more detail 

[00139] Use of a single gene in diagnostic applications. The diagnostic methods of the invention 
can focus on the expression of a single differentially expressed gene. For example, the diagnostic 
method can involve detecting a differentially expressed gene, or a polymorphism of such a gene (e.g., a 
polymorphism in a coding region or control region), that is associated with disease. Disease-associated 
polymorphisms can include deletion or truncation of the gene, mutations that alter expression level 
and/or affect activity of the encoded protein, etc. 

[00140] A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence, e.g. a disease associated polymorphism. Where large amounts of DNA are 
available, genomic DNA is used directly. Alternatively, the region of interest is cloned into a suitable 
vector and grown in sufficient quantity for analysis. Cells that express a differentially expressed gene 
can be used as a source of mRNA, which can be assayed directly or reverse transcribed into cDNA for 
analysis. The nucleic acid can be amplified by conventional techniques, such as the polymerase chain 
reaction (PCR), to provide sufficient amounts for analysis, and a detectable label can be included in the 
amplification reaction (e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to 
facilitate detection. Alternatively, various methods are also known in the art that utilize oligonucleotide 
ligation as a means of detecting polymorphisms, see, e.g., Riley et al., Nucl. Acids Res. (1990) 
18:2887; and Delahunty et al., Am. J. Hum. Genet. (1996) 58: 1239. 

[00141] The amplified or cloned sample nucleic acid can be analyzed by one of a number of 
methods known in the art. The nucleic acid can be sequenced by dideoxy or other methods, and the 
sequence of bases compared to a selected sequence, e.g., to a wild-type sequence. Hybridization with 
the polymorphic or variant sequence can also be used to determine its presence in a sample (e.g., by 
Southern blot, dot blot, etc.). The hybridization pattern of a polymorphic or variant sequence and a 
control sequence to an array of oligonucleotide probes immobilized on a solid support, as described in 
US 5,445,934, or in WO 95/35505, can also be used as a means of identifying polymorphic or variant 
sequences associated with disease. Single strand conformational polymorphism (SSCP) analysis, 
denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices are used to 
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detect conformational changes created by DNA sequence variation as alterations in electrophoretic 
mobility. Alternatively, where a polymorphism creates or destroys a recognition site for a restriction 
endonuclease, the sample is digested with that endonuclease, and the products size fractionated to 
determine whether the fragment was digested. Fractionation is performed by gel or capillary 
electrophoresis, particularly acrylamide or agarose gels. 

[00142] Screening for mutations in a gene can be based on the functional or antig enic 
characteristics of the protein. Protein truncation assays are useful in detecting deletions that can affect 
the biological activity of the protein. Various immunoassays designed to detect polymorphisms in 
proteins can be used in screening. Where many diverse genetic mutations lead to a particular disease 
phenotype, functional protein assays have proven to be effective screening tools. The activity of the 
encoded protein can be determined by comparison with the wild-type protein. 

Diagnosis, Prognosis. Ass essment of Therapy (Therametrics). and Management of Cancer 
[00143] The polynucleotides of the invention, as well as their gene products, are of particular 
interest as genetic or biochemical markers (e.g., in blood or tissues) that will detect the earliest changes 
along the carcinogenesis pathway and/or to monitor the efficacy of various therapies and preventive 
interventions. For example, the level of expression of certain polynucleotides can be indicative of a 
poorer prognosis, and therefore warrant more aggressive chemo- or radio-therapy for a patient or vice 
versa. The correlation of novel surrogate tumor specific features with response to treatment and 
outcome in patients can define prognostic indicators that allow the design of tailored therapy based on 
the molecular profile of the tumor. These therapies include antibody targeting, antagonists (e.g., small 
molecules), and gene therapy. Determining expression of certain polynucleotides and comparison of a 
patient's profile with known expression in normal tissue and variants of the disease allows a 
determination of the best possible treatment for a patient, both in terms of specificity of treatment and 
in terms of comfort level of the patient. Surrogate tumor markers, such as polynucleotide expression, 
can also be used to better classify, and thus diagnose and treat, different forms and disease states of 
cancer. Two classifications widely used in oncology that can benefit from identification of the 
expression levels of the genes corresponding to the polynucleotides of the invention are staging of the 
cancerous disorder, and grading the nature of the cancerous tissue. 

[00144] The polynucleotides that correspond to differentially expressed genes, as well as then- 
encoded gene products, can be useful to monitor patients having or susceptible to cancer to detect 
potentially malignant events at a molecular level before they are detectable at a gross morphological 
level. In addition, the polynucleotides of the invention, as well as the genes corresponding to such 
polynucleotides, can be useful as therametrics, e.g., to assess the effectiveness of therapy by using the 
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polynucleotides or their encoded gene products, to assess, for example, tumor burden in the patient 
before, during, and after therapy. 

[00145] Furthermore, a polynucleotide identified as corresponding to a gene that is differentially 
expressed in, and thus is important for, one type of cancer can also have implications for development 
or risk of development of other types of cancer, e.g., where a polynucleotide represents a gene 
differentially expressed across various cancer types. Thus, for example, expression of a polynucleotide 
corresponding to a gene that has clinical implications for metastatic colon cancer can also have clinical 
implications for stomach cancer or endometrial cancer. 

[00146] Staging. Staging is a process used by physicians to describe how advanced the cancerous 
state is in a patient. Staging assists the physician in determining a prognosis, planning treatment and 
evaluating the results of such treatment. Staging systems vary with the types of cancer, but generally 
involve the following "TNM" system: the type of tumor, indicated by T; whether the cancer has 
metastasized to nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more 
distant parts of the body, indicated by M. Generally, if a cancer is only detectable in the area of the 
primary lesion without having spread to any lymph nodes it is called Stage I. If it has spread only to 
the closest lymph nodes, it is called Stage II. In Stage III, the cancer has generally spread to the lymph 
nodes in near proximity to the site of the primary lesion. Cancers that have spread to a distant part of 
the body, such as the liver, bone, brain or other site, are Stage IV, the most advanced stage. 
[00147] The polynucleotides of the invention can facilitate fine-tuning of the staging process by 
identifying markers for the aggresivity of a cancer, e.g., the metastatic potential, as well as the presence 
in different areas of the body. Thus, a Stage II cancer with a polynucleotide signifying a high 
metastatic potential cancer can be used to change a borderline Stage II tumor to a Stage III tumor, 
justifying more aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

[° 0148 1 Grading of cancers. Grade is a term used to describe how closely a tumor resembles 
normal tissue of its same type. The microscopic appearance of a tumor is used to identify tumor grade 
based on parameters such as cell morphology, cellular organization, and other markers of 
differentiation. As a general rule, the grade of a tumor corresponds to its rate of growth or 
aggressiveness, with undifferentiated or high-grade tumors being more aggressive than well- 
differentiated or low-grade tumors. The following guidelines are generally used for grading tumors: 1) 
GX Grade cannot be assessed; 2) Gl Well differentiated; 3) G2 Moderately well differentiated; 4) G3 
Poorly differentiated; 5) G4 Undifferentiated. The polynucleotides of the invention can be especially 
valuable in detennining the grade of the tumor, as they not only can aid in detennining the 
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differentiation status of the cells of a tumor, they can also identify factors other than differentiation that 
are valuable in determining the aggressiveness of a tumor, such as metastatic potential. 
[0 ° 1491 Detection of colon cancer The polynucleotides corresponding to genes that exhibit the 
appropriate expression pattern can be used to detect colon cancer in a subject. Colorectal cancer is one 
of the most common neoplasms in humans and perhaps the most frequent form of hereditary neoplasia. 
Prevention and early detection are key factors in controlling and curing colorectal cancer. Colorectal 
cancer begins as polyps, which are small, benign growths of cells that form on the inner lining of the 
colon. Over a period of several years, some of these polyps accumulate additional mutations and 
become cancerous. Multiple familial colorectal cancer disorders have been identified, which are 
summarized as follows: 1) Familial adenomatous polyposis (FAP); 2) Gardner's syndrome; 3) 
Hereditary nonpolyposis colon cancer (HNPCC); and 4) Familial colorectal cancer in Ashkenazi Jews. 
The expression of appropriate polynucleotides of the invention can be used in the diagnosis, prognosis 
and management of colorectal cancer. Detection of colon cancer can be determined using expression 
levels of any of these sequences alone or in combination with the levels of expression. Determination 
of the aggressive nature and/or the metastatic potential of a colon cancer can be determined by 
comparing levels of one or more polynucleotides of the invention and comparing total levels of another 
sequence known to vary in cancerous tissue, e.g., expression of p53, DCC ras, lor FAP (see, e.g., 
Fearon ER, et al., Cell (1990) 61(5):759; Hamilton SR et al., Cancer (1993) 72:957; Bodmlr W, et al., 
Nat Genet. (1994) 4(3):217; Fearon ER, Ann N Y Acad Sci. (1995) 768: 101). For example, 
development of colon cancer can be detected by examining the ratio of any of the polynucleotides of 
the invention to the levels of oncogenes (e.g., ras) or tumor suppressor genes (e.g., FAP or p53). Thus, 
expression of specific marker polynucleotides can be used to discriminate between normal and 
cancerous colon tissue, to discriminate between colon cancers with different cells of origin, to 
discriminate between colon cancers with different potential metastatic rates, etc. For a review of 
markers of cancer, see, e.g., Hanahan et al. (2000) Cell 100:57-70. 

[00150] Detection of prostate cancer The polynucleotides and their corresponding genes and gene 
products exhibiting the appropriate differential expression pattern can be used to detect prostate cancer 
in a subject. Over 95% of primary prostate cancers are adenocarcinomas. Signs and symptoms may 
include: frequent urination, especially at night; inability to urinate; trouble starting or holding back 
urination; a weak or interrupted urine flow; and frequent pain or stiffness in the lower back, hips or 
upper thighs. 

[00151] Many of the signs and symptoms of prostate cancer can be caused by a variety of other 
non-cancerous conditions. For example, one common cause of many of these signs and symptoms is a 
condition called benign prostatic hypertrophy, or BPH. In BPH, the prostate gets bigger and may block 
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the flow of urine or interfere with sexual function. The methods and compositions of the invention can 
be used to distinguish between prostate cancer and such non-cancerous conditions. The methods of the 
invention can be used in conjunction with conventional methods of diagnosis, e.g., digital rectal exam 
and/or detection of the level of prostate specific antigen (PSA), a substance produced and secreted by 
the prostate. 

[00152] Detection of breast cancer. The majority of breast cancers are adenocarcinoma subtypes, 
which can be summarized as follows: 1) ductal carcinoma in situ (DCIS), including comedocarcinoma; 
2) infiltrating (or invasive) ductal carcinoma (IDC); 3) lobular carcinoma in situ (LCIS); 4) infiltrating 
(or invasive) lobular carcinoma (ILC); 5) inflammatory breast cancer; 6) medullary carcinoma; 7) 
mucinous carcinoma; 8) Pagefs disease of the nipple; 9) Phyllodes tumor; and 10) tubular carcinoma; 
[00153] The expression of polynucleotides of the invention can be used in the diagnosis and 
management of breast cancer, as well as to distinguish between types of breast cancer. Detection of 
breast cancer can be determined using expression levels of any of the appropriate polynucleotides of 
the invention, either alone or in combination. Determination of the aggressive nature and/or the 
metastatic potential of a breast cancer can also be determined by comparing levels of one or more 
polynucleotides of the invention and comparing levels of another sequence known to vary in cancerous 
tissue, e.g., ER expression. In addition, development of breast cancer can be detected by examining the 
ratio of expression of a differentially expressed polynucleotide to the levels of steroid hormones (e.g., 
testosterone or estrogen) or to other hormones (e.g., growth hormone, insulin). Thus, expression of 
specific marker polynucleotides can be used to discriminate between normal and cancerous breast 
tissue, to discriminate between breast cancers with different cells of origin, to discriminate between 
breast cancers with different potential metastatic rates, etc. 

[00154] Detection of lung cancer. The polynucleotides of the invention can be used to detect lung 
cancer in a subject. Although there are more than a dozen different kinds of lung cancer, the two main 
types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all lung cancer 
cases. Small cell carcinoma (also called oat cell carcinoma) usually starts in one of the larger bronchial 
tubes, grows fairly rapidly, and is likely to be large by the time of diagnosis. Nonsmall cell lung cancer 
(NSCLC) is made up of three general subtypes of lung cancer. Epidermoid carcinoma (also called 
squamous cell carcinoma) usually starts in one of the larger bronchial tubes and grows relatively 
slowly. The size of these tumors can range from very small to quite large. Adenocarcinoma starts 
growing near the outside surface of the lung and can vary in both size and growth rate. Some slowly 
growing adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the 
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed. Other less 
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common forms of lung cancer are carcinoid, cylindroma, mucoepidermoid, and malignant 
mesothelioma. 

[00155] The polynucleotides of the invention, e.g., polynucleotides differentially expressed in 
normal cells versus cancerous lung cells (e.g., tumor cells of high or low metastatic potential) or 
between types of cancerous lung cells (e.g., high metastatic versus low metastatic), can be used to 
distinguish types of lung cancer as well as identifying traits specific to a certain patient's cancer and 
selecting an appropriate therapy. For example, if the patient's biopsy expresses a polynucleotide that 
is associated with a low metastatic potential, it may justify leaving a larger portion of the patient's lung 
in surgery to remove the lesion. Alternatively, a smaller lesion with expression of a polynucleotide that 
is associated with high metastatic potential may justify a more radical removal of lung tissue and/or the 
surrounding lymph nodes, even if no metastasis can be identified through pathological examination. 

Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 
[00156] Polypeptides encoded by the instant polynucleotides and corresponding full-length genes 
can be used to screen peptide libraries to identify binding partners, such as receptors, from among the 
encoded polypeptides. Peptide libraries can be synthesized according to methods known in the art (see, 
e.g., USPN 5,010,175 , and WO 91/17823). Agonists or antagonists of the polypeptides of the 
invention can be screened using any available method known in the art, such as signal transduction, 
antibody binding, receptor binding, mitogenic assays, chemotaxis assays, etc. The assay conditions 
ideally should resemble the conditions under which the native activity is exhibited in vivo, that is, under 
physiologic pH, temperature, and ionic strength. Suitable agonists or antagonists will exhibit strong 
inhibition or enhancement of the native activity at concentrations that do not cause toxic side effects in 
the subject. Agonists or antagonists that compete for binding to the native polypeptide can require 
concentrations equal to or greater than the native concentration, while inhibitors capable of binding 
irreversibly to the polypeptide can be added in concentrations on the order of the native concentration. 
[00157] Such screening and experimentation can lead to identification of a novel polypeptide 
binding partner, such as a receptor, encoded by a gene or a cDNA corresponding to a polynucleotide of 
the invention, and at least one peptide agonist or antagonist of the novel binding partner. Such 
agonists and antagonists can be used to modulate, enhance, or inhibit receptor function in cells to 
which the receptor is native, or in cells that possess the receptor as a result of genetic engineering. 
Further, if the novel receptor shares biologically important characteristics with a known receptor, 
information about agonist/antagonist binding can facilitate development of improved 
agonists/antagonists of the known receptor. 
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Pharmaceutical Compositions and Uses 
[00158] Pharmaceutical compositions can comprise polypeptides, receptors that specifically bind a 
polypeptide produced by a differentially expressed gene (e.g., antibodies, or polynucleotides (including 
antisense nucleotides and ribozymes) of the claimed invention in a therapeutically effective amount. 
The compositions can be used to treat primary tumors as well as metastases of primary tumors. In 
addition, the pharmaceutical compositions can be used in conjunction with conventional methods of 
cancer treatment, e.g., to sensitize tumors to radiation or conventional chemotherapy. 
[00159] Where the pharmaceutical composition comprises a receptor (such as an antibody) that 
specifically binds to a gene product encoded by a differentially expressed gene, the receptor can be 
coupled to a drug for delivery to a treatment site or coupled to a detectable label to facilitate imaging of 
a site comprising colon cancer cells. Methods for coupling antibodies to drugs and detectable labels 
are well known in the art, as are methods for imaging using detectable labels. 
[00160] The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, chemical 
markers or antigen levels. Therapeutic effects also include reduction in physical symptoms, such as 
decreased body temperature. The precise effective amount for a subject will depend upon the subject's 
size and health, the nature and extent of the condition, and the therapeutics or combination of 
therapeutics selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation is determined by routine experimentation 
and is within the judgment of the clinician. For purposes of the present invention, an effective dose 
will generally be from about 0.0 1 mg/kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA 
constructs in the individual to which it is administered. 

[00161] A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. The 
term "pharmaceutically acceptable carrier" refers to a carrier for administration of a therapeutic agent, 
such as antibodies or a polypeptide, genes, and other therapeutic agents. The term refers to any 
pharmaceutical carrier that does not itself induce the production of antibodies harmful to the individual 
receiving the composition, and which can be administered without undue toxicity. Suitable carriers can 
be large, slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 
polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus particles. Such 
carriers are well known to those of ordinary skill in the art. Pharmaceutically acceptable carriers in 
therapeutic compositions can include liquids such as water, saline, glycerol and ethanol. Auxiliary 
substances, such as wetting or emulsifying agents, pH buffering substances, and the like, can also be 
present in such vehicles. Typically, the therapeutic compositions are prepared as injectables, either as 
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liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles 
prior to injection can also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. Pharmaceutically acceptable salts can also be present in the 
pharmaceutical composition, e.g., mineral acid salts such as hydrochlorides, hydrobromides, 
phosphates, sulfates, and the like; and the salts of organic acids such as acetates, propionates, 
malonates, benzoates, and the like. A thorough discussion of pharmaceutically acceptable excipients is 
available in Remington's Pharmaceutical Sciences (Mack Pub. Co., NJ. 1991). 
Delivery Methods 

[00162] Once formulated, the compositions of the invention can be (1) administered directly to the 
subject (e.g., as polynucleotide or polypeptides); or (2) delivered ex vivo, to cells derived from the 
subject (e.g., as in ex vivo gene therapy). Direct delivery of the compositions will generally be 
accomplished by parenteral injection, e.g., subcutaneously, intraperitoneally, intravenously or 
intramuscularly, intratumorally or to the interstitial space of a tissue. Other modes of administration 
include oral and pulmonary administration, suppositories, and transdermal applications, needles, and 
gene guns or hyposprays. Dosage treatment can be a single dose schedule or a multiple dose schedule. 
[00163] Methods for the ex vivo delivery and reimplantation of transformed cells into a subject are 
known in the art and described in, e.g., WO 93/14778. Examples of cells useful in ex vivo applications 
include, for example, stem cells, particularly hematopoetic, lymph cells, macrophages, dendritic cells, 
or tumor cells. Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by, for example, dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei, all well known in the 
art. 

[00164] Once differential expression of a gene corresponding to a polynucleotide of the invention 
has been found to correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, 
the disorder can be amenable to treatment by administration of a therapeutic agent based on the 
provided polynucleotide, corresponding polypeptide or other corresponding molecule (e.g., antisense, 
ribozyme, etc.). In other embodiments, the disorder can be amenable to treatment by administration of 
a small molecule drug that, for example, serves as an inhibitor (antagonist) of the function of the 
encoded gene product of a gene having increased expression in cancerous cells relative to normal cells 
or as an agonist for gene products that are decreased in expression in cancerous cells (e.g., to promote 
the activity of gene products that act as tumor suppressors). 

[00165] The dose and the means of administration of the inventive pharmaceutical compositions 
are determined based on the specific qualities of the therapeutic composition, the condition, age, and 
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weight of the patient, the progression of the disease, and other relevant factors. For example, 
administration of polynucleotide therapeutic composition agents of the invention includes local or 
systemic administration, including injection, oral administration, particle gun or catheterized 
administration, and topical administration. Preferably, the therapeutic polynucleotide composition 
contains an expression construct comprising a promoter operably linked to a polynucleotide of at least 
12, 22, 25, 30, or 35 contiguous nt of the polynucleotide of the invention. Various methods can be used 
to administer the therapeutic composition directly to a specific site in the body. For example, a small 
metastatic lesion is located and the therapeutic composition injected several times in several different 
locations within the body of tumor. Alternatively, arteries that serve a tumor are identified, and the 
therapeutic composition injected into such an artery, in order to deliver the composition directly into 
the tumor. A tumor that has a necrotic center is aspirated and the composition injected directly into the 
now empty center of the tumor. The antisense composition is directly administered to the surface of 
the tumor, for example, by topical application of the composition. X-ray imaging is used to assist in 
certain of the above delivery methods. 

[00166] Targeted delivery of therapeutic compositions containing an antisense polynucleotide, 
subgenomic polynucleotides, or antibodies to specific tissues can also be used. Receptor-mediated 
DNA delivery techniques are described in, for example, Findeis et al., Trends BiotechnoL (1993) 
1 1:202; Chiou et al, Gene Therapeutics: Methods And Applications Of Direct Gene Transfer (J. A. 
Wolff, ed.) (1994); Wu et al., J. Biol. Chem. (1988) 263:621; Wu et al, J. Biol. Chem. (1994) 
269:542; Zenke et al., Proc. Natl. Acad. Sci. (USA) (1990) 87:3655; Wu et al., J. Biol. Chem. (1991) 
266:338. Therapeutic compositions containing a polynucleotide are administered in a range of about 
100 ng to about 200 mg of DNA for local administration in a gene therapy protocol. Concentration 
ranges of about 500 ng to about 50 mg, about 1 micrograms to about 2 mg, about 5 micrograms to 
about 500 micrograms, and about 20 micrograms to about 100 micrograms of DNA can also be used 
during a gene therapy protocol. Factors such as method of action (e.g., for enhancing or inhibiting 
levels of the encoded gene product) and efficacy of transformation and expression are considerations 
which will affect the dosage required for ultimate efficacy of the antisense subgenomic polynucleotides. 
Where greater expression is desired over a larger area of tissue, larger amounts of antisense 
subgenomic polynucleotides or the same amounts readministered in a successive protocol of 
administrations, or several administrations to different adjacent or close tissue portions of, for 
example, a tumor site, may be required to effect a positive therapeutic outcome. In all cases, routine 
experimentation in clinical trials will determine specific ranges for optimal therapeutic effect. For 
polynucleotide related genes encoding polypeptides or proteins with anti-inflammatory activity, 
suitable use, doses, and administration are described in USPN 5,654,173. 
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[00167] The therapeutic polynucleotides and polypeptides of the present invention can be delivered 
using gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral origin (see 
generally, Jolly, Cancer Gene Therapy (1994) 1:51; Kimura, Human Gene Therapy (1994) 5:845; 
Connelly, Human Gene Therapy (1995) 1:185; and Kaplitt, Nature Genetics (1994) 6:148). 
Expression of such coding sequences can be induced using endogenous mammalian or heterologous 
promoters. Expression of the coding sequence can be either constitutive or regulated. 
[00168] Viral-based vectors for delivery of a desired polynucleotide and expression in a desired 
cell are well known in the art. Exemplary viral-based vehicles include, but are not limited to, 
recombinant retroviruses (see, e.g., WO 90/07936; WO 94/03622; WO 93/25698; WO 93/25234; 
USPN 5, 219,740; WO 93/1 1230; WO 93/10218; USPN 4,777,127; GB Patent No. 2,200,651; EP 0 
345 242; and WO 91/02805), alphavirus-based vectors (e.g., Sindbis virus vectors, Semliki forest 
virus (ATCC VR-67; ATCC VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and 
Venezuelan equine encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR 1249; ATCC VR- 
532), and adeno-associated virus (AAV) vectors (see, e.g., WO 94/12649, WO 93/03769; WO 
93/19191; WO 94/28938; WO 95/1 1984 and WO 95/00655). Administration of DNA linked to killed 
adenovirus, as described in Curiel, Hum. Gene Then (1992) 3:147, can also be employed. 
[00169] Non-viral delivery vehicles and methods can also be employed, including, but not limited 
to, polycationic condensed DNA linked or unlinked to killed adenovirus alone (see, e.g., Curiel, Hum. 
Gene Ther. (1992) 3:147); ligand-linked DNA (see, e.g., Wu, J. Biol. Chem. (1989) 264:16985); 
eukaryotic cell delivery vehicles cells (see, e.g., USPN 5,814,482; WO 95/07994; WO 96/17072; 
WO 95/30763; and WO 97/42338) and nucleic charge neutralization or fusion with cell membranes. 
Naked DNA can also be employed. Exemplary naked DNA introduction methods are described in WO 
90/1 1092 and USPN 5,580,859. Liposomes that can act as gene delivery vehicles are described in 
USPN 5,422,120; WO 95/13796; WO 94/23697; WO 91/14445; and EP 0524968. Additional 
approaches are described in Philip, Mol. Cell Biol. (1994) 14:241 1, and in Woffendin, Proc. Natl. 
Acad. Sci. (1994)91:1581 

[00170] Further non-viral delivery suitable for use includes mechanical delivery systems such as 
the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA (1994) 9 1 (24): 1 158 1. 
Moreover, the coding sequence and the product of expression of such can be delivered through 
deposition of photopolymerized hydrogel materials or use of ionizing radiation (see, e.g., USPN 
5,206,152 and WO 92/1 1033). Other conventional methods for gene delivery that can be used for 
delivery of the coding sequence include, for example, use of hand-held gene transfer particle gun (see, 
e.g., USPN 5,149,655); use of ionizing radiation for activating transferred gene (see, e.g., USPN 
5,206,152 and WO 92/1 1033). 
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[00171] The present invention will now be illustrated by reference to the following examples which 
set forth particularly advantageous embodiments. However, it should be noted that these embodiments 
are illustrative and are not to be construed as restricting the invention in any way. 

EXAMPLES 

[00172] The following examples are put forth so as to provide those of ordinary skill in the art with 
a complete disclosure and description of how to make and use the present invention, and are not 
intended to limit the scope of what the inventors regard as their invention nor are they intended to 
represent that the experiments below are all or the only experiments performed. It will be readily 
apparent to those skilled in the art that the formulations, dosages, methods of administration, and other 
parameters of this invention may be further modified or substituted in various ways without departing 
from the spirit and scope of the invention. Efforts have been made to ensure accuracy with respect to 
numbers used (e.g. amounts, temperature, etc.) but some experimental errors and deviations should be 
accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight 
average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric. 

Examp le l:Source of Biological Materials and Overview of Novel Polynucleotides Expressed 
by the Biological Materials 

[00173] Candidate polynucleotides that may represent novel polynucleotides were obtained from 
cDNA libraries generated from selected cell lines and patient tissues. In order to obtain the candidate 
polynucleotides, mRNA was isolated from several selected cell lines and patient tissues, and used to 
construct cDNA libraries. The cells and tissues that served as sources for these cDNA libraries are 
summarized in Table 1 below. 

[00174] Human colon cancer cell line Kml2L4-A (Morikawa, et al., Cancer Research (1988) 
48:6863) is derived from the KM12C cell line. The KM12C cell line (Morikawa et al Cancer Res. 
(1988) 48:1943-1948), which is poorly metastatic (low metastatic) was established in culture from a 
Dukes' stage B2 surgical specimen (Morikawa et al. Cancer Res. (1988) 48:6863). The KM12L4-A is 
a highly metastatic subline derived from KM12C (Yeatman et al. Nucl Acids. Res. (1995) 23:4007; 
Bao-Ling et al. Proc. Annu. Meet. Am. Assoc. Cancer. Res. (1995) 21:3269). The KM12C and 
KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are well-recognized in the art as a model 
cell line for the study of colon cancer (see, e.g., Moriakawa et al, supra; Radinsky et al. Clin. Cancer 
Res. (1995) 1:19; Yeatman et al., (1995) supra; Yeatman et al. Clin. Exp. Metastasis (1996) 14:246). 
[00175] The MDA-MB-23 1 cell line (Brinkley et al. Cancer Res. (1980) 40:3 1 18-3 129) was 
originally isolated from pleural effusions (Cailleau, J. Natl Cancer. Inst. (1974) 53:661), is of high 
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metastatic potential, and forms poorly differentiated adenocarcinoma grade II in nude mice consistent 
with breast carcinoma. The MCF7 cell line was derived from a pleural effusion of a breast 
adenocarcinoma and is non-metastatic. The MV-522 cell line is derived from a human lung carcinoma 
and is of high metastatic potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell 
line; the MV-522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art 
as models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al., Cancer Res. 
(1979) 39.870 (MDA-MB-23 1 and MCF-7); Gastpar et al., J Med Chem (1998) 41:4965 (MDA-MB- 
23 1 and MCF-7); Ranson et al., Br J Cancer (1998) 77: 1586 (MDA-MB-23 1 and MCF-7); Kuang et 
al., Nucleic Acids Res (1998) 26: 1 1 16 (MDA-MB-23 1 and MCF-7); Varki et al., Int J Cancer (1987) 
40:46 (UCP-3); Varki et al., Tumour Biol. (1990) 1 1:327; (MV-522 and UCP-3); Varki et al., 
Anticancer Res. (1990) 10:637; (MV-522); Kelner et al., Anticancer Res (1995) 15:867 (MV-522); 
and Zhang et al., Anticancer Drugs (1997) 8:696 (MV522)). 

[00176] The samples of libraries 15-20 are derived from two different patients (UC#2, and UC#3). 

The bFGF-treated HMVEC were prepared by incubation with bFGF at lOng/ml for 2 hrs; the VEGF- 
treated HMVEC were prepared by incubation with 20ng/ml VEGF for 2 hrs. Following incubation 
with the respective growth factor, the cells were washed and lysis buffer added for RNA preparation. 
The GRRpz and WOca cell lines were provided by Dr. Donna M. Peehl, Department of Medicine, 
Stanford University School of Medicine. GRRpz was derived from normal prostate epithelium. The 
WOca cell line is a Gleason Grade 4 cell line. 

Table 1. Description of cDNA Libraries 



Library 
(Iib#) 


Description 


Number of 
Clones in 
Library 


0 


Artificial library composed of deselected clones (clones with no 
associated variant or cluster) 


673 


1 


Human Colon Cell Line Kml2 L4: High Metastatic Potential (derived 
from Kml2C) 


308731 


2 


Human Colon Cell Line Kml2C: Low Metastatic Potential 


284771 


3 


Human Breast Cancer Cell Lme MDA-MB-23 1 : High Metastatic 
Potential; micro-mets in lung 


326937 


4 


Human Breast Cancer Cell Line MCF7: Non Metastatic 


318979 


8 


Human Lung Cancer Cell Line MV-522: High Metastatic Potential 


223620 


9 


Human Lung Cancer Cell Line UCP-3: Low Metastatic Potential 


312503 


12 


Human microvascular endothelial cells (HMEC) - UNTREATED 
(PCR (OligodT) cDNA library) 


41938 


13 


Human microvascular endothelial cells (HMEC) - bFGF TREATED 
(PCR (OligodT) cDNA library) 


42100 


14 


Human microvascular endothelial cells (HMEC) - VEGF TREATED 
(PCR (OligodT) cDNA library) 


42825 


15 


Normal Colon - UC#2 Patient (MICRODISSECTED PCR (OligodT) 
dDNA library) 


282722 
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Library 

ytinj try 


Description 


Number of 
Clones in 
Library 


16 


coiuii i uinor - UC#2 Patient (MICRODISSECTED PCR (OlieodTI 
cDNA library) 


298831 


17 


uver Metastasis Irom Colon Tumor of UC#2 Patient 
(MICRODISSECTED PCR (OligodT) cDNA library) 


303467 


18 


[Normal ^oion - uutm Fatient (MICRODISSECTED PCR (OligodT) 

t/J-/lN J rv HDraTyJ 


36216 


19 


^oion lumor - u^ff;) ratient (.MICRODISSECTED PCR fOlieodT) 
cDNA library) V S ' 


41388 


20 


i^iver metastasis irom Colon Tumor of UC#3 Patient 
(MICRODISSECTED PCR (OligodT) cDNA library) 


30956 


21 


GKRpz Cells derived from normal prostate epithelium 


164801 


22 


wuca ^ens derived trom Oleason Grade 4 prostate cancer epithelium 


162088 


23 


Normal Lung Epithelium of Patient #1006 (MICRODISSECTED PCR 
) cl-mn/\ iiurary ) 


306198 


24 


rrimdiy tumor, i^arge cell Carcmoma of Patient #1006 

'MICRODTS^FPTFn PPR (fW\nrxAT\ ^n\TA ru \ 
^vn^ivui^iooEL i ruiv (Uiigod 1 ) cDNA library) 


309349 


25 


nuailfll lubittic npiuienum irom Patient lr 97-268 1 1 


279444 


26 


Prostate Cancer Epithelium Gleason 3+3 Patient IF97-2681 1 




27 


formal Breast Epithelium from Patient 5 15 


239494 


28 


Primary Breast tumor from Patient 515 


259960 


29 ] 


-ymph node metastasis from Patient 515 


326786 


30 I 


formal Prostate Epithelium from Chiron Patient ID 884 


298431 


31 


>rostate Cancer Epithelium (Gleason 4+4) from Chiron Patient ID 884 


331941 



Characteris ation of sequences in the libraries 
[00177] After using the software program Phred (ver 0.000925 c, Green and Weing,, ©1993- 
2000) to select those polynucleotides having the best quality sequence, the polyncueltoides were 
compared against the public databases to identify any homolgous sequences. The sequences of the 
isolated polynucleotides were first masked to eliminate low complexity sequences using the BLASTX 
masking program (Claverie "Effective Large-Scale Sequence Similarity Searches," In: Computer 
Methods for Macromolecular Sequence Analysis, Doolittle, ed., Meth. Enzymol. 266:212-227 
Academic Press, NY, NY (1996); see particularly Claverie, in "Automated DNA Sequencing and 
Analysis Techniques" Adams et al., eds., Chap. 36, p. 267 Academic Press, San Diego, 1994 and 
Claverie et al. Comput. Chem. (1993) 17: 191 ). Generally, masking does not influence the final search 
results, except to eliminate sequences of relatively little interest due to their low complexity, and to 
eliminate multiple "hits" based on similarity to repetitive regions common to multiple sequences, e.g., 
Alu repeats. The remaining sequences were then used in a BLASTN vs. GenBank search; sequences 
that exhibited greater than 70% overlap, 99% identity, and a p value of less than 1 x 10e-40 were 
discarded. Sequences from this search also were discarded if the inclusive parameters were met, but 
the sequence was ribosomal or vector-derived. 
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[00178] The resulting sequences from the previous search were classified into three groups (1, 2 
and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database search: (1) ' 
unknown (no hits in the GenBank search), (2) weak similarity (greater than 45% identity and p value of 
less than 1 x 10e-5), and (3) high similarity (greater than 60% overlap, greater than 80% identity, and 
p value less than 1 x 10e-5). Sequences having greater than 70% overlap, greater than 99% identity, 
and p value of less than 1 x 10e-40 were discarded. 

[00179] The remaining sequences were classified as unknown (no hits), weak similarity, and high 
similarity (parameters as above). Two searches were performed on these sequences. First, a BLAST 
vs. EST database search was performed and sequences with greater than 99% overlap, greater than 
99% similarity and a p value of less than 1 x 10e-40 were discarded. Sequences with a p value of less 
than 1 x 10e-65 when compared to a database sequence of human origin were also excluded. Second, a 
BLASTN vs. Patent GeneSeq database was performed and sequences having greater than 99% 
identity, p value less than 1 x 10e-40, and greater than 99% overlap were discarded. 
[00180] The remaining sequences were subjected to screening using other rules and redundancies 
inthedataset. Sequences with a p value of less than 1 x 10e-lll in relation to a database sequence of 
human origin were specifically excluded. The final result provided the 8064 sequences listed as SEQ 
IDNOS:l-6010in the accompanying Sequence Listing and summarized in Table 2 (inserted prior to 
claims). Each identified polynucleotide represents sequence from at least a partial mRNA transcript. 
Summary of po lynucleotides of the invention 
Table 2 (inserted prior to claims) provides a summary of polynucleotides isolated as described. 
Specifically, Table 2 provides: 1) the SEQ ID NO ("SEQ ID") assigned to each sequence for use in 
the present specification; 2) theCluster Identification No. ("CLUSTER"); 3) the Sequence Name 
assigned to each sequence; 3) the sequence name ("SEQ NAME") used as an internal identifier of the 
sequence; 4) the orientation of the sequence ("ORIENT") (either forward (F) or reverse (R)); 5) the 
name assigned to the clone from which the sequence was isolated ("CLONE ID"); and the name of the 
library from which the sequence was isolated ("LIBRARY"), where the notatiion indicates that name 
of the cell line or patient sample (e.g., UC2-NormColon indicates the sequence was isolated from 
normla colon tissue of the patient assigned the idnetification UC#2). Because at least some of the 
provided polynucleotides represent partial mRNA transcripts, two or more polynucleotides may 
represent different regions of the same mRNA transcript and the same gene and/or may be contained 
within the same clone. Thus, for example, if two or more SEQ ID NOS: are identified as belonging to 
the same clone, then either sequence can be used to obtain the full-length mRNA or gene 
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Example 2:Results of Public Database Search to Identify Function of Gene Products 
[00181] SEQ ID NOS: 1-6010 were translated in all three reading frames, and the nucleotide 
sequences and translated amino acid sequences used as query sequences to search for homologous 
sequences in either the GenBank (nucleotide sequences) or Non-Redundant Protein (amino acid 
sequences) databases. Query and individual sequences were aligned using the BLAST 2.0 programs, 
available over the world wide at a site sponsored by the National Center for Biotechnology 
Information, which is supported by the National Library of Medicine and the National Institutes of 
Health (see also Altschul, et al. Nucleic Acids Res. (1997) 25:3389-3402). The sequences were 
masked to various extents to prevent searching of repetitive sequences or poly-A sequences, using the 
BLASTX program for masking low complexity as described above in Example 1. 
[00182] Tables 3A and 3B (inserted prior to claims) provides the alignment summaries having a p 
value of 1 x 10e-2 or less indicating substantial homology between the sequences of the present 
^ invention and those of the indicated public databases. Specifically, Table 3 A provides the SEQ ID NO 

%B of ^ ^ UQT y sequence, the accession number of the GenBank database entry of the homologous 

• 

^ sequence, and the individual p value of each alignment. Table 3 A provides the SEQ ID NO of the 

fy query sequence, the accession number of the Non-Redundant Protein database entry of the homologous 

q sequence, and the individual p value of each alignment. The alignments provided in Tables 3 A and 3B 

ut are the best available alignment to a DNA or amino acid sequence at a time just prior to filing of the 

j 3 present specification. The activity of the polypeptide encoded by the SEQ ID NOS listed in these 

[0 tables can be extrapolated to be substantially the same or substantially similar to the activity of the 

j n reported nearest neighbor or closely related sequence. The accession number of the nearest neighbor is 

O reported, providing a publicly available reference to the activities and functions exhibited by the 

nearest neighbor. The public information regarding the activities and functions of each of the nearest 
neighbor sequences is incorporated by reference in this application. Also incorporated by reference is 
all publicly available information regarding the sequence, as well as the putative and actual activities 
and functions of the nearest neighbor sequences listed in Tables 3A and 3B and their related sequences. 
The search program and database used for the alignment, as well as the calculation of the p value are 
also indicated. 

[00183] Full length sequences or fragments of the polynucleotide sequences of the nearest 
neighbors can be used as probes and primers to identify and isolate the full length sequence of the 
corresponding polynucleotide. The nearest neighbors can indicate a tissue or cell type to be used to 
construct a library for the full-length sequences of the corresponding polynucleotides. 
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Example 3:Members of Protein Families 
[00184] SEQ ID NOS: 1-60 10 were used to conduct a profile search as described in the 
specification above. Several of the polynucleotides of the invention were found to encode polypeptides 
having characteristics of a polypeptide belonging to a known protein family (and thus represent 
members of these protein families) and/or comprising a known functional domain. Table 4 (inserted 
before claims) provides the SEQ ID NO: of the query sequence, the Sequence Name, the Cluster to 
which the sequence is assigned, a brief description of the profile hit, the orientation (Direction, "Dir") 
of the query sequence with respect to the individual sequence )where forward (for) indicates that the 
alignment is in the same direction (left to right) as the sequence provided in the Sequence Listing and 
reverse (rev) indicates that the alignment is with a sequence complementary to the sequence provided in 
the Sequence Listing), and the score of the profile hit. 

[00185] Some polynucleotides exhibited multiple profile hits where the query sequence contains 
overlapping profile regions, and/or where the sequence contains two different functional domains. 
Each of the profile hits of Table 4 is described in more detail below. The acronyms for the profiles 
(provided in parentheses) are those used to identify the profile in the Pfam, Prosite, and InterPro 
databases. The Pfam database can be accessed through web sites supported by Genome Sequencing 
Center at the Washington University School of Medicine or by the European Molecular Biology 
Laboratories in Heidelberg, Germany. The Prosite database can be accessed at the ExPASy Molecular 
Biology Server on the internet. The InterPro database can be accessed at a web site supported by the 
EMBL European Bioinformatics Institute. The public information available on the Pfam, Prosite, and 
InterPro databases regarding the various profiles, including but not limited to the activities, function, 
and consensus sequences of various proteins families and protein domains, is incorporated herein by 
reference. 

[00186] Seven Transmembrane Integral Membrane Proteins - Rhodopsin Family (7tm 1: Pfam 
Accession No. PF0000R SEQ ID NOS:2973, 5467, and 5508 correspond to sequences encoding a 
polypeptides that are members of the seven transmembrane (7tm) receptor rhodopsin family. G- 
protein coupled receptors of the (7tm) rhodopsin family (also called R7G) are an extensive group of 
hormones, neurotransmitters, and light receptors which transduce extracellular signals by interaction 
with guanine nucleotide-binding (G) proteins (Strosberg, Eur. J. Biochem. (1991) 196:1; Kerlavage, 
Curr. Opin. Struct. Biol. (1991) 1:394; Probst et al, DNA Cell Biol (1992) 11:1; Savarese et al, 
Biochem. J. (1992) 283: 1. The consensus pattern that contains the conserved triplet and that also 
spans the major part of the third transmembrane helix is used to detect this widespread family of 
proteins: [GSTALIVMFYWC]-[GSTANCPDE]-{EDPKRH}-x(2)-[LIVMNQGA]-x(2)- [LIVMFT]- 
[GSTANC]-[LIVMFYWSTAC]-[DENH]-R-[FYWCSH]-x(2)- [LIVM]. 
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[00187] Ank Repeats f ANK: Pfam Accession No. PF0023Y SEQ ID NOS:445, 487 and 3013 
represent polynucleotides encoding Ank repeat-containing proteins. The ankyrin motif is a 33 amino 
acid sequence named after the protein ankyrin which has 24 tandem 33-amino-acid motifs. Ank 
repeats were originally identified in the cell-cycle-control protein cdclO (Breeden et al., Nature (1987) 
329:65 1). Proteins containing ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell 
cycle protein cdclO, the Notch receptor (Matsuno et al, Development (1997) 124(2 1):4265); G9a (or 
BAT8) of the class III region of the major histocompatibility complex (Biochem J. (1993) 290:81 1- 
818); FABP, GABP, 53BP2, Linl2 5 glp-1, SW14, and SW16. The functions of the ankyrin repeats 
are compatible with a role in protein-protein interactions (Bork, Proteins (1993) 17(4):363; Lambert 
and Bennet, Eur. J. Biochem. (1993) 211:1; Kerr et al. 5 Current Op. Cell Biol. (1992) 4:496; Bennet et 
al., J. Biol. Chem. (1980) 255:6424). 

[00188] Basic Region Plus Leucine Zipper Transcription Factors (BZIP: Pfam Accession 
No. PF00170). SEQ IDNOS:108, 1714, 3931, and 4356 represent polynucleotides encoding novel 
members of the family of basic region plus leucine zipper transcription factors. The bZIP superfamily 
(Hurst, Protein Prof. (1995) 2:105; and Ellenberger, Curr. Opin. Struct. Biol. (1994) ¥:12) of 
eukaryotic DNA-binding transcription factors encompasses proteins that contain a basic region 
mediating sequence-specific DNA-binding followed by a leucine zipper required for dimerization. The 
consensus pattern for this protein family is: [KR]-x(l,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x- 
[RKTAENQ] -x-R-x-[RK] . 

[00189] DEAD and DEAH box families ATP-deoendent helicases (Dead box helic: Pfam 
Accession No. PF00270. SEQ ID NOS:38, 415, and 5756 correspond to sequences encoding a 
polypeptides that are members of the DEAD box family. A number of eukaryotic and prokaryotic 
proteins have been characterized (Schmid et al, Mol Microbiol. (1992) 5:283; Linder et al, Nature 
(1989) 357:121; Wassarman et al, Nature (1991) 349:463) on the basis of their structural similarity. 
All are involved in ATP-dependent, nucleic-acid unwinding. All DEAD box family members of the 
above proteins share a number of conserved sequence motifs, some of which are specific to the DEAD 
family while others are shared by other ATP-binding proteins or by proteins belonging to the 
helicases 'superfamily' (Hodgman, Nature (1988) 333:22 and Nature (1988) 333:578 (Errata)). One 
of these motifs, called the 'D-E-A-D-box 1 , represents a special version of the B motif of ATP-binding 
proteins. Some other proteins belong to a subfamily which have His instead of the second Asp and are 
thus said to be 'D-E-A-H-box* proteins (Wassarman et al., Nature (1991) 349:463; Harosh et al., 
Nucleic Acids Res. (1991) 19:633 1; Koonin et al., J. Gen. Virol. (1992) 73:989). The following 
signature patterns are used to identify member for both subfamilies: 1) [LIVMF](2)-D-E-A-D- 
[RKEN] -x- [LIVMF YGS TN] ; and 2) [GSAH]-x-[LIVMF](3)-D-E-[ALIV]-H-[NECR]. 
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[00190] Defensins (defensins: Pfam Accession No. PF00323. SEQ ID NO:486 corresponds to a 
sequence encoding a polypeptide that is a member of the mammalian defensin family. Defensins are a 
family of structurally related cysteine-rich peptides which are active against many Gram-positive 
bacteria, fungi, and enveloped viruses (Lehrer et al, ASM News (1990) 56:3 15-3 18; Lehrer et al, Cell 
(1991) 64:229-230; Kagan et al., Toxicology (1994) 87: 13 1-149; Lehrer et al, Annu. Rev. Immunol 
(1993) 77:105-128; White et al, Curr. Opin. Struc/Bioi (1995) 5:521-527). Some defensins inhibit 
corticotropoid-stimulated corticosteroid production and also play a significant role in innate immunity 
to infection and neoplasia. The peptides known to belong to this family range in length from 29 to 35 
amino acids and have seven invariant residues, including six cysteines that are all involved in intrachain 
disulfide bonds. The following consensus pattern is used to identify members of this protein family: C- 
x-C-x(3,5)-C-x(7)-G-x-C-x(9)-C-C. 

[00191] EF Hand HEfhand: Pfam Accession No. PF00036y SEQ ID NO:4373 corresponds to a 
polynucleotide encoding a member of the EF-hand protein family, a calcium binding domain shared by 
many calcium-binding proteins belonging to the same evolutionary family (Kawasaki et al, Protein, 
Prof. (1995) 2:305-490). The domain is a twelve residue loop flanked on both sides by a twelve 
residue alpha-helical domain, with a calcium ion coordinated in a pentagonal bipyramidal 
configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these 
residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two 
oxygens for liganding Ca (bidentate ligand). The consensus pattern includes the complete EF-hand 
loop as well as the first residue which follows the loop and which seem to always be hydrophobic: D- 
x-[DNS]-{ILVFW}-[DENSTG]-[DNQGHRK]-{GP}-[LIVMC]-[DENQSTAGC]-x(2)-[DE]- 
[LIVMFYW]. 

[00192] Epidermal Growth Factor (EGF: Pfam Accession No. PF00008Y SEQ ID NO:3689 
represents a polynucleotide encoding a member of the EGF family of proteins. The distinguishing 
characteristic of this family is the presence of a sequence of about thirty to forty amino acid residues 
found in epidermal growth factor (EGF) which has been shown to be present, in a more or less 
conserved form, in a large number of other proteins (Davis, New Biol (1990) 2:410-419; Blomquist et 
al, Proa Natl Acad. Sci. U.S.A. (1984) 57:7363-7367; Barkert et al., Protein Nucl AcidEnz. (1986) 
29:54-86; Doolittle et al, Nature. (1984) 307:558-560; Appella et al, FEBSLett. (1988) 237: 1-4; 
Campbell and Bork, Curr. Opin. Struct. Biol (1993) 3:385-392). A common feature of the domain is 
that the conserved pattern is generally found in the extracellular domain of membrane-bound proteins 
or in proteins known to be secreted. The EGF domain includes six cysteine residues which have been 
shown to be involved in disulfide bonds. The main structure is a two-stranded beta-sheet followed by a 
loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines strongly 
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vary in length. The following consensus patterns are used to identify members of this family: C-x-C- 
x(5)-G-x(2)-C and C-x-C-x(s)-[GP]-[FYW]-x(4,8)-C. 

[00193] Ets Domain (Ets Cterm; Pfam Accession N o , PF00 1 SEQ ID NO:6 represents a 
polynucleotide encoding a polypeptide with C-terminal homology in the ETS domain. Proteins of this 
family contain a conserved domain, the "ETS-domain," that is involved in DNA binding. The domain 
appears to recognize purine-rich sequences; it is about 85 to 90 amino acids in length, and is rich in 
aromatic and positively charged residues (Wasylyk et al, Eur. J. Biochem. (1993) 27 7:718). The ets 
gene family encodes a novel class of DNA-binding proteins, each of which binds a specific DNA 
sequence and comprises an ets domain that specifically interacts with sequences containing the 
common core tri-nucleotide sequence GGA. In addition to an ets domain, native ets proteins comprise 
other sequences which can modulate the biological specificity of the protein. Ets genes and proteins 
are involved in a variety of essential biological processes including cell growth, differentiation and 
development, and three members are implicated in oncogenic process. 

[° 0194 1 Pleckstrin Homology fPH- P fam Accession No. PF00 1 69t SEQ ID NOS:228 and 6001 
correspond to polynucleotides encoding members of the PH family. The pleckstrin homology domain 
is a domain of about 100 residues that occurs in a wide range of proteins involved in intracellular 
signaling or as constituents of the cytoskeleton (Mayer et al, Cell (1993) 75:629-630; Haslam et al., 
Nature (1993) 3(55:309-310; Musacchio et al, Trends Biochem. ScL (1993) 75:343-348; Gibson et 
al, Trends Biochem. Sci. (1994) 79:349-353; Pawson, Nature (1995) 573:573-580; Ingley and 
Hemmings, J. Cell. Biochem. (1994) 5<5:436-443; Saraste and Hyvonen, Curr. Opin. Struct. Biol. 
(1995) 5:403-408). All known structures of PH domains have two perpendicular anti-parallel beta 
sheets, followed by a C-terminal amphipathic helix. The loops connecting the beta-strands differ 
greatly in length, making the PH domain relatively difficult to detect (Riddihough, Nat. Struct. Biol. 
(1994) 7:755-757). There are no totally invariant residues within this domain. 

[00195] Protein Kinase (protkina se; Pfam Accession No. PF00069V SEQ ID NOS:9, 39, 69, 1 18, 
229 and 4151 represent polynucleotides encoding protein kinases, which catalyze phosphorylation of 
proteins in a variety of pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks et 
al.,FASEBJ. (1995) 9:576; Hunter,MefA. Enzymol. (1991) 200:3; Hanks etal.,Meth. Enzymol. 
(1991) 200:38; Hanks, Curr. Opin. Struct. Biol. (1991) 7:369; Hanks etal, Science (1988) 247:42) 
belong to a very extensive family of proteins that share a conserved catalytic core common to both 
serine/threonine and tyrosine protein kinases. There are a number of conserved regions in the catalytic 
domain of protein kinases. The first region, located in the N-terminal extremity of the catalytic 
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has been shown to 
be involved in ATP binding. The second region, located in the central part of the catalytic domain, 
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contains a conserved aspartic acid residue that is important for the catalytic activity of the enzyme 
(Knighton et al, Science (1991) 253:407). 

[00196] The protein kinase profile includes two signature patterns for this second region: one 
specific for serine/threonine kinases and the other for tyrosine kinases. A third profile is based on the 
alignment in Hanks et al (FASEB J. (1995) 9:576) and covers the entire catalytic domain. The 
consensus patterns are as follows: 1) [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}- 
[LIVCATHPD}-x-[GSTACLIVM^ 

where K binds ATP; 2) [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-[LIVMFYCT](3) 3 where D is 
an active site residue; and 3) [LrVMFYC]-x-[HY]-x-D4LIVMFY]-[RSTAC]-x(2)-N-[LIVMFYC] ? 
where D is an active site residue. 

[00197] Retroviral aspartvl protease (RVP; Pfam Accession No. PF00077Y SEQIDNO:2038 
corresponds to a polynucleotide encoding a member of the Aspartyl Protease family. Aspartyl 
proteases, also know as acid proteases, are a widely distributed family of proteolytic enzymes known to 
exist in vertebrates, fungi, plants, retroviruses and some plant viruses (Foltmann, Essays Biochem. 
(1981) 77:52-84; Davies, Annu. Rev. Biophys. Chem. (1990) 19: 189-215; Rao et al, Biochemistry 
(1991) 30:4663-4671). Most retroviral aspartyl proteases (RVP) are homodimers of a chain of about 
95 to 125 amino acids. In most retroviruses, the protease is encoded as a segment of a polyprotein 
which is cleaved during the maturation process of the virus. RVP is generally part of the pol 
polyprotein and, more rarely, of the gag polyprotein. The consensus pattern is as follows: 
[LIVMFGACHLIVMTADN]-[L^ 
[LIVMFGTA] [D is the active site residue]. 

[00198] Reverse Transcriptase frvt: Pfam Accession No. PF00078). SEQ ID NOS: 15 1 1 and 25 14 
represent polynucleotides encoding reverse transcriptases, which occur in a variety of mobile elements, 
including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and 
caulimoviruses (Xiong and Eickbush, EMBOJ (1990) P:3353-3362). Reverse transcriptases catalyze 
RNA-template-directed extension of the 3'-end of a DNA strand by one deoxynucleotide at a time and 
require an RNA or DNA primer. 

[00199] Transformi ng growth factor beta like domain (TGF beta: Pfam Accession No. PF00019V 
SEQ ID NO:5522 represents a polynucleotide encoding a polypeptide of the TGF-beta family. 
Proteins from the transforming growth factor-beta family are active as homo- or hetero-dimers with the 
two chains being linked by a single disulfide bond (Roberts and Sporn, In Peptide growth factors and 
their receptors, Handbook of Experimental Pharmacology, Vol. 95, pp419-475, Springer Verlag, 
Heidelberg, (1990); Burt, Biochem. Biophys. Res. Commun. (1992) 754:590-595; Burt and Law, 
Prog. Growth Factor Res. (1994) 5:99-1 18). It is known from X-ray studies that all the other 
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cysteines of the sequence are involved in interchain disulfide bonds (Daopin et al, Science (1992) 
257:369-373). Members of this family can be recognized by the following consensus pattern: [LIVM]- 
x(2)-P-x(2)-[FY]-x(4)-C-x-G-x-C (The two C's are involved in disulfide bonds). 
[00200] WD Domain (WD4Q1 G-Beta Repeats (WD domain; Pfam Accession No. PF00400Y 
SEQ ID NO: 1 17 represents a member of the WD domain/G-beta repeat family. Beta-transducin (G- 
beta) is one of the three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins 
(G proteins) which act as intermediaries in the transduction of signals generated by transmembrane 
receptors (Gilman, Annu. Rev. Biochem. (1987) 55:615). The alpha subunit binds to and hydrolyzes 
GTP; the beta and gamma subunits are required for the replacement of GDP by GTP as well as for 
membrane anchoring and receptor recognition. In higher eukaryotes, G-beta exists as a small 
multigene family of highly conserved proteins of about 340 amino acid residues. Structurally, G-beta 
has eight tandem repeats of about 40 residues, each containing a central Trp-Asp motif (this type of 
repeat is sometimes called a WD-40 repeat). The consensus pattern for the WD domain/G-Beta repeat 
family is: [LIVMSTAC]-[LIVMFYWSTAGC]-[LIMSTAG]4LIVMSTAGC]-x(2)-[DN]-x(2)- 
[LIVMWSTAC]-x-[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN]. 

[00201] Zinc Finger. C2H2 Type (Zincfing C2H2: Pfam Accession No. PF0Q096V SEQIDNOS: 
61, 502, 700, 847, 2034, 2054, 3403, 3524, 3653, 3723, 4688, and 4979 correspond to 
polynucleotides encoding members of the C2H2 type zinc finger protein family, which contain zinc 
finger domains that facilitate nucleic acid binding (Klug et al, Trends Biochem. Sci. (1987) 72:464; 
Evans et al, Cell (1988) 52: 1; Payre et al, FEBSLett. (1988) 234:245; Miller et al, EMBOJ. (1985) 
4: 1609; and Berg, Proa Natl Acad Sci. USA (1988) 55:99). In addition to the conserved zinc ligand 
residues, a number of other positions are also important for the structural integrity of the C2H2 zinc 
fingers (Rosenfeld et al,J. Biomol Struct. Dyn. (1993) 77:557). The best conserved position, which 
is generally an aromatic or aliphatic residue, is located four residues after the second cysteine. The 
consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. The 
two C's and two H's are zinc ligands. 

[00202] Zinc Finger. C2HC Type (Zincfing C2HC; Pfam Accession No. PF01530) . SEQ ID 
NO:38 14 represents a member of the C2HC zinc finger family. The 18 residue C2HC domain is 
mainly found in the retroviral nucleocapsid protein, and is required for viral genome packaging and for 
the early infection process (Katz and Jentoft, Bioessays (1989) 77: 176-181; Urbaneja et al, JMol 
Biol (1999) 257:59-75). In addition, the CCHC domain is found in eukaryotic proteins involved in 
RNA binding or single-strand DNA binding. 

[00203] Zinc finger. C3HC4 type (RING finger), signature (Zincfing C3HC4; Pfam Accession 
No. PF00097V SEQ ID NO: 3 140 represents a polynucleotide encoding a polypeptide having a 
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C3HC4 type zinc finger signature. A number of eukaryotic and viral proteins contain this^signature, 
which is primarily a conserved cysteine-rich domain of 40 to 60 residues (Borden et al, Curr. Opin. 
Struct BioL (1996) 5:395) that binds two atoms of zinc, and is probably involved in mediating 
protein-protein interactions. The 3D structure of the zinc ligation system is unique to the RING 
domain and is referred to as the "cross-brace" motif. The spacing of the cysteines in such a domain is 
C-x(2)-C-x(9 to 39)-C-x(l to 3)-H-x(2 to 3)-C-x(2)-C-x(4 to 48)-C-x(2)-C. The signature pattern for 
the C3HC4 finger is based on the domain's central region: C-x-H-x-[LIVMFY]-C-x(2)-C-[LIVMYA]. 
[00204] Zinc finger. CCCH type. (Zincfine CCCH: Pfam Accession No. PF00642Y SEQ ID 
NO:877 corresponds to a polyncleotide encoding a member of the CCCH zinc finger protein family. 
This domain is present in many eukaryotic proteins, including zinc finger proteins involved in cell cycle 
or growth phase-related regulation and regulatory proteins involved in regulating the response to 
growth factors. It has been shown that proteins containing the CCCH zinc finger interact with the 3 ' 
untranslated region of various mRNA (Carballo et al 9 Science (1998) 281: 1001-1005; Lai et al, Mol 
Cell Biol (1999) 19:43 1 1-4323) and that this domain is often present in two copies. The consensus 
pattern for the CCCH zinc fingers is: C-x8-C-x5-C-x3-H. 

Example 4: Description of Libraries and Detection of Differential Expression 
[00205] The relative expression levels of the polynucleotides of the invention were assessed in 
several libraries prepared from various sources, including cell lines and patient tissue samples. Table 1 
above provides a summary of these libraries, including the shortened library name, the mRNA source 
used to prepared the cDNA library, the "nickname" of the library that is used in the tables below (in 
quotes), and the approximate number of clones in the library. 

[00206] Each of the libraries is composed of a collection of cDNA clones that in turn are 
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate the 
analysis of the millions of sequences in each library, the sequences were assigned to clusters. The 
concept of "cluster of clones" is derived from a sorting/grouping of cDNA clones based on their 
hybridization pattern to a panel of roughly 300 7bp oligonucleotide probes (see Drmanac et al, 
Genomics (1996) 57(1):29). Random cDNA clones from a tissue library are hybridized at moderate 
stringency to 300 7bp oligonucleotides. Each oligonucleotide has some measure of specific 
hybridization to that specific clone. The combination of 300 of these measures of hybridization for 
300 probes equals the "hybridization signature" for a specific clone. Clones with similar sequence will 
have similar hybridization signatures. By developing a sorting/grouping algorithm to analyze these 
signatures, groups of clones in a library can be identified and brought together computationally. These 
groups of clones are termed "clusters". Depending on the stringency of the selection in the algorithm 
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(similar to the stringency of hybridization in a classic library cDNA screening protocol), the "purity" of 
each cluster can be controlled. For example, artifacts of clustering may occur in computational 
clustering just as artifacts can occur in "wet-lab" screening of a cDNA library with 400 bp cDNA 
fragments, at even the highest stringency. The stringency used in the implementation of cluster herein 
provides groups of clones that are in general from the same cDNA or closely related cDNAs. Closely 
related clones can be a result of different length clones of the same cDNA, closely related clones from 
highly related gene families, or splice variants of the same cDNA. 

[00207] Differential expression for a selected cluster was assessed by first determining the number 
of cDNA clones corresponding to the selected cluster in the first library (Clones in 1 st ), and the 
determining the number of cDNA clones corresponding to the selected cluster in the second library 
(Clones in 2 nd ). Differential expression of the selected cluster in the first library relative to the second 
library is expressed as a "ratio 1 ' of percent expression between the two libraries. In general, the "ratio" 
is calculated by: 1) calculating the percent expression of the selected cluster in the first library by 
dividing the number of clones corresponding to a selected cluster in the first library by the total number 
of clones analyzed from the first library; 2) calculating the percent expression of the selected cluster in 
the second library by dividing the number of clones corresponding to a selected cluster in a second 
library by the total number of clones analyzed from the second library; 3) dividing the calculated 
percent expression from the first library by the calculated percent expression from the second library. 
If the "number of clones" corresponding to a selected cluster in a library is zero, the value is set at 1 to 
aid in calculation. The formula used in calculating the ratio takes into account the "depth" of each of 
the libraries being compared, i.e., the total number of clones analyzed in each library. 
[00208] In general, a polynucleotide is significantly differentially expressed between two samples 
when the ratio value is greater than at least about 2, preferably greater than at least about 3, more 
preferably greater than at least about 5 , where the ratio value is calculated using the method described 
above. The significance of differential expression is determined using a z score test (Zar, Biostatistical 
Analysis . Prentice Hall, Inc., USA, "Differences Between Proportions " pp 296-298 (1974). 
[00209] Using this approach, a number of polynucleotide sequences were identified as being 
differentially expressed between, for example, cells derived from high metastatic potential cancer tissue 
and low metastatic cancer cells, and between cells derived from metastatic cancer tissue and normal 
tissue. Evaluation of the levels of expression of the genes corresponding to these sequences can be 
valuable in diagnosis, prognosis, and/or treatment (e.g., to facilitate rationale design of therapy, 
monitoring during and after therapy, etc.). Moreover, the genes corresponding to differentially 
expressed sequences described herein can be therapeutic targets due to their involvement in regulation 
(e.g., inhibition or promotion) of development of, for example, the metastatic phenotype. For example, 

59 



sequences that correspond to genes that are increased in expression in high metastatic potential cells 
relative to normal or non-metastatic tumor cells may encode genes or regulatory sequences involved in 
processes such as angiogenesis, differentiation, cell replication, and metastasis. 
[00210] Detection of the relative expression levels of differentially expressed polynucleotides 
described herein can provide valuable information to guide the clinician in the choice of therapy. For 
example, a patient sample exhibiting an expression level of one or more of these polynucleotides that 
corresponds to a gene that is increased in expression in metastatic or high metastatic potential cells 
may warrant more aggressive treatment for the patient. In contrast, detection of expression levels of a 
polynucleotide sequence that corresponds to expression levels associated with that of low metastatic 
potential cells may warrant a more positive prognosis than the gross pathology would suggest. 
[0021 1] A number of polynucleotide sequences of the present invention are differentially expressed 
between human microvascular endothelial cells (HMVEC) that have been treated with growth factors 
relative to untreated HMVEC. Sequences that are differentially expressed between growth factor- 
treated HMVEC and untreated HMVEC can represent sequences encoding gene products involved in 
angiogenesis, metastasis (cell migration), and other development and oncogenic processes. For 
example, sequences that are more highly expressed in HMVEC treated with growth factors (such as 
bFGF or VEGF) relative to untreated HMVEC can serve as drug targets for chemotherapeutics, e.g., 
decreasing expression of such up-regulated genes or inhibiting the activity of the encoded gene product 
would serve to inhibit tumor cell angiogenesis. Detection of expression of these sequences in colon 
cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment information 
associated with the prevention of achieving the malignant state in these tissues, and can be important in 
risk assessment for a patient. A patient sample displaying an increased level of one or more of these 
polynucleotides may thus warrant closer attention or more frequent screening procedures to catch the 
malignant state as early as possible. 

[00212] The differential expression of the polynucleotides can thus be used as, for example, 
diagnostic and/or prognostic markers, for risk assessment, patient treatment and the like. These 
polynucleotides can also be used in combination with other molecular and/or biochemical markers. 
[00213] The differential expression data for polynucleotides of the invention that have been 
identified as being differentially expressed across various combinations of the libraries described above 
is summarized in Table 5 (inserted prior to the claims). Table 5 provides: 1) the Sequence 
Identification Number ("SEQ ID NO") assigned to the polynucleotide; 2) the cluster ("CLUSTER") to 
which the polynucleotide has been assigned as described above; 3) the library comparisons that 
resulted in identifcation of the polynucleotide as being differentially expressed ("PAIR AB"), where 
the cDNA libraries used are referenced by their library numbers; 4) the number of clones 

60 



ru 

ill 
^4 



corresponding to the polynucleotide in the first library listed ("CLONES A"); 5) the number of clones 
corresponding to the polynucleotide in the second library listed ("CLONES B"); 6) the "RATIO 
PLUS" where the comparison resulted in a finding that the number of clones in library A is greater than 
the number of clones in library B; and 7) the "RATIO MINUS" where the comparison resulted in a 
finding that the number of clones in library B is greater than the number of clones in library A. 
[00214] Detection of expression of genes that correspond to the above polynucleotides may be of 
particular interest in diagnosis, prognosis, risk assessment, and monitoring of treatment. Furthermore, 
differential expression of a specific gene across multiple libraries can also be indicative of a gene 
whose expression is associated with, for example, suppression of the metastatic phenotype or with 
development of the cell toward a metastatic phenotype. For example, SEQ ID NO:3744 corresponds 
to a gene that is expressed at relatively higher levels in metastatized colon tumor than in normal colon 
tissue. Thus a relatively increased level of expression of the gene corresponding to SEQ ID NO:3744 
may be used as marker of a metastatic or pre-metastatic colon eels either alone or in combination with 
other markers. 

[00215] Some polynucleotides exhibited similar differential expression trends in libraries of 
different tissue origin (see, e.g.., SEQ ID NO:337). These data suggest that the differential expression 
patterns of some genes associated with development of tumors indicate a role for those genes that is 
non-specific to the tissue of origin. 

Example 5: Detection of Differential Expression Using Arrays 
[00216] mRNA isolated from samples of cancerous and normal colon tissue obtained from patients 
were analyzed to identify genes differentially expressed in cancerous and normal cells. Normal and 
cancerous cells collected from cryopreserved patient tissues were isolated using laser capture 
microdissection (LCM) techniques, which techniques are well known in the art (see, e.g., Ohyama et 
al. (2000) Biotechniques 29:530-6; Currant a/. (2000) Mol Pathol 53:64-8; Suarez-Quian et al 
(1999) Biotechniques 26:328-35; Simone et al (1998) Trends Genet 14:272-6; Conia et al. (1997) J. 
Clin. Lab. Anal 1 1:28-38; Emmert-Buck et al (1996) Science 274:998-1001). 
[00217] Table 6 (inserted before the claims) provides information about each patient from which 
colon tissue samples were isolated, including: the Patient ID ("PT ID")and Path ReportID (}Path ID"), 
which are numbers assigned to the patient and the pathology reports for identification purposes; the 
group ("Grp'Oto which the patients have been assigned; the anatomical location of the tumor ("Anatom 
Loc"); the primary tumor size ("Size"); the primary tumor grade ("Grade"); the identification of the 
histopathological grade ("Histo Grade"); a description of local sites to which the tumor had invaded 
("Local Invasion"); the presence of lymph node metastases ("LN Met"); the incidence of lymph node 
metastases (provided as a number of lymph nodes positive for metastasis over the number of lymph 
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nodes examined) ("Incidence Lymphnode Met"); the "Regional Lymphnode Grade"; the identification 
or detection of metastases to sites distant to the tumor and their location ("Dist Met & Loc"); the grade 
of distant metastasis ("Dist Met Grade"); and general comments about the patient or the tumor 
("Comments"). Histophatology of all primary tumors incidated the tumor was adenocarcinmoa except 
for Patient ID Nos. 130 (for which no information was provided), 392 ( in which greater than 50% of 
the cells were mucinous carcinoma), and 784 (adenosquamous carcinoma). Extranodal extensions 
were described in three patients, Patient ID Nos. 784, 789, and 791. Lymphovascular invasion was 
described in Patient ID Nos. 128, 278, 517, 534, 784, 786, 789, 791, 890, and 892. Crohn's-like 
infiltrates were described in seven patients, Patient ID Nos. 52, 264, 268, 392, 393, 784, and 791. 
Table 7 (below) provides information about the patients from whom the prostate tissue was isolated. 



Table 7. Prostate paitent data. 



Prostate Patient ID 


Tumor Gleason Grade 


Normal Prostate Description 


96 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


282 


4+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


286 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


294 


3+4 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


362 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


428 


4+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


492 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


492 


3+3 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


493 


3+4 Adenocarcinoma 


Normal prostate; Benign hyperplasia 


510 


3+3 Adenocarcinoma 


formal Prostate; Benign hyperplasia 



Identification of differentially expressed genes 
[00218] cDNA probes were prepared from total RNA isolated from the patient cells described 
above. Since LCM provides for the isolation of specific cell types to provide a substantially 
homogenous cell sample, this provided for a similarly pure RNA sample. 

[00219] Total RNA was first reverse transcribed into cDNA using a primer containing a T7 RNA 
polymerase promoter, followed by second strand DNA synthesis. cDNA was then transcribed in vitro 
to produce antisense RNA using the T7 promoter-mediated expression (see, e.g., Luo et al (1999) 
Nature Med 5:1 17-122), and the antisense RNA was then converted into cDNA. The second set of 
cDNAs were again transcribed in vitro, using the T7 promoter, to provide antisense RNA. Optionally, 
the RNA was again converted into cDNA, allowing for up to a third round of T7-mediated 
amplification to produce more antisense RNA. Thus the procedure provided for two or three rounds of 
in vitro transcription to produce the final RNA used for fluorescent labeling. 

[00220] Fluorescent probes were generated by first adding control RNA to the antisense RNA mix, 
and producing fluorescently labeled cDNA from the RNA starting material. Fluorescently labeled 
cDNAs prepared from the tumor RNA sample were compared to fluorescently labeled cDNAs prepared 
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from normal cell RNA sample. For example, the cDNA probes from the normal cells were labeled 
with Cy3 fluorescent dye (green) and the cDNA probes prepared from the tumor cells were labeled 
with Cy5 fluorescent dye (red), and vice versa. 

[00221] Each array used had an identical spatial layout and control spot set. Each microarray was 
divided into two areas, each area having an array with, on each half, twelve groupings of 32 x 12 spots, 
for a total of about 9,2 16 spots on each array. The two areas are spotted identically which provide for 
at least two duplicates of each clone per array. 

[00222] Polynucleotides for use on the arrays were obtained from both publicly available sources 
and from cDNA libraries generated from selected cell lines and patient tissues. PCR products of from 
about 0.5kb to 2.0 kb amplified from these sources were spotted onto the array using a Molecular 
Dynamics Gen III spotter according to the manufacturer's recommendations. For polynucleotides 
described herein, the microarray spot contained a clone having a cDNA from which the sequence was 
derived. The first row of each of the 24 regions on the array had about 32 control spots, including 4 
negative control spots and 8 test polynucleotides. The test polynucleotides were spiked into each 
sample before the labeling reaction with a range of concentrations from 2-600 pg/slide and ratios of 
1:1. For each array design, two slides were hybridized with the test samples reverse-labeled in the 
labeling reaction. This provided for about four duplicate measurements for each clone, two of one color 
and two of the other, for each sample. 

[00223] Table 8 (inserted before the claims) describes sequences present on the arrays. Table 8 
includes: 1) athe SEQ ID NO of the sequence of the polynucleotide; and 2)the Spot ID, which is a 
unique identifier for each spot ontaining target sequence of interest on all arrays used. 
[00224] The differential expression assay was performed by mixing equal amounts of probes from 
tumor cells and normal cells of the same patient. The arrays were prehybridized by incubation for 
about 2 hrs at 60°C in 5X SSC/0.2% SDS/1 mM EDTA, and then washed three times in water and 
twice in isopropanol. Following prehybridization of the array, the probe mixture was then hybridized 
to the array under conditions of high stringency (overnight at 42°C in 50% formamide, 5X SSC, and 
0.2% SDS. After hybridization, the array was washed at 55°C three times as follows: 1) first wash in 
IX SSC/0.2% SDS; 2) second wash in 0. IX SSC/0.2% SDS; and 3) third wash in 0. IX SSC. 
[00225] The arrays were then scanned for green and red fluorescence using a Molecular Dynamics 
Generation III dual color laser-scanner/detector. The images were processed using BioDiscovery 
Autogene software, and the data from each scan set normalized to provide for a ratio of expression 
relative to normal. Data from the microarray experiments was analyzed according to the algorithms 
described in U.S. application serial no. 60/252,358, filed November 20, 2000, by EJ. Moler, MA. 
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Boyle, and F.M. Randazzo, and entitled "Precision and accuracy in cDNA microarray data," which 
application is specifically incorporated herein by reference. 

[00226] The experiment was repeated, this time labeling the two probes with the opposite color in 
order to perform the assay in both "color directions." Each experiment was sometimes repeated with 
two more slides (one in each color direction). The level fluorescence for each sequence on the array 
expressed as a ratio of the geometric mean of 8 replicate spots/genes from the four arrays or 4 replicate 
spots/gene from 2 arrays or some other permutation. The data were normalized using the spiked 
positive controls present in each duplicated area, and the precision of this normalization was included 
in the final determination of the significance of each differential. The fluorescent intensity of each spot 
was also compared to the negative controls in each duplicated area to determine which spots have 
detected significant expression levels in each sample. 

[00227] A statistical analysis of the fluorescent intensities was applied to each set of duplicate 
spots to assess the precision and significance of each differential measurement, resulting in a p-value 
testing the null hypothesis that there is no differential in the expression level between the tumor and 
normal samples of each patient. During initial analysis of the microarrays, the hypothesis was accepted 
if p > 10" 3 , and the differential ratio was set to 1.000 for those spots. All other spots have a significant 
difference in expression between the tumor and normal sample. If the tumor sample has detectable 
expression and the normal does not, the ratio is truncated at 1000 since the value for expression in the 
normal sample would be zero, and the ratio would not be a mathematically useful value (e.g., infinity). 
If the normal sample has detectable expression and the tumor does not, the ratio is truncated to 0.001, 
since the value for expression in the tumor sample would be zero and the ratio would not be a 
mathematically useful value. These latter two situations are referred to herein as "on/off" Database 
tables were populated using a 95% confidence level (p>0.05). 

[00228] Table 8 (inserted before the claims) provides the results for gene products differentially 
expressed in the colon tumor samples relative to normal tissue samples. Table 8 includes: 1) the SEQ 
ID NO; 2) the spot identification number ("SpotID"); 3) the percentage of patients tested in which 
expression levels of the gene (as detected using the correponding clone) was at least 2-fold greater in 
cancerous colon tissue (primary colon tumor) than in matched normal tissue ("Colon>2x T/N "); 4) the 
percentage of patients tested in which expression levels of the gene was less than or equal to one-half 
of the expression level in matched normal cells ("Colon <=halfx T/N M ); and 5) the colon number ratios, 
indicating the number of patients upon which the provided ratios was based. 
[00229] Table 9 below provides the data for differential expression analysis on the arrays using 
samples from metastazed colon tissue. In this example, the samples used for hybridization to 
sequences on the microarray were derived from the matched metastasized (MT) colon tissue and 
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normal (N) colon tissues of the patients. Table 9 includes: 1) the SEQ ID NO; 2) the percentage of 
patients tested in which expression levels of the gene (as detected using the correponding clone) was j 
least 2-fold greater in metastisized cancerous colon tissue (MT) than in matched normal tissue 
( M Colon>2x MT/N "); 5) the percentage of patients tested in which expression levels of the gene was 
less than or equal to one-half of the expression level in matched normal cells ("Colon <=halfx T/N M ); 
and 8) the colon number ratios, indicating the number of patients upon which the provided ratios was 
based. The corresponding data with the same sequence of the colon tumor tissue versus matched 
normal colon tissue (T/N) are provided for convenience in comparison. 

Table 9. Polynucleotides Corresponding to Differnetially Expressed Genes in Metastasized 

Colon Cancer Tissue 











Colon MT/N 








EIJ 


SEQ ID 
NO 


Colon 

MT/N >2x 


Colon MT/N 
<halfx 


Num Ratios 
by Clone 


Colon T/N 
>2x 


Colon T/N 
<halfx 


Colon T/N 
Num Ratios 




217 


40.0 


0.0 


5.0 


0.0 


50.0 


8.0 




324 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 


ry 
r\ 


1653 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 




1972 


40.0 


0.0 


5.0 


0.0 


42.9 


7.0 




2346 


20.0 


40.0 


5.0 


0.0 


85.7 


7.0 




2352 


20.0 


80.0 


5.0 


0.0 


71.4 


7.0 


f?\ 


2353 


20.0 


40.0 


5.0 


0.0 


85.7 


7.0 




2647 


0.0 


40.0 


5.0 


0.0 


62.5 


8.0 


Tin 

9.s ; 


5341 


0.0 


40.0 


5.0 


0.0 


37.5 


8.0 



[00230] Table 10 below provides the data for differential expression analysis on the arrays using 
samples from matched cancerous and normal prostate tissue (PT/N). Table 10 includes: 1) the SEQ ID 
NO; 2) the percentage of patients tested in which expression levels of the gene (as detected using the 
correponding clone) was at least 2-fold greater in metastisized cancerous prostate tissue (PT) than in 
matched normal tissue ( n Colon>2x PT/N "); 3) the percentage of patients tested in which expression 
levels of the gene was less than or equal to one-half of the expression level in matched normal cells 
("Colon <=halfx PT/N"); and 4) the prostate PT/N number ratios, indicating the number of patients 
upon which the provided ratios was based. The corresponding data with the same sequences for the 
colon tumor versus normal (T/N) and metastasized colon tissue versus normal (MT/N) are provided for 
convenience in comparison. 
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Table 10. Polynucleotides Corresponding to Differnetially Expressed Genes in Prostate 
Cancer Tissue 



SEQ 
NO 


Prostate 

\r 1/iN ) 

>2x 


Prostate 
<halfx 


Prostate 
(PT/N) 
Num 

R sttiriQ 


Colon 

T /XT 

T/N 

-^Z. A. 


Colon 
T/N 


Colon T/N 
Num 
Ratios 


Colon 
MT/N 

>zx 


Colon 
MT/N 
<nallx 


Colon 
MT/N 
Num 
Ratios 


139 


11.1 


33.3 


9 0 


0 0 

\J.\J 


Ju.U 


o.U 








490 


37.5 


12.5 


R 0 


n n 

w.u 


71 A 


/.U 








629 


33.3 


11.1 


9.0 














644 


12.5 


37.5 


8.0 


0.0 


42.9 


7.0 








1674 


33.3 


0.0 


9.0 














2346 


37.5 


25.0 


8.0 


0.0 


85.7 


7.0 


20.0 


40.0 


5.0 


2352 


37.5 


12.5 


8.0 


0.0 


71.4 


7.0 


20.0 


80.0 


5.0 


2420 


22.2 


33.3 


9.0 














3296 


33.3 


0.0 


9.0 


0.0 


37.5 


8.0 









Example 6: Antisense Regulation of Gene Expression 
[00231] The expression of the differentially expressed genes represented by the polynucleotides in 
the cancerous cells can be further analyzed using antisense knockout technology to confirm the role and 
function of the gene product in tumorigenesis, e.g., in promoting a metastatic phenotype. 
[00232] Methods for analysis using antisense technology are well known in the art. For example, a 
number of different oligonucleotides complementary to the mRNA generated by the differentially 
expressed genes identified herein can be designed as antisense oligonucleotides, and tested for their 
ability to suppress expression of the genes. Sets of antisense oligomers specific to each candidate 
target are designed using the sequences of the polynucleotides corresponding to a differentially 
expressed gene and the software program HYBsimulator Version 4 (available for Windows 
95/Windows NT or for Power Macintosh, RNAture, Inc. 1003 Health Sciences Road, West, Irvine, CA 
92612 USA). Factors considered when designing antisense oligonucleotides include: 1) the secondary 
structure of oligonucleotides; 2) the secondary structure of the target gene; 3) the specificity with no or 
minimum cross-hybridization to other expressedgenes; 4) stability; 5) length and 6) terminal GC 
content. The antisense oligonucleotide is designed to so that it will hybridize to its target sequence 
under conditions of high stringency at physiological temperatures {e.g., an optimal temperature for the 
cells in culture to provide for hybridization in the cell, e.g., about 37°C), but with minimal formation of 
homodimers. 

[00233] Once synthesized and quantitated, the oligomers are screened for efficiency of a transcript 
knock-out in a panel of cancer cell lines. The efficiency of the knock-out is determined by analyzing 
mRNA levels using lightcycler quantification. The oligomers that resulted in the highest level of 
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transcript knock-out, wherein the level was at least about 50%, preferably about 80-90%, up to 95% or 
more up to undetectable message, are selected for use in a cell-based proliferation assay, an anchorage 
independent growth assay, and an apoptosis assay. 

[00234] For example, where the polynucleotide is identified as having a role in colon cancer, the 
ability of the corresponding designed antisense oligonucleotide to inhibit gene expression is tested 
through transfection into SW620 colon colorectal carcinoma cells. For each transfection mixture, a 
carrier molecule, preferably a lipitoid or cholesteroid, is prepared to a working concentration of 0.5 
mM in water, sonicated to yield a uniform solution, and filtered through a 0.45 fxm PVDF membrane. 
The antisense or control oligonucleotide is then prepared to a working concentration of 100 \xM in 
sterile Millipore water. The oligonucleotide is further diluted in OptiMEM™ (Gibco/BRL), in a 
microfiige tube, to 2 \iM, or approximately 20 \ig oligo/ml of OptiMEM™. In a separate microfuge 
tube, lipitoid or cholesteroid, typically in the amount of about 1.5-2 nmol lipitoid/jig antisense 
^ oligonucleotide, is diluted into the same volume of OptiMEM™ used to dilute the oligonucleotide. The 

diluted antisense oligonucleotide is immediately added to the diluted lipitoid and mixed by pipetting up 
^ and down. Oligonucleotide is added to the cells to a final concentration of 30 nM. 

[ U [00235] The level of target mRNA that corresponds to a target gene of interest in the transfected 

Q cells is quantitated in the cancer cell lines using the Roche LightCycler™ real-time PCR machine. 

& 1 Values for the target mRNA are normalized versus an internal control (e.g. , beta-actin). For each 20 \il 

C3 reaction, extracted RNA (generally 0.2-1 \ig total) is placed into a sterile 0.5 or 1.5 ml microcentrifuge 

^ tube, and water added to a total volume of 12.5 |il. To each tube 7.5 \il of a buffer/enzyme mixture is 

m added, which is prepared by mixing (in the order listed) 2.5 |il H 2 0, 2.0 |il 10X reaction buffer, 10 ]nl 

W oligo dT (20 pmol), 1.0 ^1 dNTP mix (10 mM each), 0.5 ^1 RNAsin® (20u) (Ambion, Inc., Hialeah, 

FL), and 0.5 \il MMLV reverse transcriptase (50u) (Ambion, Inc.). The contents are mixed by 
pipetting up and down, and the reaction mixture incubated at 42°C for 1 hour. The contents of each 
tube are centrifuged prior to amplification. 

[00236] An amplification mixture is prepared by mixing in the following order: IX PCR buffer II, 
3 mM MgCl 2 , 140 ^M each dNTP, 0.175 pmol each oligo, 1:50,000 dil of SYBR® Green, 0.25 mg/ml 
BSA, 1 unit Taq polymerase, and H 2 0 to 20 \xl (PCR buffer II is available in 10X concentration from 
Perkin-Elmer, Norwalk, CT). In IX concentration it contains 10 mM Tris pH 8.3 and 50 mM KC1. 
SYBR® Green (Molecular Probes, Eugene, OR) is a dye which fluoresces when bound to double 
stranded DNA. As double stranded PCR product is produced during amplification, the fluorescence 
from SYBR® Green increases. To each 20 jul aliquot of amplification mixture, 2 \A of template RT are 
added, and amplification carried out according to standard protocols. 
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[00237] The results can be expressed as the percent decrease in expression of the corresponding 
gene product relative to non-transfected cells, vehicle-only transfected (mock-transfected) cells, or cells 
transfected with reverse control oligonucleotides. 

Example 7: Effect of Expression on Proliferation 
[00238] The effect of gene expression on the inhibition of cell proliferation can be assessed in, for 
example, metastatic breast cancer cell lines (MDA-MB-23 1 ("23 1")), SW620 colon colorectal 
carcinoma cells, or SKOV3 cells (a human ovarian carcinoma cell line). 

[00239] Cells are plated to approximately 60-80% confluency in 96-well dishes. Antisense or 
reverse control oligonucleotide iss diluted to 2 ^M in OptiMEM™ and added to OptiMEM™ into 
which the delivery vehicle, lipitoid 1 16-6 in the case of SW620 cells or 1 : 1 lipitoid 1 xholesteroid 1 in 
the case of MDA-MB-23 1 cells, had been diluted. The oligo/delivery vehicle mixture is then further 
diluted into medium with serum on the cells. The final concentration of oligonucleotide for all 
experiments was 300 nM, and the final ratio of oligo to delivery vehicle for all experiments iss 1.5 
nmol lipitoid/|jg oligonucleotide. 

[00240] Antisense oligonucleotides are prepared as described above (see Example 6). Cells are 
transfected overnight at 37°C and the transfection mixture replaced with fresh medium the next 
morning. Transfection is carried out as described above in Example 3. 

[00241] Those antisense oligonucleotides that inhibit proliferation represent genes that play a role 
in production or maintenance of the cancerous phenotype. 

Example 8: Effect of Gene Expression on Colony Formation 
[00242] The effect of gene expression upon colony formation of, for example, SW620 cells, 
SKOV3 cells, and MD-MBA-231 cells can be tested in a soft agar assay. Soft agar assays are 
conducted by first establishing a bottom layer of 2 ml of 0.6% agar in media plated fresh within a few 
hours of layering on the cells. The cell layer is formed on the bottom layer by removing cells 
transfected as described above from plates using 0.05% trypsin and washing twice in media. The cells 
are counted in a Coulter counter, and resuspended to 10 6 per ml in media. 10 (xl aliquots are placed 
with media in 96-well plates (to check counting with WST1), or diluted further for the soft agar assay. 
2000 cells are plated in 800 ^il 0.4% agar in duplicate wells above 0.6% agar bottom layer. After the 
cell layer agar solidifies, 2 ml of media is dribbled on top and antisense or reverse control oligo 
(produced as described in Example 6) added without delivery vehicles. Fresh media and oligos are 
added every 3-4 days. Colonies usually are expected to form in 10 days to 3 weeks. Fields of colonies 
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are counted by eye. Wst-1 metabolism values can be used to compensate for small differences in 
starting cell number. Larger fields can be scanned for visual record of differences. 
[00243] Those antisense oligonucleotides that inhibited colony formation represent genes that play 
a role in production or maintenance of the cancerous phenotype. 

Example 9: Induction of Cell Death upon Depletion of Polypeptides by Depletion of mRNA 
("Antisense Knockout") 

[00244] In order to assess the effect of depletion of a target message upon cell death, SW620 cells, 
or other cells derived from a cancer of interest, are transfected for proliferation assays. For cytotoxic 
effect in the presence of cisplatin (cis), the same protocol is followed but cells are left in the presence 
of 2 drug. Each day, cytotoxicity was monitored by measuring the amount of LDH enzyme 
released in the medium due to membrane damage. The activity of LDH is measured using the 
Cytotoxicity Detection Kit from Roche Molecular Biochemicals. The data is provided as a ratio of 
LDH released in the medium vs. the total LDH present in the well at the same time point and treatment 
(rLDH/tLDH). A positive control using antisense and reverse control oligonucleotides for BCL2 (a 
known anti-apoptotic gene) is included; loss of message for BCL2 leads to an increase in cell death 
compared with treatment with the control oligonucleotide (background cytotoxicity due to 
transfection). 

Example 10: Functional Analysis of Gene Products Differentially Expressed in Cancer 
[00245] The gene products of sequences of a gene differentially expressed in cancerous cells can 
be further analyzed to confirm the role and function of the gene product in tumorigenesis, e.g., in 
promoting or inhibiting development of a metastatic phenotype. For example, the function of gene 
products corresponding to genes identified herein can be assessed by blocking function of the gene 
products in the cell. For example, where the gene product is secreted or associated with a cell surface 
membrane, blocking antibodies can be generated and added to cells to examine the effect upon the cell 
phenotype in the context of, for example, the transformation of the cell to a cancerous, particularly a 
metastatic, phenotype. 

[00246] Where the gene product of the differentially expressed genes identified herein exhibits 
sequence homology to a protein of known function (e.g., to a specific kinase or protease) and/or to a 
protein family of known function (e.g., contains a domain or other consensus sequence present in a 
protease family or in a kinase family), then the role of the gene product in tumorigenesis, as well as the 
activity of the gene product, can be examined using small molecules that inhibit or enhance function of 
the corresponding protein or protein family. 
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[00247] Additional functional assays include, but are not necessarily limited to, those that analyze 
the effect of expression of the corresponding gene upon cell cycle and cell migration. Methods for 
performing such assays are well known in the art. 

Example 1 1 : Contig Assembly and Additional Gene Characterization 
[00248] The sequences of the polynucleotides provided in the present invention can be used to 
extend the sequence information of the gene to which the polynucleotides correspond (e.g., a gene, or 
mRNA encoded by the gene, having a sequence of the polynucleotide described herein). This expanded 
sequence information can in turn be used to further characterize the corresponding gene, which in turn 
provides additional information about the nature of the gene product (e.g., the normal function of the 
gene product). The additional information can serve to proyide additional evidence of the gene 
product's use as a therapeutic target, and provide further guidance as to the types of agents that can 
modulate its activity. 

[00249] For example, a contig can be assembled using the sequence of a polynucleotide described 
herein. A "contig" is a contiguous sequence of nucleotides that is assembled from nucleic acid 
sequences having overlapping (e.g., shared or substantially similar) sequence information. The 
sequences of publicly-available ESTs (Expressed Sequence Tags) and the sequences of various clones 
from several cDNA libraries synthesized at Chiron were used in the contig assembly. The contig is 
assembled using the software program Sequencher, version 4.05, according to the manufacturer's 
instructions. The resulting contig can then be used to search both the public databases as well as 
databases internal to the applicatns to matchthe polynucleotide contiged with homology data and/or 
differential gene expressed data. 

[00250] The sequence information obtained in the contig assembly described above can be used to 
obtain a consensus sequence derived from the contig using the Sequencher program. The consensus 
sequence can then be used as a query sequence in a BLASTN search of the DGTI DoubleTwist Gene 
Index (DoubleTwist, Inc., Oakland, CA), which contains all the EST and non-redundant sequence in 
public databases. Alternatively, a sequence of a polynucleotide described herein can be used directly as 
a query sequence in a BLASTN search of the DGTI DoubleTwist Gene Index. 
[00251] Through contig assembly and the use of homology searching software programs, the 
sequence information provided herein can be readily extended to confirm, or confirm a predicted, gene 
having the sequence of the polynucleotides described in the present invention. Further the information 
obtained can be used to identify the function of the gene product of the gene corresponding to the 
polynucleotides described herein. While not necessary to the practice of the invention, identification of 
the function of the corresponding gene, can provide guidance in the design of therapeutics that target 
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the gene to modulate its activity and modulate the cancerous phenotype (e.g., inhibit metastasis, 
proliferation, and the like). 

[00252] Although the foregoing invention has been described in some detail by way of illustration 
and example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill in 
the art in light of the teachings of this invention that certain changes and modifications may be made 
thereto without departing from the spirit or scope of the appended claims. Those skilled in the art will 
recognize, or be able to ascertain, using not more than routine experimentation, many equivalents to the 
specific embodiments of the invention described herein. Such specific embodiments and equivalents 
are intended to be encompassed by the following claims. 
Deposit Information . 

[00253] A deposit of the biological materials in the tables referenced below was made with the 
American Type Culture Collection, 10801 University Blvd., Manasas, VA 201 10-2209, under the 
provisions of the Budapest Treaty, on or before the filing date of the present application. The 
accession number indicated is assigned after successful viability testing, and the requisite fees were 
paid. Access to said cultures will be available during pendency of the patent application to one 
determined by the Commissioner to be entitled to such under 37 C.F.R. §1.14 and 35 U.S.C. §122. All 
restriction on availability of said cultures to the public will be irrevocably removed upon the granting 
of a patent based upon the application. Moreover, the designated deposits will be maintained for a 
period of thirty (30) years from the date of deposit, or for five (5) years after the last request for the 
deposit; or for the enforceable life of the U.S. patent, whichever is longer. Should a culture become 
nonviable or be inadvertently destroyed, or, in the case of plasmid-containing strains, lose its plasmid, 
it will be replaced with a viable culture(s) of the same taxonomic description. 

[00254] These deposits are provided merely as a convenience to those of skill in the art, and are not 
an admission that a deposit is required. A license may be required to make, use, or sell the deposited 
materials, and no such license is hereby granted. The deposit below was received by the ATCC on or 
before the filing date of the present application. 

Table 11. Cell Lines Deposited with ATCC 



Cell Line 


Deposit Date 


ATCC Accession No. 


CMCC Accession No. 


KM12L4-A 


March 19, 1998 


CRL- 12496 


11606 


Kml2C 


May 15, 1998 


CRL- 12533 


11611 


MDA-MB- 
231 


May 15, 1998 


CRL-12532 


10583 


MCF-7 


October 9, 1998 


CRL- 12584 


10377 



[00255] In addition, pools of selected clones, as well as libraries containing specific clones, were 
assigned an "ES" number (internal reference) and deposited with the ATCC. Table 13 below provides 
the ATCC Accession Nos. of the ES deposits, all of which were deposited on or before June 13, 2000. 
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Table 12: Pools of Clones and Libraries Deposited with ATCC on or before June 13, 2000. 



Library No. 


CMCC No. 


ATCC Accession No. 


Library No. 


CMCC No. 


ATCC Accession No. 


ES168 


5276 


PTA-2027 


ES 189 


5304 


PTA-2052 


ESI 69 


5277 


PTA-2028 


ES 190 


5305 


PTA-2053 


ESI 70 


5284 


PTA-2029 


ES 191 


5306 


PTA-2054 


ES171 


5285 


PTA-2030 


ES 192 


5307 


PTA-2055 


ES172 


5286 


PTA-2031 


ES 193 


5308 


PTA-2056 


ESI 73 


5287 


PTA-2032 


ES 194 


5309 


PTA-2057 


ESI 74 


5288 


PTA-2033 


ES 195 


5310 


PTA-2058 


ESI 75 


5289 


PTA-2034 


ES 196 


5311 


PTA-2059 


ESI 76 


5290 


PTA-2035 


ES 197 


5312 


PTA-2060 


ESI 77 


5291 


PTA-2036 


ES 198 


5313 


PTA-2061 


ESI 78 


5292 


PTA-2037 


ES 199 


5314 


PTA-2062 


ESI 79 


5293 


PTA-2038 


ES 200 


5315 


PTA-2048 


ESI 80 


5294 


PTA-2039 


ES 201 


5316 


PTA-2049 


ES181 


5295 


PTA-2040 


ES 202 


5317 


PTA-2063 


ESI 82 


5296 


PTA-2041 


ES 203 


5318 


PTA-2064 


ESI 83 


5297 


PTA-2042 


ES 204 


5319 


PTA-2065 


ESI 84 


5298 


PTA-2043 


ES 205 


5320 


PTA-2066 


ESI 85 


5299 


PTA-2044 


ES 206 


5321 


PTA-2067 


ES 186 


5301 


PTA-2045 


ES 207 


5322 


PTA-2068 


ES 187 


5302 


PTA-2046 


ES 208 


5253 


PTA-2050 


ES 188 


5303 


PTA-2047 


ES 209 


5324 


PTA-2051 



[00256] Table 13 (inserted before the claims) provides the clones in each of the above libraries. 
[00257] Retrieval of Individual Clones from Deposit of Pooled Clones . Where the ATCC deposit 
is composed of a pool of cDNA clones or a library of cDNA clones, the deposit was prepared by first 
transfecting each of the clones into separate bacterial cells. The clones in the pool or library were then 
deposited as a pool of equal mixtures in the composite deposit. Particular clones can be obtained from 
the composite deposit using methods well known in the art. For example, a bacterial cell containing a 
particular clone can be identified by isolating single colonies, and identifying colonies containing the 
specific clone through standard colony hybridization techniques, using an oligonucleotide probe or 
probes designed to specifically hybridize to a sequence of the clone insert (e.g., a probe based upon 
unmasked sequence of the encoded polynucleotide having the indicated SEQ ID NO). The probe 
should be designed to have a T m of approximately 80°C (assuming 2°C for each A or T and 4°C for 
each G or C). Positive colonies can then be picked, grown in culture, and the recombinant clone 
isolated. Alternatively, probes designed in this manner can be used to PCR to isolate a nucleic acid 
molecule from the pooled clones according to methods well known in the art, e.g. , by purifying the 
cDNA from the deposited culture pool, and using the probes in PCR reactions to produce an amplified 
product having the corresponding desired polynucleotide sequence. 
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Table 2 



SEQ 
ID NO 


V^J-^UO 1 CIV 


CCA 'MAN/TP 


T 
1 


CLONE ID 


LIBRARY 


i 
l 


498509 


2102.H02.gz43 275066 


.F 


M00063514B:E04 


GRRpz 


z 


£. A A AT"7 

644927 


2102.F08.gz43 275160 


F 


M00063493D:G07 


GRRpz 


*j 
3 


1 AO 1 

621081 


2102.G10.gz43 275193 


F 


M00063507B:B08 


GRRpz 


4 


558900 


2 1 02 . F 1 4 .gz43_2 75 25 6 


F 


M00063496B:F07 


GRRpz 


5 


464791 


2102.A16.gz43_275283 


F 


M00063157B:B09 


GRRpz 


6 


d C O 1 /IT 

558147 


2 102.B 1 8.gz43_2753 1 6 


F 


M00063457A:B12 


GRRpz 


7 


379154 


2102.124.gz43_275419 


F 


M00063601D:C05 


GRRpz 


o 
5 


643609 


2 1 03 .F0 1 .gz43_275432 


F 


M00063866B;H02 


GRRpz 


a 


377696 


2 1 03 .M06.gz432755 1 9 


F 


M00063943B:G12 


GRRpz 


10 


C C vl *^ A£T 

554395 


2 103. K10.gz43 275581 


F 


M00063927A:B08 


GRRpz 


11 


>i at* o 

402353 


2 1 03 .B 1 3 .gz43_275620 


F 


M00063803B:F11 


GRRpz 


12 


1 O AC 1 A 

380514 


2103.E17.gz43 275687 


F 


M00063864A:H10 


GRRpz 


13 


1 A A 

147279 


2 103 . J2 1 .gz43_275756 


F 


M00063923B:A04 


GRRpz 


1 A 

14 


884 


2128.K06.gz43_277507 


F 


M00028627B:F12 


MV-522 


1 c 

15 


188377 


2128.H15.gz43_277648 


F 


M00028359D:F09 


MV-522 


16 


155615 


2128.P17.gz43_277688 


F 


M00032476D:F07 


MV-522 


17 


125604 


2 1 28 .P 1 8 .gz43_277704 


F 


M00032477A:B02 


MV-522 


1 o 

18 


432159 


2128.A24.gz43_277785 


F 


M00028061D:D10 


MV-522 


i a 
19 


429735 


2130.N02.gz43_277847 


F 


M00032736A:B06 


MV-522 


OA 

20 


72838 


2 130.005 .gz43 277896 


F 


M00032745C:F03 


MV-522 


21 


436888 


^ 1 A "KTAA /I ^> ^****r\cr\ 

2130.N09.gz43_277959 


F 


M00032738D:G11 


MV-522 


22 


427571 


2130.H13.gz43_278017 


F 


M00032685B:C10 


MV-522 


23 


38494 


2130.H16.gz43_278065 


F 


M00032686C:D10 


MV-522 


24 


78607 


2130.H18.gz43_278097 


F 


M00032686D:G09 


MV-522 


25 


90192 


2130.J18.gz43_278099 


F 


M00032703D:E10 


MV-522 


26 


44615 


2 130.120.gz43_278 130 


F 


M00032695B:A01 


MV-522 


27 


376753 


2130.M21.gz43 278150 


F 


M00032732A:A03 


MV-522 


28 


6342 


2152.H10.gz43_278370 


F 


M00039109A:H09 


UCP-3 


or* 

29 


387530 


2152.D12.gz43_278398 


F 


M00039069D:E12 


UCP-3 


30 


36453 


2152.B14.gz43_278428 


F 


M00039052B:H03 


UCP-3 


31 


376044 


2 1 5 2 .A 1 8 .gz43_27849 1 


F 


M00039047C:A05 


UCP-3 


32 


375415 


2152.J18.gz43_278500 


F 


M00039121C:C06 


UCP-3 


33 


375706 


2153.F03.gz43_278756 


F 


M00039222C:G06 


UCP-3 


34 


379046 


2153J06.gz43_278808 


F 


M00039244C:F10 


UCP-3 


35 


390017 


2153.108.gz43_278839 


F 


M00039240A:H08 


UCP-3 


36 


377596 


2153.O08.gz43278845 


F 


M00039274C:D12 


UCP-3 


j / 




Z 1 J J . 1 1 1 .gz*f J z/ooo/ 


r 


XjfflAAim/l 1 A .Til 1 

M0U03y24 1 A:H1 1 


UCP-3 


38 


372952 


2153.K14.gz43_278937 


F 


M00039251A:G12 


UCP-3 


39 


377696 


2154.M04.gz43_279163 


F 


M00039404B:G11 


UCP-3 


40 


60037 


2154.G05.gz43_279173 


F 


M00039330B:H09 


UCP-3 


41 


390968 


2 154.11 8.gz43_279383 


F 


M00039346C:G08 


UCP-3 


42 


376044 


2154.G21.gz43_279429 


F 


M00039336A:C07 


UCP-3 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


43 


60037 


2155J03.gz43_279528 


F 


M00039632A:C01 


UCP-3 


44 


376839 


2155.N05.gz43_279564 


F 


M00039664B:H10 


UCP-3 


45 


373680 


2155.Pll.gz43_279662 


F 


M00039674B:G1 1 


UCP-3 


46 


373973 


2155.C12.gz43_279665 


F 


M00039457A:D07 


UCP-3 


47 


379805 


2155.N15.gz43_279724 


F 


M00039665C:B01 


UCP-3 


48 


376439 


2155.F16.gz43_279732 


F 


M00039479A:A10 


UCP-3 


49 


376994 


2155.116.gz43_279735 


F 


M00039619C:B01 


UCP-3 


50 


378420 


2155.B21.gz43_279808 


F 


M00039451B:F01 


UCP-3 


51 


375510 


2155.F21.gz43_279812 


F 


M00039479D:C06 


UCP-3 


52 


24210 


2155.P22.gz43_279838 


F 


M00039675C:C05 


UCP-3 


53 . 


397167 


2164J03.gz43_279912 


F 


M00039778D:D05 


UCP-3 


54 


380025 


2164.110.gz43_280023 


F 


M00039774A:E1 1 


UCP-3 


55 


216179 


2164.P10.gz43_280030 


F 


M00039823D:D09 


UCP-3 


56 


379046 


2164.B15.gz43_280096 


F 


M00039681B:F05 


UCP-3 


57 


379279 


2164.A24.gz43_280239 


F 


M00039679C:A02 


UCP-3 


58 


376074 


2164O24.gz43_280253 


F 


M00039820D:F08 


UCP-3 


59 


373905 


2165.H01.gz43_280262 


F 


M00039883D:G06 


UCP-3 


60 


1300 


2165K01.gz43_280265 


F 


M00039905B:F09 


UCP-3 


61 


393635 


2165.H06.gz43_280342 


F 


M00039885B:A10 


UCP-3 


62 


396969 


2165.P07.gz43_280366 


F 


M00039978D:C04 


UCP-3 


63 


398061 


2165.F09.gz43_280388 


F 


M00039871C:C01 


UCP-3 


64 


376808 


2165.110.gz43_280407 


F 


M00039893A:G12 


UCP-3 


65 


6342 


2166A06.gz43_281279 


F 


M00039986B.A11 


UCP-3 


66 


233814 


2166.K07.gz43_281305 


F 


M00040076B:D01 


UCP-3 


67 


24210 


2166N07.gz43_281308 


F 


M00040096D:C03 


UCP-3 


68 


24210 


2166.All.gz43_281359 


F 


M00039988B:C08 


UCP-3 


69 


377696 


2166.Jll.gz43_281368 


F 


M00040070C:D11 


UCP-3 


70 


398831 


2166.A23.gz43_281551 


F 


M00040001A:H02 


UCP-3 


71 


185432 


2166.P24.gz43_281582 


F 


M00040113C:H09 


UCP-3 


72 


505275 


2104.D01.gz43_297323 


F 


M00064082C:D11 


GRRpz 


73 


11379 


2104.N01.gz43_297333 


F 


M00064194B:A02 


GRRpz 


74 


446397 


2104.N03.gz43_297365 


F 


M00064195C:B02 


GRRpz 


75 


418763 


2104.P04.gz43_297383 


F 


M00064220B:E01 


GRRpz 


76 


469367 


2104.N05.gz43_297397 


F 


M00064196D:C10 


GRRpz 


77 


639578 


2104.C08.gz43_297434 


F 


M00064077B:H02 


GRRpz 


78 


376559 


2104E08.gz43_297436 


F 


M00064101B:E12 


GRRpz 


79 


649035 


2104.Dll.gz43_297483 


F 


M00064085B:A12 


GRRpz 


80 


647586 


2104.Hll.gz43_297487 


F 


M00064131B:A09 


GRRpz 


81 


646187 


2104.F12.gz43_297501 


F 


M00064112A:G03 


GRRpz 


82 


727888 


2104.N13.gz43_297525 


F 


M00064201A:C08 


GRRpz 


83 


558382 


2104.E17.gz43_297580 


F 


M00064104B:A01 


GRRpz 


84 


52644 


2104.C19.gz43_297610 


F 


M00064081A:D04 


GRRpz 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


85 


454622 


2104.120.gz43_297632 


F 


M00064147B:G08 


GRRpz 


86 


554032 


2104.A21.gz43_297640 


F 


M00064064D:D11 


GRRpz 


87 


463217 


2 104.E23 .gz43_297676 


F 


M00064107C:E03 


GRRpz 


88 


189073 


1521.H06.gz43_303496 


F 


M00027103D:B05 


MCF-7 


89 


188309 


1521.J06.gz43_303498 


F 


M00027219B:G12 


MCF-7 


90 


187525 


1521.N10.gz43_303566 


F 


M00027514C:F01 


MCF-7 


91 


215366 


1521.N12.gz43_303598 


F 


M00027517C:F08 


MCF-7 


92 


186594 


1521.H13.gz43_303608 


F 


M00027123D:F02 


MCF-7 


93 


218904 


1521.O13.gz43_303615 


F 


M00027586B:B03 


MCF-7 


94 


189993 


1521.C14.gz43_303619 


F 


M00023406A:G03 


MCF-7 


95 . 


222818 


1521.H14.gz43_303624 


F 


M00027126C:H05 


MCF-7 


96 


185056 


1521.O16.gz43_303663 


F 


M00027587C:F02 


MCF-7 


97 


186404 


1521.P20.gz43_303728 


F 


M00027694C:C11 


MCF-7 


98 


649744 


2116.106.gz43_306217 


F 


M00063580A:A07 


WOca 


99 


535955 


2116.P08.gz43_306256 


F 


M00063994B:D10 


WOca 


100 


375328 


2116J09.gz43_306266 


F 


M00063592B:E09 


WOca 


101 


449206 


2116.A13.gz43_306321 


F 


M00063165C:F10 


WOca 


102 


89082 


2116.E15.gz43_306357 


F 


M00063552B:B06 


WOca 


103 


730600 


2116.D21.gz43_306452 


F 


M00063546B:F01 


WOca 


104 


372621 


2116.O22.gz43_306479 


F 


M00063988A:C02 


WOca 


105 


416886 


2117.M03.gz43_306557 


F 


M00064394A:C02 


WOca 


106 


372621 


2117.D05.gz43_306580 


F 


M00064307C:E12 


WOca 


107 


644919 


2118.N03.gz43_306942 


F 


M00064601C:H06 


WOca 


108 


446397 


2118.A09.gz43_307025 


F 


M00064446D:C08 


WOca 


109 


177443 


2 1 1 8.M09.gz43_307037 


F 


M00064592D:F05 


WOca 


110 


730238 


2118.H24.gz43_307272 


F 


M00064534D:H04 


WOca 


111 


427907 


2131.A01.gz43_307885 


F 


M00032766A:A10 


MV-522 


112 


48238 


2131.M02.gz43_307913 


F 


M00032886A:D04 


MV-522 


113 


226324 


2131.B04.gz43_307934 


F 


M00032783A:H08 


MV-522 


114 


441801 


2131.E06.gz43_307969 


F 


M00032809B:E10 


MV-522 


115 


62016 


2131.Kll.gz43_308055 


F 


M00032872B:A02 


MV-522 


116 


48238 


2131.E12.gz43_308065 


F 


M00032811A:G10 


MV-522 


117 


34071 


2131.113.gz43_308085 


F 


M00032857A:B02 


MV-522 


118 


221686 


2131.B14.gz43_308094 


F 


M00032786A:H04 


MV-522 


119 


440284 


2131J16.gz43_308134 


F 


M00032865A:D11 


MV-522 


120 


12481 


2131.P18.gz43_308172 


F 


M00032914B:D09 


MV-522 


121 


20453 


2131.A19.gz43_308173 


F 


M00032779A:A04 


MV-522 


122 


26926 


213I.I19.gz43_308181 


F 


M00032858D:H11 


MV-522 


123 


37805 


2131.P19.gz43_308188 


F 


M00032915B:D01 


MV-522 


124 


441874 


2131.121.gz43_308213 


F 


M00032859C:E04 


MV-522 


125 


48238 


2131.A23.gz43_308237 


F 


M00032780A:B09 


MV-522 


126 


169458 


1513.O03.gz43_300292 


F 


M00022648C:D08 


MDA-MB-231 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


127 


145815 


1513.F09.gz43_300379 


F 


M00022135D:D06 


MDA-MB-23 1 


128 


158321 


1513.H09.gz43_300381 


F 


M00022255B:F12 


MDA-MB-23 1 


129 


168195 


1513.L10.gz43_300401 


F 


M00022537D:C05 


MDA-MB-23 1 


130 


101499 


1513.J13.gz43_300447 


F 


M00022445D:E12 


MDA-MB-23 1 


131 


142842 


1513.A14.gz43_300454 


F 


M00007960D:E09 


MDA-MB-23 1 


132 


153316 


1513.E14.gz43_300458 


F 


M00022072A:E12 


MDA-MB-23 1 


133 


142614 


1513.A15.gz43_300470 


F 


M00007963D.D03 


MDA-MB-23 1 


134 


99011 


1513.N16.gz43_300499 


F 


M00022622A:G01 


MDA-MB-23 1 


135 


171073 


1513.P18.gz43_300533 


F 


M00022710C:H03 


MDA-MB-23 1 


136 


120049 


1513.D19.gz43_300537 


F 


M00021925A:H07 


MDA-MB-23 1 


137 . 


446572 


1562.B01.gz43_207804 


F 


M00042546A:D03 


UC2-NormColon 


138 


464091 


1562.F01.gz43_207808 


F 


M00042552D:A1 1 


UC2-NormColon 


139 


408386 


1562J01.gz43_207812 


F 


M00042560B:A01 


UC2-NormColon 


140 


446829 


1562.L02.gz43_207830 


F 


M00042563A:F10 


UC2-NormColon 


141 


43338 


1562.P03.gz43_207850 


F 


M00042569B:G07 


UC2-NormColon 


142 


451780 


1562.F04.gz43_207856 


F 


M00042554A:C02 


UC2-NormColon 


143 


456462 


1562.J04.gz43_207860 


F 


M00042560B:G10 


UC2-NormColon 


144 


469511 


1562.G05.gz43_207873 


F 


M00042555D:G10 


UC2-NormColon 


145 


455075 


1562.H05.gz43_207874 


F 


M00042557A:D09 


UC2-NormColon 


146 


447346 


1562.P05.gz43_207882 


F 


M00042569C:B05 


UC2-NormColon 


147 


147196 


1562.M06.gz43_207895 


F 


M00042564D:F10 


UC2-NormColon 


148 


467262 


1562.F08.gz43_207920 


F 


M00042554C:E02 


UC2-NormColon 


149 


459536 


1562.L09.gz43_207942 


F 


M00042563C:D08 


UC2-NormColon 


150 


460190 


1562.P09.gz43_207946 


F 


M00042569D:D02 


UC2-NormColon 


151 


447597 


1562.F10.gz43_207952 


F 


M00042554C:F09 


UC2-NormColon 


152 


451737 


1562.G10.gz43_207953 


F 


M00042556A:G12 


UC2-NormColon 


153 


446614 


1562.L10.gz43_207958 


F 


M00042563C:E02 


UC2-NormColon 


154 


432159 


1562.M10.gz43_207959 


F 


M00042565A:G05 


UC2-NormColon 


155 


459523 


1562.Lll.gz43_207974 


F 


M00042563D:D02 


UC2-NormColon 


156 


460516 


1562.Pll.gz43_207978 


F 


M00042570A:E08 


UC2-NormColon 


157 


446674 


1562.D12.gz43_207982 


F 


M00042550A.D12 


UC2-NormColon 


158 


465594 


1562F12.gz43_207984 


F 


M00042554D:C08 


UC2-NormColon 


159 


466719 


1562.G12.gz43_207985 


F 


M00042556B:D12 


UC2-NormColon 


160 


463487 


1562.M12.gz43_207991 


F 


M00042565A:H03 


UC2-NormColon 


161 


446389 


1562.B13.gz43_207996 


F 


M00042546D:E06 


UC2-NonnColon 


162 


462149 


1562.C14.gz43_208013 


F 


M00042548B:G01 


UC2-NonnColon 


163 


447174 


1562.G14.gz43_208017 


F 


M00042556B:E10 


UC2-NormColon 


164 


457405 


1562.K14.gz43_208021 


F 


M00042562C:A07 


UC2-NormColon 


165 


460766 


1562O15.gz43_208041 


F 


M00042568C:E08 


UC2-NormColon 


166 


446703 


1562.N16.gz43_208056 


F 


M00042567C:E02 


UC2-NormColon 


167 


446981 


1562.D18.gz43_208078 


F 


M00042550C:H10 


UC2-NormColon 


168 


387077 


1562.H18.gz43_208082 


F 


M00042558A:D03 


UC2-NonnColon 
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# • 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


169 


460972 


1562.N18.gz43 208088 


F 


M00049^fi7P*Ffi7 


u i^z -in ormcolon 


170 


417078 


1562.P18.gz43 208090 


F 


M00049S7IYRF 1 1 




171 


461559 


1562.L19.gz43 208102 


F 


M00042^64A*Fm 


T TP"") XT^»^v»/^ A 1«.»* 

ULz -in ormc olon 


172 


464905 


1562.F20.gz43 208112 


F 


M00042555RRO7 


TTP9 \lArmpAl rt « 


173 


468330 


1562.H20.gz43 208114 


F 


M00042558AF1 1 


T TP 9 ~NJrit-tviP/Ort*-i 


174 


446620 


1562.J21.gz43 208132 


F 


M0004256 1 RF 1 9 


T TP9_tJr»rrYiP/-\1^« 

vJv^Z-iNOnnVyOlOn 


175 


457842 


1562.P21.gz43 208138 


F 


M00042570CB12 


T TP9-Mr»rrnPri1/vri 


176 


446531 


1562.B22.gz43 208140 


F 


M00042547BD11 

J- T M. vy vy Vy T / X^ » X X 


T TP9-r\Tr.rrnPriirm 


177 


446621 


1562.J24.gz43 208180 


F 


M00042561CE12 




178 


380127 


1562.N24.gz43 208184 


F 


M00042567DF 1 2 




179 . 


464275 


1563.D01.gz43 208190 


F 


M00042575DA1 1 


T TP 9 -T\Torrn Pnl r»n 


180 


456920 


1563.C04.gz43 208237 


F 


M00042574PA04 


uVyZ-iN orrnVyOion 


181 


466971 


1563.B05.gz43 208252 


F 


M00042571ADOS 


u v^z-in Oxxni^oion 


182 


142559 


1563.J05.gz43_208260 


F 


M00042698 A • A0 1 


u v>z -in onTiv>oion 


183 


457890 


1563.G08.gz43 208305 


F 


M0004269 1 TjRO^ 


uv^z -in orTni^ oion 


184 


453605 


1563.K08.gz43 208309 


F 


M00042699P FTOfi 


T TP7-MnrmPnl/M-i 
U V^Z-IN OilIlV_>010n 


185 


465339 


1563 .N08.gz43 2083 12 


F 


M0004270S A -R07 


TTP9 NArmfnlnn 


186 


461734 


1563.109.gz43_208323 


F 




T TP9-T\TnrmPrJr«i 

u v>z-in orrnv^oion 


187 


459961 


1563.D12.gz43 208366 


F 


IVlUV/ut/, J / V/U.Xyi X 


uv^z-iNonnv^oion 


188 


463951 


1563.F13.gz43 208384 


F 


M000425R0PAn^ 

lvXV/V/V/*T^»/0 V/\_y . AuJ 


u v>z -in oniiv^oion 


189 


468783 


1563.A14.gz43 208395 


F 


M00042S7 1 PFO^ 


TTP9 "WnrrnPnlnn 

u v>z-in orrnvyOion 


190 


447597 


1563.E14.gz43 208399 


F 


M00042578AF08 


T TP9-*Mr*rmPr»1r>ri 


191 


463368 


1563.114.gz43_208403 


F 


M00042695DHO 1 

j. ▼ -i. vy vy v i v/ .y _y j_y „ x XV/ X 


T TP9-r\TnrmPn1r»n 
u v>Zr _ i> uiuiL/Uiun 


192 


451383 


1563.C15.gz43 208413 


F 


M00042575AF02 

J.»AV/vyVy~Z»_/ / -/.TV. X_/Vy.y_ 


T TP9-T\TrirmPntnn 


193 


467293 


1563.G16.gz43 208433 


F 


M00042692AF1 1 

1TXV/ W^^VJ £*£\., X_i X X 


T TP9-T\TnrmPnlr\« 

u v/ z -in uxxiiV/Uiun 


194 


446922 


1563.016.gz43 208441 


F 


M00042707OA09 

i. ▼ x vy vy w Ti* / vy / . .zvVy Jy 


T TP 9 - TSlnrm P r»1 on 
\J V^^-1N UI IIIV^OIUII 


195 


446213 


1563.D17.gz43_208446 


F 


M000425 76D * AO 8 


uiy/."iN urui Vy ui on 


196 


460244 


1563.F19.gz43 208480 


F 


M00042691AD08 

i" Avy vy u vy -y lii.L/vu 


T TP9-"MnrmPn1nn 
\J -IN UI II1V>UIUI1 


197 


460789 


1563.A20.gz43 208491 


F 


M00042572BE05 


T TP 9 - tsJnrm Pnl nn 
%J V/Z, — IN Ui II1V/U1UI1 


198 


451382 


1563.C20.gz43 208493 


F 


M000425 75 B F02 


T TP9-lvIrirmPnlnn 
yj VyZ— in ui iiiVvUiuix 


199 


468109 


1563.K20.gz43 208501 


F 


M00042700CF1 1 

x ▼ x vy vy vy r Zw / v/v/\^ # x x x 


T TP9-l\TnrmPn1r»n 

A ^UllHVyUIUII 


200 


447326 


1563.L20.gz43_208502 


F 


M00042702B * B07 


T TP9-T\TnrmPo1nn 

\J V^^* 1 ti Ui lilVsUIUil 


201 


463896 


1563.B21.gz43_208508 


F 


M00042573DA10 

1 * M. vy vy V-/ t z^ # **y X/ t i xi v/ 


T IP9-TMnrmPn1on 

\J Vy/* 1 ^ UI 1 II Vy UIUIX 


202 


446839 


1563.D21.gz43 208510 


F 


M000425 76D • F0 1 

*»x v/ vy vy t / v/x*^ * X V/ X 


T TP9-iVjnrmPn1nn 
yj v> /- ii vji iiiv^v/iwii 


203 


446933 


1563.E22.gz43_208527 


F 


M00042579A:B05 


\J V/Z< i. ^1 KJ 1111 V-^ \J 1 Vy 1 1 


204 


447826 


1563.L22.gz43_208534 


F 


M00042702B:G07 


UC2-NormColon 


205 


446409 


1563.B23.gz43_208540 


F 


M00042574A:F05 


UC2-NormColon 


206 


453908 


1573.N01.gz43 208584 


F 


M00042895C:C10 


UC2-NormColon 


207 


447421 


1573D02.gz43_208590 


F 


M00042747D:C08 


UC2-NormColon 


208 


447141 


1573.P02.gz43_208602 


F 


M00042898A:H05 


UC2-NormColon 


209 


447645 


1573.E04.gz43_208623 


F 


M00042750D:E07 


UC2-NormColon 


210 


401426 


1573.K06.gz43_208661 


F 


M00042890D:G05 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


211 


466894 


1573.B07.gz43_208668 


F 


M00042744A:D11 


UC2-NormCoIon 


212 


447161 


1573.J08.gz43 208692 


F 


M00042889D:A01 


UC2 -Norm Pol on 


213 


633946 


1573.M08.gz43 208695 


F 


M00042894B:E05 


I JC 2 -Norm C ol on 

w V_/^. 1 ^1 Vyl lll\^Ulv_/ll 


214 


557974 


1573.L09.gz43_208710 


F 


M00042892C:E03 


IJC2-NormC l o1on 

1 1 \Ji lllV-^VJlUll 


215 


466920 


1573.D10.gz43 208718 


F 


M00042748D:D08 


UC2-Normro1on 

\-/V>a» i ^1 VJ1 1 IIV-'UIUII 


216 


642146 


1 573 .F 1 0.gz43_208720 


F 


M00042882C:F06 


UC2-NormCo1on 

\J\s<&0 L^KJl. lilV^VJIUll 


217 


464205 


1573.J10.gz43 208724 


F 


M00042889D.A12 


UC2-NormColon 


218 


641890 


1573.L10.gz43_208726 


F 


M00042892D:C04 


UC2-NormColon 


219 


650195 


1573.Hll.gz43_208738 


F 


M00042886C:F01 


UC2-NormColon 


220 


467293 


1573.F12.gz43_208752 


F 


M00042882C:G07 


UC2-NormColon 


221 . 


650756 


1573.N12.gz43_208760 


F 


M00042896A:D04 


UC2-NormColon 


222 


452611 


1573.N13.gz43_208776 


F 


M00042896A:E03 


UC2-NormColon 


223 


650944 


1573.N14.gz43 208792 


F 


M00042896A:F09 


UC2-NormColon 


224 


639372 


1573.E15.gz43_208799 


F 


M00042881C:C11 


UC2-NormColon 


225 


645690 


1573.F15.gz43_208800 


F 


M00042882D:C04 


UC2-NormColon 


226 


645470 


1573.H16.gz43_208818 


F 


M00042886D:E10 


UC2-NormColon 


227 


651029 


1573.M17.gz43_208839 


F 


M00042894D:G05 


UC2-NormColon 


228 


639849 


1573.F18.gz43_208848 


F 


M00042883A:F06 


UC2-NormCoIon 


229 


486238 


1573.K19.gz43_208869 


F 


M00042891C:H01 


UC2-NormColon 


230 


463060 


1573.G21.gz43_208897 


F 


M00042885A:G09 


UC2-NormColon 


231 


469150 


1573.C22.gz43_208909 


F 


M00042747A:G12 


UC2-NormColon 


232 


472101 


1574.00 l.gz43_208969 


F 


M00054797D:F01 


UC2-NormColon 


233 


641875 


1574.P02.gz43_208986 


F 


M00054799C:G11 


UC2-NormColon 


234 


465104 


1574.A03.gz43_208987 


F 


M00042900B:B10 


UC2-NormColon 


235 


470641 


1574.C03.gz43_208989 


F 


M00042904A:H10 


UC2-NormColon 


236 


451624 


1574.D04.gz43_209006 


F 


M00042906C.A10 


UC2-NormCoIon 


237 


639391 


1574.N04.gz43_209016 


F 


M00054796B:A01 


UC2-NormColon 


238 


470462 


1574.B05.gz43_209020 


F 


M00042902B:H01 


UC2-NormColon 


239 


482043 


1574.J05.gz43_209028 


F 


M00042915A:E06 


UC2-NormColon 


240 


641525 


1 5 74.N05 .gz43_209032 


F 


M00054796B:C08 


UC2-NormColon 


241 


642691 


1574.E06.gz43_209039 


F 


M00042908A:B01 


UC2-NormColon 


242 


466697 


1574.G06.gz43_209041 


F 


M00042910D:A02 


UC2-NormColon 


243 


649965 


1574.H06.gz43_209042 


F 


M00042912A:C01 


UC2-NormColon 


244 


651051 


1574.J06.gz43_209044 


F 


M00042915A:G10 


UC2-NormColon 


245 


450506 


1574.M06.gz43_209047 


F 


M00054794C:G11 


UC2-NormColon 


246 


453572 


1574.O06.gz43_209049 


F 


M00054798B:A01 


UC2-NormColon 


247 


447660 


1574.B07.gz43_209052 


F 


M00042902C:E11 


UC2-NormColon 


248 


447147 


1574.C07.gz43_209053 


F 


M00042905A:A07 


UC2-NormColon 


249 


647639 


1574.F07.gz43_209056 


F 


M00042909B:C04 


UC2-NormColon 


250 


649965 


1574.G08.gz43_209073 


F 


M00042910D:E11 


UC2-NormColon 


251 


639371 


1574.109.gz43_209091 


F 


M00042914A:B05 


UC2-NormCoIon 


252 


474298 


1 5 74M09.gz43_209095 


F 


M00054794D:D02 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


253 


646568 


1574.M10.gz43 209111 


F 


M00054794DD08 


T TP ? -Norm Pnl nn 


254 


639787 


1574.P10.gz43 209114 


F 


M00054800BC06 

\.y x.\j\j\j .j ~ o uv/ij . 


T TP^-TsInrmPrilnn 

u v^z."*!^ ui iiiv^uiuri 


255 


447750 


1574 All ez43 209115 


F 


M0004290 1 A *F 1 ? 


T Tr , 9-"WnrmPrt1nn 


256 


649810 


1574Gllez43 209121 


F 


M00047Q1 1 A* AO? 


T TP 9 -TsJrtrm C*rA nn 

*j v./^ ~iN oriii^oion 


257 


645924 


1574.Hll.gz43 209122 


F 


M00042912BF1 1 




258 


413767 


1574Lllez43 209126 


F 


M000^47Qir*Dl 1 


KJ \*s<£~TH UIIIIV^UIUII 


259 


641069 


1574 E12 ez43 209135 


F 


M00042908BA1 1 




260 


649900 


1574.G12.gz43 209137 


F 


M00042911AB02 


UC2-NormColon 


261 


560868 


1574.J12.gz43 209140 


F 


M00042915CE05 


I TC 2 -Norm Col on 


262 


513262 


1574.P12.gz43 209146 


F 


M00054800BC11 


UC2-NormColon 

\«y v>^. i i viniv>v/ivii 


263 - 


640306 


1574 L13 ez43 209158 


F 


M00054793DH1 1 


I JC2-NotmColon 


264 


389591 


1574.A14.gz43 209163 


F 


M00042901AH11 

i ▼ x \y v/ r *s \j li X « X X X X 


UC2-NormColon 

V-/ i. ^ V/A X&L^rifV^XxyXX 


265 


447815 


1574.C14.gz43 209165 


F 


M00042905CG08 


IJC2-NonnCo1on 


266 


95617 


1574.D14.gz43 209166 


F 


M00042907AB11 

x ™x v/ vy v/ i ^< -X vy # x x . x«/ x x 


UC2-NormColon 

\_y X l V/J. lllv VSXVyXX 


267 


465984 


1574.B15.gz43 209180 


F 


M00042903BC09 


I JC2-NormColon 

\y Vv£< X ^ KJX k X Ly^/\J IKJX X 


268 


447692 


1574.D15.gz43 209182 


F 


M00042907AF03 


I TP7-NormPo1on 


269 


641029 


1574.G15.gz43 209185 


F 


M0004291 1AD04 


UC2-NormColon 


270 


650527 


1574 J15 ez43 209188 


F 


M00042915DA11 


UC2-NormColon 

V_y V^^> 1 ^1 vyl 1 1 W Ivyi X 


271 


641315 


1574 015 ez43 209193 


F 


M00054798D- A 1 2 


T JC 2 -Norm C ol on 

\J\^f4Lr X \JX XkX\*S\SX\JXX 


272 


474298 


1574.P15.gz43 209194 


F 


M00054800BE08 


UC2-NormColon 


273 


645954 


1574 HI 6 ez43 209202 


F 


M00042912DH08 


I JC^-NormColon 

KJ\^£* \r\\J\ XXX\^/\JX\JXX 


274 


640181 


1574M16ez43 209207 


F 


M00054795AA08 

ItX vy vy vy ^y » / w*yv . xv Vy O 


I JC t 2-Normr , olon 

UV//' x^l \Jx xxl\^\Jx\JXl 


275 


650235 


1574 F17 ez43 209216 


F 


M00042909CF10 

i tx vy \y vy i vy ^* ■ x iu 


I JC2 -Norm Col on 

vy v^^> I ^ Ul 1 1 iv^VylVv'Xl 


276 


640356 


1574 H17 ez43 209218 


F 


M00042912DH10 


I JC2-NormColon 

\J V>^» X ^ VJ1 111V - VJlWll 


277 


649744 


1574 El 8 ez43 209231 

x ~s i r . x_y x ./jf ' ■ — ' <£. vy .y ^« — ' x 


F 


M00042908CA03 


UC2-NormColon 


278 


650564 


1574 F18 ez43 209232 


F 


M00042909DB1 1 


I JC2-NormColon 


279 


527355 


1574.G18.gz43 209233 


F 


M00042911AH12 

iTlV/WTfcy X X ^X i X X X 


UC2-NormColon 


280 


452989 


1574 HI 8 ez43 209234 

x r r » x x x w . x < r ^/ x^ v/ ^/ r 


F 


M000429 1 3 A- D09 


T TC7-N^ormColon 

KJ JL, X ^( KJX IXXX^KJXKJXX 


281 


648159 


1574K18ez43 209237 


F 


M00054792BA03 

iTivw vy »y r / *s 4* x«# . l vvy »7 


UC2-NormColon 

KJ\ m /£^ X ^1 Vyl 1 1 Iv^ VyX\»yl 1 


282 


714629 


1574 E20 ez43 209263 


F 


M00042908CD12 

iTiuuv/TAi -y vy vy . x/ x ^ 


UC2-NormColon 

KJ V/^ X t| Vyl lll^yVyAVyXl 


283 


467364 


1574.I20.gz43 209267 


F 


M00042914BH03 


I JC2 -Norm Col on 


284 


644789 


1574.N20.gz43 209272 


F 


M00054797C:F03 


UC2-NormColon 

\_/ X_/Xrf X ^VllllV^vAvll 


285 


646404 


1 574. J2 1 .gz43_209284 


F 


M00054791AG04 


UC2-NormColon 


286 


639740 


1574.H22.gz43_209298 


F 


M00042913B:E10 


UC2-NormColon 


287 


649852 


1 5 74 .122 .gz43_209299 


F 


M00042914D:B10 


UC2-NormColon 


288 


644376 


1574.G23.gz43_209313 


F 


M00042911B:H08 


UC2-NormColon 


289 


639901 


1574.P23.gz43_209322 


F 


M00054800D:D08 


UC2-NormColon 


290 


647086 


1574.E24.gz43_209327 


F 


M00042908D:G12 


UC2-NormColon 


291 


562247 


1574.G24.gz43_209329 


F 


M00042911C:D01 


UC2-NormColon 


292 


446974 


1574.J24.gz43_209332 


F 


M00054791B:C09 


UC2-NormCoIon 


293 


513248 


1575.A01.gz43_209339 


F 


M00054800D:F08 


UC2-NormColon 


294 


467989 


1575.M01.gz43_209351 


F 


M00055429B:B12 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


295 


650184 


1575.N01.gz43_209352 


F 


M00055430D:F04 


UC2-NormColon 


296 


649872 


1575.E02.gz43_209359 


F 


M00054916A:A05 


UC2-NormColon 


297 


649506 


1575G02.gz43_209361 


F 


M00054918A:D02 


UC2-NormColon 


298 


446254 


1 575 .M02.gz43_209367 


F 


M00055429B:E12 


UC2-NormColon 


299 


641542 


1575.O02.gz43_209369 


F 


M00055432B:H02 


UC2-NormColon 


300 


649933 


1575. 103 .gz43_209379 


F 


M00054920B:C04 


UC2-NormColon 


301 


639444 


1575.M03.gz43_209383 


F 


M00055429B:G04 


UC2-NormColon 


302 


553100 


1575.G04.gz43_209393 


F 


M00054918A:F09 


UC2-NormColon 


303 


549699 


1575.A05.gz43_209403 


F 


M00054911A:C08 


UC2-NormColon 


304 


639256 


1575.E05.gz43_209407 


F 


M00054916A:E05 


UC2-NormColon 


305 . 


644556 


1575.H05.gz43_209410 


F 


M00054919C:F06 


UC2-NormColon 


306 


500337 


1575.105.gz43_209411 


F 


M00054920B:C12 


UC2-NormColon 


307 


643924 


1575.K05.gz43_209413 


F 


M00055427A:F01 


UC2-NormColon 


308 


639394 


1575.E06.gz43_209423 


F 


M00054916A:F10 


UC2-NormColon 


309 


452986 


1575J06.gz43_209428 


F 


M00055426B:B02 


UC2-NormColon 


310 


654723 


1575.K06.gz43_209429 


F 


M00055427A:F02 


UC2-NormColon 


311 


645344 


1 575 .N07.gz43_209448 


F 


M00055431A:H05 


UC2-NormColon 


312 


645162 


1575.O07.gz43_209449 


F 


M00055432C:F01 


UC2-NormColon 


313 


589098 


1575.B08.gz43_209452 


F 


M00054912B:C05 


UC2-NormColon 


314 


643843 


1575.F08.gz43_209456 


F 


M00054917C:D03 


UC2-NormColon 


315 


647109 


1575P08.gz43_209466 


F 


M00055434A:A03 


UC2-NormColon 


316 


467381 


1575.D09.gz43_209470 


F 


M00054914D:G07 


UC2-NormColon 


317 


570812 


1575.N09.gz43_209480 


F 


M00055431B:A01 


UC2-NormColon 


318 


559776 


1575.A10.gz43_209483 


F 


M00054911A:G01 


UC2-NormColon 


319 


648532 


1575.F10.gz43_209488 


F 


M00054917C:F03 


UC2-NormColon 


320 


449861 


1575.G10.gz43_209489 


F 


•M00054918B:H01 


UC2-NormColon 


321 


643843 


1575.110.gz43_209491 


F 


M00054920C:A06 


UC2-NormColon 


322 


41141 


1575.K10.gz43_209493 


F 


M00055427B:E01 


UC2-NormColon 


323 


648664 


1575.Ell.gz43_209503 


F 


M00054916B:E02 


UC2-NormColon 


324 


640814 


1575.Kll.gz43_209509 


F 


M00055427B:F06 


UC2-NormColon 


325 


584071 


1575.Mll.gz43_209511 


F 


M00055429D:G07 


UC2-NormColon 


326 


450225 


1575.C12.gz43_209517 


F 


M00054913C:G03 


UC2-NormColon 


327 


452707 


1575.112.gz43_209523 


F 


M00054920C:D05 


UC2-NormColon 


328 


243722 


1575.L12.gz43_209526 


F 


M00055428C:G06 


UC2-NormColon 


329 


641057 


1575.B13.gz43_209532 


F 


M00054912C:C01 


UC2-NormColon 


330 


413767 


1575.D13.gz43_209534 


F 


M00054915A:G03 


UC2-NormColon 


331 


504568 


1575K13.gz43_209541 


F 


M00055427C:A06 


UC2-NormColon 


332 


639992 


1575.A15.gz43_209563 


F 


M0005491 1B-.E10 


UC2-NormColon 


333 


456923 


1575.G15.gz43_209569 


F 


M00054918D:C03 


UC2-NormColon 


334 


649555 


1575.115.gz43_209571 


F 


M00054920C:F02 


UC2-NormColon 


335 


649746 


1575.J15.gz43_209572 


F 


M00055426C:C10 


UC2-NormColon 


336 


644692 


1575.F16.gz43_209584 


F 


M00054917D:A12 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


1 CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


337 


452204 


1575.G16.gz43 209585 


F 


M00054918TVn 1 


UL^z-JNonnColon 


338 


61616 


1575.H16.gz43 209586 


F 


M 00054920 A * A 0 S 




339 


495143 


1575.M16.gz43 209591 


F 


M00055430RF08 


u z -in ormi^oion 


340 


644371 


1575.016.gz43 209593 


F 


M00055433AR07 


uv^z -in ormi_,oion 


341 


630269 


1575.K17.gz43 209605 


F 


M00055427PF 1 ? 


U Z -IN Ollll^OIOn 


342 


446932 


1575.0 17.gz43 209609 


F 


M00055433AR08 


u z -in onri^oion 


343 


639662 


1575.F18.gz43 209616 


F 


M000549 1 7D-D 1 7 


u\^z-iNoriTii^Oion 


344 


611927 


1575.L18.gz43 209622 


F 


M00055428DG12 


T ir t 7-T\TnrmPn1nn 


345 


641925 


1575.018.gz43 209625 


F 


M00055433AC02 


f TP 7 -IVTnrm Prtlnri 

u v^z - x>< uniiv^oion 


346 


651020 


1575.P18.gz43 209626 


F 


M00055434PR 1 1 


U V^Z -IN OillllsOlOn 


347. 


639255 


1575.F19.gz43 209632 


F 


M000549 1 7D-F0S 


T TP 7 - TSJrtrm Prtlrtn 
\J v^Z -IN OITTll^OIOn 


348 


468222 


1575.G19.gz43 209633 


F 


M000549 1 8DH09 

iTi\/v/\/^'~y X. uly .J. X Vy y 


T TP 7 JSJnrm P^li^n 


349 


468783 


1575.K19.gz43 209637 


F 


M00055427PH1 1 


T TP7-XTr*rmPrt1rt*-t 

ulz -in oirni^oion 


350 


417130 


1575.B20.gz43 209644 


F 


M000549 17DG01 


T TP 7 JWnrm P nl r\r\ 

u \-^z -in orniv^ oion 


351 


448606 


1575.B21.gz43 209660 


F 


M000549 1 7DO04 


T TP7_T\Ir»rmPr»1^n 


352 


456420 


1575.K21.gz43 209669 


F 


M00055427DF05 


T TP7-7SJnrmPrJn« 


353 


640369 


1575.P21.gz43 209674 


F 


M00055434DR06 


T TP7-MnrmPo1r*n 


354 


473854 


1575.E22.gz43 209679 


F 


M00054917AF07 


T TP7-MnrmPn1n« 


355 


564440 


1575.H22.gz43 209682 


F 


M00054920AP 1 1 


T TP7-7>Jr»rmPr*1r*ri 


356 


639520 


1575.L22.gz43 209686 


F 


M00055429AHO4 


T TP7-T s JriTTnPr*1nn 


357 


649195 


1575.022.gz43 209689 


F 


M00055433AF04 


T TP7_T\JormP^lr»« 

u v^z -in uiinv^oion 


358 


57183 


1575P22.gz43_209690 


F 


M00055434DE09 


T TP7-TMr>rmPri1nn 


359 


648609 


1575.E23.gz43 209695 


F 


M00054917BA05 


T TP7-TVTnrmPri1riri 


360 


648532 


1575.F23.gz43 209696 


F 


M00054917DH02 


T TP7-T\TnrmPo1r»n 
u \_/Z -in ui x e i v^uiun 


361 


642073 


1575.N23.gz43 209704 


F 


M00055432BB04 


T TP7-r\TortnPr\lnn 


362 


377855 


1575.B24.gz43 209708 


F 


M00054913AB12 


T TP7->Jr*rmPo1n« 


363 


452845 


1575.G24.gz43 209713 


F 


M000549 1 9AH04 


T TP7-MnrmPn1nn 
vJ v>Z IN UI IllV^UIUIl 


364 


638758 


1 575 . 024.gz43_20972 1 


F 


M00055433CA11 


T TP7-TSJnrmPn1nn 


365 


639829 


1576.A04.gz43 209771 


F 


M00055435B:C09 


T TP7-TvJnrmPn1rin 

w V^Z**lN UI HIv_^UIUIl 


366 


484145 


1576I05.gz43_209795 


F 


M0005 5 446B • A 1 2 


T IP7-r\TnrmPr*lnn 

\-/ V-'Z. 1NUI IIlv_^UIUIl 


367 


640055 


1576.A08.gz43_209835 


F 


M00055435CE12 


T TP7-T\TnrmPnlrin 
UV/^, 1> UI IIlv^UIUll 


368 


639928 


1576I08.gz43_209843 


F 


M00055446B:D08 


T JP 2 -Norm Pnlnn 

KJKs£* INUllllV^UIUII 


369 


526606 


1576.L08.gz43_209846 


F 


M00055450B:G07 


T JP 7 -Nnrm Pnl nn 


370 


653616 


1576.E09.gz43 209855 


F 


M00055440D:D02 


T TP 2-Norm Pnl on 


371 


624440 


1576.M12.gz43_209911 


F 


M00055451C:B08 


lJP2-NormPn1nn 


372 


560791 


1576.O12.gz43_209913 


F 


M00055453D:E12 


UC2-NormColon 


373 


639287 


1576.A13.gz43_209915 


F 


M00055435D:G11 


UC2-NormColon 


374 


475184 


1576.M14.gz43_209943 


F 


M00055451CE10 


UC2-NormColon 


375 


626061 


1576.A15.gz43 209947 


F 


M00055436A:E04 


UC2-NormColon 


376 


640734 


1576.E16.gz43_209967 


F 


M00055441A:G07 


UC2-NormColon 


377 


466092 


1576.B17.gz43_209980 


F 


M00055437D:B06 


UC2-NormColon 


378 


639711 


1576.A19.gz43_2 10011 


F 


M00055436B:B06 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


379 


640522 


1576.C21.gz43 210045 


F 


MOOOS S43QR-Rn7 


u L/Z-iN ormColon 


380 


651038 


1585.I01.gz43 210211 


F 


Moons S4fis a -en* 


U\^z-JNorrnColon 


381 


466440 


1585.B02.gz43 210220 


F 


M000'55456r'- A 1 n 


TIP'? XTrx*wi/^^t«« 


382 


648123 


1585.C02.gz43 210221 


F 


M0005545RR / Rft4 


u i^z -in ormcoion 


383 


646318 


1585.K02.gz43 210229 


F 


M00055468A- Aft5 


TTPl Mfirmr/xIrtM 

u v_.z -in ornw^oion 


384 


598589 


1585.E03.gz43 210239 


F 


M00055460F**On6 


ui^z -in orm oion 


385 


447863 


1585.F03.gz43 210240 


F 


M0005546 1 AFT03 


TTP9 Mnrmrnlnn 


386 


562236 


1585.P05.gz43 210282 


F 


M00055494C G 1 0 


u v_/Z -in uiiiiv^uiuri 


387 


559369 


1585.A06.gz43 210283 


F 


M00055456A-B03 


T Tr*7-T\TnrmPn1on 


388 


484126 


1585.B06.gz43 210284 


F 


M000554 5 6P *G04 


T TP7-T\rrirmPr\1r»n 

uv^z-iNoiiiiv^oion 


389 _ 


558839 


1585.P06.gz43 210298 


F 


M00055494PG1 1 


T TP7-T\TnrTnPr*1nn 

u \_/Z-in oi iiiv^oion 


390 


555820 


1585.J07.gz43 210308 


F 


M00055467AA07 


vJ l^Z - IN UlXIlWOlOIl 


391 


544461 


1585.H09.gz43_210338 


F 


M00055463DGO 1 

±w a. \j \j vy vy _y . \j \j x 


T TP 7 - >Jr»rm Pnl nn 

vj v_^z~in uiiiiv>oion 


392 


642478 


1585.A10.gz43 210347 


F 


M00055456AF01 

l » J. v/ vy \y .J ^ \Ji\^. X Vy L 


T TP7-T\TrirT«r i nlr\n 
U^Z-lNOlIIll^OlOn 


393 


643843 


1585.Cll.gz43 210365 


F 


M00055458P*ofl7 


u i^z-in onrii^oion. 


394 


640400 


1585.G12.gz43 210385 


F 


M00055467PA 1 1 


u v^z-in onrii^oion 


395 


645201 


1 5 85 . L 1 2.gz43_2 1 03 90 


F 


M0005 5 469P • FOQ 


u v^z-iNonii^/Oion. 


396 


640792 


1585.F13.gz43 210400 


F 


M0005 546 1 PF05 


u V-/Z -in ouuv^oion 


397 


640913 


1585.H13.gz43 210402 


F 


M00055464RF06 

i. V A \y Vy V/ »y VI *T XJ » Vy VI 


T TP7-T\Tr*rmPrtlnn 


398 


639607 


1585.I13.gz43 210403 


F 


M00055465n-F 1 7 

IVAV/V/V/^/ JTUJL/.r 1 ^> 


T TP9-r\TnrrriPr*lrkTi 

u v^z-in oiiiiv>oion 


399 


639932 


1585.B14.gz43 210412 


F 


M0005 54560004 

ivivuv/J J'tJUL/.\Jv/ 4 t 


u v^z -in oriTi^oion 


400 


640662 


1585.D14.gz43 210414 


F 


M00055460AD01 


T TP7-NTnrmPr*1rin 


401 


598589 


1585.G14.gz43_210417 


F 


M00055462PP03 


T TP7-ATnrmPnin« 

u v^z -in uimv^oion 


402 


556654 


1585.H14.gz43 210418 


F 


M00055464RF 1 1 

i ▼ x\y vy vy -J ^y vy^.i_/ . i - J. x 


T TP7-Mr*rmPrJrtn 


403 


555193 


1585.014.gz43 210425 


F 


M00055493DR07 

i »i. vy vy vy ^y _y ~ -7 «y l^/ . .L/Vy / 


T TP7-T\TnrmPrtlr*n 

u v^z -in uiiiiv^oiori 


404 


641066 


1585.E15.gz43 210431 


F 


M0005 5460DFlfl6 

ivxvy vy vy *j xj . uw Vy 


TTP9 XTrtTTTiPrtlrtri 

u v^z-in orrni^oion 


405 


664711 


1585.H15.gz43_210434 


F 


M00055464BG03 


T TP7-r\TnrmPn1nn 


406 


549611 


1585.I15.gz43 210435 


F 


M00055466A'C05 


T TP7-T\TnrmPnlnn 
Uv/. li vJ* X 1 vJlUXl 


407 


639726 


1585.K16.gz43_2 10453 


F 


M00055468CB07 


T TP7-T\Tr*rmPntr»ri 
Uv>Z"l> VJI IIIV_>U10I1 


408 


607422 


1585.F17.gz43 210464 


F 


M0005 546 1 D P09 


T TP 7 - Worm Pnl r»n 
vJ v^Z-iN UI Illv^vylUIl 


409 


584745 


1585.M17.gz43 210471 


F 


M0005549 1AH0 1 

■t»AV/vy vy^y »/"T ^ J. vrA. .X X\J x 


T TP7-TvJnrmPrt1nti 
UvvZ li UI IIIV_/U1UI1 


410 


661194 


1585.H19.gz43 210498 


F 


M00055464DA04 


T TP7-Nnrm Pnlnn 

VJ V_/Z, IN UI III v^UlUIl 


411 


559549 


1585.J20.gz43_210516 


F 


M00055467D:A01 


T TP 7 -Norm Pol nn 


412 


641645 


1585.A22.gz43 210539 


F 


M00055456B:G08 


T TP7-NnrmPr»1nn 


413 


641467 


1585.C22.gz43 210541 


F 


M00055459B:A02 


T TP7-T\InrmPnlnn 


414 


640368 


1585.F22.gz43 210544 


F 


M00055462A:A09 


UC2-NormColon 


415 


412416 


1585.G22.gz43_210545 


F 


M00055462D:H12 


UC2-NonnColon 


416 


650914 


1585.H22.gz43_210546 


F 


M00055464D:F08 


UC2-NormColon 


417 


3 


1585.J22.gz43 210548 


F 


M00055467D:C10 


UC2-NormCoIon 


418 


646318 


1585.N22.gz43_210552 


F 


M00055492C:H07 


UC2-NormColon 


419 


237288 


1585.P22.gz43_210554 


F 


M00055495CF03 


UC2-NormColon 


420 


650605 


1585.B23.gz43 210556 


F 


M00055457D:F09 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


» CLUSTER 


SEQ NAME 


ORIEN 
T 


f 

CLONE ID 


T.TRR ARY 


421 


643991 


1585.E23.gz43 210559 


F 


1V1UUU J JHO 1 A. rtUO 


U Cz -N ormCoion 


422 


398061 


1585.G23.gz43 210561 


F 


Monns^dft^ a- a 1 1 


T TOO \T«— -r 1 -!*— 

UCz-NormCoIon 


423 


505933 


1585.J24.gz43 210580 


F 


1VX\J\J\J J JtU / Lt.xjyJo 


TTOO IS T ^ .-,..n. L i 

U Cz -N ormCoIon 


424 


640068 


1585.K24.gz43 210581 


F 


MflflO^ <,d&Q A ■Tif\9 

LVL\J\J\JJ UVJo 


U Cz -N ormCoIon 


425 


474821 


1585.N24.gz43 210584 


F 


1'lUuU J *J*T s £.U . DUO 


TTOO M/Mwr 1 .,!*- 

u *^z -IN ormCoIon 


426 


234606 


1587.C01.gz43 211361 


F 




T TOO XT rt «— »%0«1 — 

ucz-jn ormCoIon 


427 


597780 


1587.E01.gz43 211363 


F 




ulz -in onuL, olon 


428 


492483 


1587.B02.gz43 211376 


F 


M00OS55 1 7CFT07 


u ^ z -in orm^ oion 


429 


644051 


1587.C03.gz43 211393 


F 


M000555 1 9 A HOI 


Ul^Z-lNOlTnl^OlOn 


430 


639056 


1587.E03.gz43 211395 


F 


M00055521CC08 


u^z-inoiitii^oioii 


431 - 


643723 


1587.L03.gz43 211402 


F 




uv^z-iNorni^oion 


432 


639459 


1587.N03.gz43 211404 


F 


M0005 5 ^ 3 9 C CwftR 


UvZ -IN orini^ oion 


433 


639826 


1587.P03.gz43 211406 


F 


M00055535BA1 1 


uv/Z -in oiiiii^oion 


434 


552783 


1587.A04.gz43 211407 


F 


M00055516BF1 1 


u V-/Z-1N orrni^oion. 


435 


590218 


1587.D04.gz43 211410 


F 


±vX\J\J\J*s *J ^J£.\JIJ ,1J X X 


u v_^z -in orrn^oion 


436 


559324 


1587.E04.gz43 211411 


F 


M0005552 1 rno? 


T TO 7 -IsJr^rm O r*1 r\rt 

uv^z-in oiiiii^oion 


437 


646711 


1587.L04.gz43 211418 


F 


M00055529DD1 1 


T TP7_XTnrr«0^i1r»ri 

uv^z-iNoiiiiv^oiori 


438 


639132 


1587.M04.gz43 211419 


F 


M00O55531BD1O 


u v^z -in uiiiiv^oiori 


439 


207552 


1587.N04.gz43 211420 


F 


M00055532DA12 




440 


116869 


1587.O04.gz43 211421 


F 


M00055531DO02 


vj\^z-iNoiini^oion 


441 


481220 


1587.D05.gz43 211426 


F 


M0005SS90R-Fn4 


T TOO XTrtrw*Ort1rt« 

iJv^z-iNorni^Oion 


442 


645544 


1587.K05.gz43 211433 


F 




T TOO XfArfYifrtl/Nn 

u i^z-in onni^oion 


443 


650617 


1587.M05.gz43 211435 


F 


M0005553 1 RFOS 


T TOO MnimrrtlATi 
U VxZ -IN OITTll^OlOn 


444 


194095 


15 87.005 .gz43 211437 


F 


M00055534AF06 


kj z -in orrn^oion 


445 


446984 


1587.B06.gz43 211440 


F 


M00055S 1 7D D00 

iTlvvv J J J 1 / X_/.X-/vv~ 


T TO 0 -"Wnrm Cr^\r\n 

vji^z -in orrn v^oion 


446 


290226 


1587.B07.gz43 211456 


F 




T TO 9 .Mrtrm Or\1 

ulz -in oirn^ oion 


447 


634409 


1587.P07.gz43 211470 


F 




T TOO Mr»rmr^1rv« 

u -in ormi^oion 


448 


640419 


1587.G08.gz43 211477 


F 


M00055524BB08 


u wz-in oiiiiv^oion 


449 


649149 


1587.L09.gz43 211498 


F 




T TOO "Mrtn-MOi-*1rt*» 

u ^z-in ormL^oiOxi 


450 


640072 


1587.P10.gz43_211518 


F 


iTlV/V/Uv'J J J JVv.Xiv/O 


T TOO XT/M-mPrtlnri 

uv^z-iNorrn^oion 


451 


641615 


1587.Jll.gz43 211528 


F 


1 VX V/ V/ *J *s O .t\ . V7 O 


T TO 0 -MArm 0/-n 
U v^Z -IN OiTn^OlOil 


452 


639213 


1587.M13.gz43 211563 


F 


1X40005553 1 DF06 


TTOO MnrmO/^l/M^ 


453 


641216 


1587.N15.gz43 211596 


F 


M00055533B-B1 1 


vj v^z-in oirni^oioii 


454 


650161 


1587.A16.gz43 211599 


F 


M00055517AD09 


T TP 9 -Nnrrn O rJ r*n 
U V/Z. n UIIIlV^OIOIl 


455 


558785 


1587.B23.gz43 211712 


F 


M00055519AC01 


T l"P9-T\TnrmPr»1nn 


456 


642631 


1588.B02.gz43 211780 


F 


M00055538C:E04 


UC2-NormColon 


457 


639543 


1588.F02.gz43 211784 


F 


M00055543D:H03 


UC2-NormColon 


458 


473701 


1588.F03.gz43_211800 


F 


M00055544A:A07 


UC2-NormColon 


459 


562274 


1588.G03.gz43_211801 


F 


M00055545B:A04 


UC2-NormColon 


460 


643277 


1588.D05.gz43 211830 


F 


M00055541C:D02 


UC2-NormColon 


461 


556867 


1588.G05.gz43 211833 


F 


M00055545B:C01 


UC2-NormColon 


462 


646350 


1588.P05.gz43 211842 


F 


M00055556A:A04 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


463 


649335 


1588.B06.gz43 211844 


F 


M00055538D:D12 


Ur2-Normrn1nn 


464 


646159 


1588.J06.gz43 211852 


F 


M00055548B:C03 




465 


649099 


1588.P06.gz43 211858 


F 


M00055556AE07 

l»xv/ vy vy «y *y -*y *y vyx X • 1 , y \J / 


i ivji ii i vjiwxi 


466 


32812 


1588.H09.gz43 211898 


F 


M00055546C:F11 


UC2-NormColon 

KJ^S^r 1 lV/llllV^VJlV/11 


467 


639480 


1588.J09.gz43 211900 


F 


M00055548B:G06 


UC2-NormCoIon 


468 


644242 


1588.N09.gz43 211904 


F 


M00055553B:H04 


UC2-NormColon 

VV/w X 1 V/l lllvvv/lv/ll 


469 


644723 


1588.Nll.gz43 211936 


F 


M00055553C:D06 


UC2-NormCoIon 

V Vy** X IvA 4il\y vlvll 


470 


648748 


1588.F13.gz43 211960 


F 


M00055544C:A01 


UC2-NormColon 


471 


639099 


1588.G13.gz43 211961 


F 


M00055545C:D09 


UC2-NormColon 


472 


562414 


1588 F15 ez43 211992 

X U VJ • X J. » fjf * 1 mmJ ** X X _X y 


F 


M00055544CB07 

J. tx vy vy vy »y ^y I r w • x-#vr / 


UC2-NormColon 

V_/ X ^ vyx llivyv/iV/ii 


473 - 


641078 


1588N15ez43 212000 


F 


M00055553CH12 


UC2-NormColon 


474 


505042 


1588.G16.gz43 212009 


F 


M00055545D:B06 


UC2-NormColon 


475 


640747 


1588 N16 ez43 212016 

x —J w kj ii 1 i vy a lb**** x *# v/ x v/ 


F 


M00055553D:C07 


UC2-NormColon 


476 


453606 


1588 D17ez43 212022 


F 


M00055542BB11 

i?i v v/\7*y iy iy i bXy »x/ i x 


UC2-NormColon 

vy vy** X l wi lit v/Viv/ii 


477 


419706 


1588 A18ez43 212035 


F 


M00055538AC05 

A tx vy vy vy ^/ *^y *y vy z x * vy^y 


UC2-NormColon 

Vy vy** 1 1 vl l*i\y V-iv/xi 


478 


557983 


1588.H21.gz43 212090 


F 


M00055547B:C05 


UC2-NormColon 


479 


639114 


1588.D22.gz43 212102 


F 


M00055542C:D07 


UC2-NormColon 


480 


44253 1 


1588 J22 ez43 212108 

X »V VJ VJ * •# jw^rf ' ™* X- ** X V Vy 


F 


M00055549A:B04 


UC2-NormColon 


481 


649668 

vy r y w vj 


1597 N02 gz43 212176 

x »y y # * x i v ** * f7^f,t t ** x ** x * vy 


F 


M00055574D:E02 


UC2-NormColon 


482 


639781 


1597.D05.gz43 212214 


F 


M00055561C:C07 


UC2-NormColon 


483 


639056 


1597.E05.gz43 212215 


F 


M00055562D:B06 


UC2-NormColon 


484 


639698 


1597.P05.gz43 212226 


F 


M00055577C:G02 


UC2 -NormColon 


485 


46973 1 


1597.B06.gz43 212228 


F 


M00055559A:D06 


UC2-NormColon 


486 


639593 


1597.G06.gz43 212233 


F 


M00055566A:E02 


UC2-NormColon 


487 


557975 


1597J06ez43 212236 

x »_y y # » *f vy vy . >S* * * *^ ™* ^ 


F 


M00055569D:C02 


UC2-NormColon 


488 


640356 


1597 E08 sz43 212263 

x y * * x-/ v/ vy 4 f_i* ** ^* x ^taT vy «y 


F 


M00055563AA02 


UC2-NormColon 


489 


64743 1 


1597.E09.gz43 212279 


F 


M00055563A:C03 


UC2-NormColon 


490 


31112 


1597 CIO az43 212293 


F 


M00055560DC03 

j. ▼ x vy vy vy *y ^y «*y v/v/xy • v/ vy «y 


UC2-NormColon 


491 


454825 


1597F10ez43 212296 


F 


M00055565AC08 


UC2-NormColon 

Vy vy** 1 l viiiivv vivii 


492 


555021 

^y my »y vy a* x 


1597B12ez43 212324 

x *_y y / ♦ xy x ** * i| ■ ■ ^ »y x a* *y a* r 


F 


M00055559CD07 

iTlv w*/*/»/ */ y vy iXy v / 


UC2-NormColon 


493 


650740 

Vy \J I I Vy 


1597 112 ez43 212331 


F 


M00055569A:D01 


UC2 -NormColon 


494 


659031 


1597.J12.gz43 212332 


F 


M00055570B:F07 


UC2-NormColon 


495 


592122 


1597D14ez43 212358 

X *y y^ / * xy x r • r> € * i »/ ** x ** *y vj 


F 


M00055562BD03 

x tx vy vy vy *_y «-y vy x^y . x-y vy **j 


UC2-NormColon 


496 


45921 


1597 N14 ez43 212368 

x y » • x t x r « fji < i ^y ** x a# *y vy v# 


F 


M00055575CB04 

i tx vy vy vy w» ^ * w • x^ vy r 


UC2-NormColon 


497 


556511 


1597.H15.gz43 212378 


F 


M00055568A:B04 


UC2-NormColon 


498 


637966 


1597.A16.gz43_212387 


F 


M00055558B:G11 


UC2-NormColon 


499 


553318 


1597.F16.gz43_212392 


F 


M00055565A:F11 


UC2-NormColon 


500 


23961 


1597.C17.gz43_212405 


F 


M00055561A:F07 


UC2-NormColon 


501 


650235 


1597.M17.gz43_212415 


F 


M00055574B:F06 


UC2-NormColon 


502 


596882 


1597.F18.gz43_212424 


F 


M00055565B-.F04 


UC2-NormColon 


503 


644928 


1597.118.gz43_212427 


F 


M00055569A:F06 


UC2-NormColon 


504 


589483 


1597.L18.gz43_212430 


F 


M00055573B:D11 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


[ 

CLONE ID 


LIBRARY 


505 


640956 


1597.N18.gz43 212432 


F 


M00055575D:G04 


UC2 -NormColon 


506 


553602 


1597.L19.gz43 212446 


F 


M00055573B:E01 


UC2-NormColon 


507 


640851 


1597.O20.gz43 212465 


F 


M00055577A:G09 


UC2-NormCoIon 


508 


460666 


1597.A21.gz43 212467 


F 


M00055558D.C08 


UC2-NormColon 


509 


640195 


1597.D22.gz43 212486 


F 


M00055562C:F02 


UC2-NormColon 


510 


668852 


1597.P22.gz43 212498 


F 


M00055578C:F11 


UC2-NormColon 


511 


645913 


1597.A23.gz43 212499 


F 


M00055558D:D07 


UC2-NormColon 


512 


446230 


1597.G23.gz43 212505 


F 


M00055567A:A06 


UC2-NormColon 


513 


649349 


1597.N23.gz43 212512 


F 


M00055576A.F07 


UC2-NormColon 


514 


645538 


1597.C24.gz43 212517 


F 


M00055561B:G12 


UC2-NormColon 


515 


647069 


1597.G24.gz43 212521 


F 


M00055567A:A11 


UC2 -NormColon 


516 


638869 


1597.H24.gz43 212522 


F 


M00055568C:F07 


UC2 -NormColon 


517 


556 


1597.I24.gz43 212523 


F 


M00055569B:G10 


UC2-NormColon 


518 


648483 


1597.024.gz43_2 12529 


F 


M00055577B:F05 


UC2-NormColon 


519 


447272 


1598.M01.gz43_212543 


F 


M00055595C:F12 


UC2-NormColon 


520 


640025 


1598.B02.gz43 212548 


F 


M00055581A:C02 


UC2-NormColon 


521 


415538 


1598.E02.gz43 212551 


F 


M00055585B:F01 


UC2-NormColon 


522 


478229 


1598.M02.gz43 212559 


F 


M00055595C:G03 


UC2-NormColon 


523 


644572 


1598.H03.gz43 212570 


F 


M00055589B:E08 


UC2 -NormColon 


524 


648580 


1598.B04.gz43 212580 


F 


M00055581C:A01 


UC2-NormColon 


525 


470769 


1598.L04.gz43 212590 


F 


M00055594C:B03 


UC 2 -NormColon 


526 


554273 


1598.M04.gz43 212591 


F 


M00055595D:C04 


UC2-NormColon 


527 


641262 


1598.C05.gz43 212597 


F 


M00055583A:A05 


UC2-NormColon 


528 


645707 


1598.F05.gz43 212600 


F 


M00055586D.G07 


UC2-NormColon 


529 


463028 


1598.N05.gz43_212608 


F 


M00055597B:B04 


UC2-NormColon 


530 


640695 


1598.P05.gz43 212610 


F 


M00055601B:D12 


UC2-NormColon 


531 


641925 


1598.D06.gz43 212614 


F 


M00055584A:G11 


UC2-NormColon 


532 


449247 


1598.E06.gz43 212615 


F 


M00055585C:F05 


UC2-NormColon 


533 


646590 


1598.L06.gz43 212622 


F 


M00055594C:F11 


UC2-NormColon 


534 


641191 


1598.A07.gz43 212627 


F 


M00055579C.D04 


UC2-NormColon 


535 


641191 


1598.P07.gz43 212642 


F 


M00055601B:H02 


UC2-NormColon 


536 


641838 


1598.K09.gz43 212669 


F 


M00055593A:F08 


UC2-NormColon 


537 


641440 


1598.N09.gz43 212672 


F 


M00055597D:B05 


UC2-NormColon 


538 


640826 


1598.M10.gz43 212687 


F 


M00055596A:E07 


UC2-NormColon 


539 


459521 


1598.All.gz43 212691 


F 


M00055579D:C11 


UC2-NormColon 


540 


140909 


1598.Gll.gz43 212697 


F 


M00055588B:H11 


UC2-NormColon 


541 


641467 


1598.J12.gz43 212716 


F 


M00055592B:C10 


UC2-NormColon 


542 


455864 


1598.L12.gz43_212718 


F 


M00055594D:F07 


UC2-NormColon 


543 


641957 


1598.A13.gz43 212723 


F 


M00055579D:G09 


UC2-NormCoIon 


544 


638941 


1598.K14.gz43 212749 


F 


M00O55593C:D08 


UC2-NormColon 


545 


648159 


1598.B15.gz43 212756 


F 


M00055582B:A06 


UC2-NormColon 


546 


509973 


1598.G15.gz43 212761 


F 


M00055588C:G09 


UC2-NormColon 



85 




Table 2 



SEQ 
ID NO 


1 CLUSTER 


SEQ NAME 


ORIEN 
T 


[ 

CLONE ID 


LIBRARY 


547 


641425 


1598.A17.gz43 212787 


F 


M000555ROR-Rf)8 


ui^z-i\iorniL.oion 


548 


647312 


1598.017.gz43 212801 


F 


M0005S600PPO? 


T TP'") MArmr/\U« 


549 


86311 


1598.C18.gz43 212805 


F 


M00055583CA01 


TTP9 XTrir*nPrtlrt« 


550 


515931 


1598.019.gz43 212833 


F 


M00055600D-RO? 


-in ornii^oion 


551 


651000 


1598.D21.gz43 212854 


F 


M00055585AF1? 


T TP 9 -Mnrm Pol on 


552 


553087 


1598.E21.gz43 212855 


F 


M00055586CA06 


T IP 9 -Mnrmrnl on 


553 


646914 


1598.G21.gz43 212857 


F 


M00055589AB06 


T IP?-NormPnlon 

UV/i. liUl 111V_/U1UI1 


554 


446371 


1598.K21.gz43 212861 


F 


M00055594B:A01 


UC2-NorrnPoIon 


555 


640298 


1598.N23.gz43 212896 


F 


M00055599DC08 


T TP?-NorrnPolnn 

WV/^. 1 ^UllllX^UIUIl 


556 


201904 


1599.A01.gz43 212923 


F 


M00055602BB 1 0 


T TP? -Norm Pol on 
kj v^z,~rN ui iiiv^uiun 


557 - 


641338 


1599.B01.gz43 212924 


F 


M00055603D:A09 


T JP 2 -Norm Pol on 

KJ X if UI 1 11 V^-UIUII 


558 


640221 


1599L01.gz43_2 12934 


F 


M00055619C:F07 


11C 2 -Norm Pol on 


559 


454527 


1599.F02.gz43 212944 


F 


M00055610B:E04 


TJC2-NorrnPo1on 


560 


640868 


1599.I08.gz43 213043 


F 


M00055615CE01 


T TP7-T\rormPo1nn 


561 


641069 


1599.N09.gz43 213064 


F 


M00055623D:G05 


1 TP?-lSJormPo1on 
\J \^-^~lN UI II1V_sU1UI1 


562 


559380 


1599.B10.gz43 213068 


F 


M0005 5 604D :F05 


T TP?-TsJormPnlon 

\J w 1 N UI 1 1 1V^ U1U1I 


563 


641680 


1599.D10.gz43_213070 


F 


M00055608C:E03 


IJC 2 -Norm Pol on 

UVi. 1 ^HJllllV^UlUll 


564 


639703 


1599.11 l.gz43 213091 


F 


M00055615DC07 


T IP? -Norm Pol nn 


565 


641542 


1599.C12.gz43 213101 


F 


M00055606DP05 


T TP? -Norm Pol nn 


566 


5201 


1599.A15.gz43 213147 


F 


M00055602D-G08 


T TP?-NormPn1on 
UV/^"1^I UI IIlV^UlVJll 


567 


607715 


1599.D15.gz43 213150 


F 


M00055608CG1 1 


T TP?-NormPntnn 
U KsZ. Iv UI IIlv^UlUIl 


568 


640416 


1599.L17.gz43 213190 


F 


M00055620D;D05 


T IP? -Norm Pol on 
\j \~r£* iy ui iiiwuiuii 


569 


554833 


1599.B19.gz43_213212 


F 


M00055606A:B11 


T TP?-NormPolon 

UV/i'n UI 1 1 1 UIUH 


570 


639480 


1599.K19.gz43 213221 


F 


M00055619BH04 


T TP? -Norm Pol on 

kj v>z. -in uriii^uiun 


571 


607138 


1599.P21.gz43 213258 


F 


M00055628BB07 


T TP? -Norm Pol on 

u i^i uriii^uiun 


572 


561626 


1599.L23.gz43 213286 


F 


M00055621B:G03 


T IP? -Norm Pol on 


573 


548959 


1600I01.gz43_213315 


F 


M00055639D:D03 


T 1C 2 -Norm Polon 

KsKs£* lyyji iiiv_/Uiuii 


574 


639934 


1600.H02.gz43 213330 


F 


M00055638DD07 


T TP? -Norm Pol on 
U N UI IllV^UiUIl 


575 


640956 


1 600.103. gz43 213347 


F 


M00055639D:F08 


T TP? -Norm Pol on 

KJ W UI IIIV^UIUII 


576 


554722 


1600.L03.gz43 213350 


F 


M0005 5 644 A" D 1 2 


T IP? -Norm Pol on 
kj v^z. i> ui niv^uiun 


577 


649717 


1600.D04.gz43 213358 


F 


M00055633D.A02 


I JP2-NormPo1on 

w V>^# l^tUllll^/UlUil 


578 


646695 


1600H04.gz43_2 13362 


F 


M00055638D:E09 


UC 2 -Norm Polon 

v V-f^. 1 1 UI A11WU1U11 


579 


639886 


1600.N04.gz43 213368 


F 


M00055647C:D02 


TIC2-NormPolon 

KJ 1 if %J1 1 U\_sUlUIl 


580 


451615 


1600.K05.gz43 213381 


F 


M00055643A:C01 


UC2-NormPolon 


581 


635965 


1600.I07.gz43 213411 


F 


M00055640A:G03 


UC2-NormColon 


582 


379040 


1 600.L07.gz43_2 13414 


F 


M00055644B:H12 


UC2-NormColon 


583 


640799 


1600.C10.gz43 213453 


F 


M00055632D:A06 


UC2-NormColon 


584 


642246 


1600.Kll.gz43 213477 


F 


M00055643B:E05 


UC2-NormColon 


585 


467563 


1600.Nll.gz43 213480 


F 


M00055647D:B11 


UC2-NormColon 


586 


640320 


1600.01 l.gz43 213481 


F 


M00055649A:H07 


UC2-NormColon 


587 


557401 


1 600.1 12.gz43_2 13491 


F 


M00055640C:E06 


UC2-NormColon 


588 


646352 


1600.M13.gz43 213511 


F 


M00055646C:B04 


UC2«NormColon 



86 



Table 2 



SEQ 
ID NC 


> CLUSTER 


SEQ NAME 


ORIEN 
T 




LIBRARY 


589 


645004 


IfifiO ot4^ 917*.79 

10UU.DlJ.gZ4j ZIjjjZ 


T7 

r 


M00055630B:G04 


UC2-NormColon 


590 


641702 


1 600 PI 7 o-rAI 9 1 7^.78 
luuu.ri / .gZ4j Z1jj/o 


T7 

r 


M0005565 1 A:E06 


UC2-NormColon 


591 


562000 


1 600 T1 0 0*7/17 7 1 7AH7 

10UU.ll7.gZ4j Z 1 juUj 


t 


M00055641A:C12 


UC2-NormColon 


592 


453441 


1 600 Ml Q ct-rA'x 9 i 7/;n.8 

lOUU.n 1 7.gZ4 j ZIjOUo 


b 


M00055648B:C01 


UC2-NormColon 


593 


641210 


1 600 M90 a^AI 917A97 
iuuu.ivizu.gZ4j ZIjuZj 


T7 

r 


M00055647B:A05 


UC2-NormColon 


594 


458736 


1600 P90 <T-rd.7 917A9A 


r 


M0005565 1B:F08 


UC2-NormColon 


595 


553285 


1600 091 otA'X 917641 


T? 

r 


M0005 5 65 OA : B05 


UC2-NormColon 


596 


455075 


1600 A99 t»-7/17 9176A7 
luUU./viZ.gZHj Z Ij04j 


T7 

r 


M00055629B:G09 


UC2-NormColon 


597 


166041 


1600 f*-747 917669 
!UUU..L/Zj.gZ4j Z 1 JUuZ 


17 

r 


JV100055634C:F09 


UC2-NormColon 


598 


—J J J J 1 VJ 


1600 T97 nvAl 9 17AA8 
1 OUU.JZj ,gZ4 j ZIjOOo 


r 


M00055642D:A05 


UC2-NormColon 


599 - 


559423 


1 600 R94 ct-t-zI^ 9 1 767*; 


c 
r 


M00055632A:B1 1 


UC2-NormColon 


600 


131348 


1 600 P94 <*7Zl7 9 1 "3677 
i uuu.v^Zr4,gZ4j zijo / / 


17 

r 


M00055633B:G02 


UC2-NormColon 


601 


650180 


1 600 F94 a-rA'X 9 1 767G 

1uuu.r-rz4.gz4j z ijo /y 


T7 

r 


M00055636A:Hl2 


UC2-NormColon 


602 


642361 


1600 F94 cr-rA'X 9176SO 
louu.r .Zr4.gZ4 j ZijOoU 


T7 

r 


M00055637B:A01 


UC2-NormColon 


603 


561069 


1600 K7A otAI 91768*. 
iuuu.rvz4.gZ4j Z IjOoj 


tr 
r 


M00055643D:G1 1 


UC2-NormColon 


604 


550515 


1 600 P94 a^AI 9 1 76QO 


T7 

r 


M00055651C:E01 


UC2-NormColon 


605 


559854 


16Q^ D01 ota17 9176QA 
1U7 J..L/U 1 .gz*t J z 1 JU74 


T7 

r 


M00055819A:B10 


UC2-NormColon 


606 


650579 


160^ T01 a-zZl^ 917700 
lU7J.JUl.gZHj Zlj/UU 


T7 

r 


A/fAAflCCOn A A 1 ^ 

M00055 827A: A 1 2 


UC2-NormColon 


607 


648567 


16Q3 T 09 n-rAI 917718 
lU7J.IjUZ.gZ4j ZlJ/lo 


T7 

r 


1/fAAACCO^Ari A 

M00055829C:A07 


UC2-NormColon 


608 


645155 


16Q1 HO'? otJ.7 91777H 

1U7J .XT.UJ .gZ4j Z 1 J /jU 


T7 

r 


MAAAf f O^f T> r»i i 

M00055825B:C1 1 


UC2-NormColon 


609 


650204 


1 60^ 1404 otA'X 9 1 77 A6, 

1U7J .n.U4.gZ4j Z 1 J /40 


T7 

r 


M00055825B:E03 


U C z -NormColon 


610 


643054 


16Q^ R06 0747 917778 
iu7j.nuij.gz*Tj zij//o 


T7 

r 


M000558z5B:F09 


UCz-NormColon 


611 


640464 


1 6Q^ T06 otAI 9 1 777Q 
lu^j.iuu.gZT-j zi j / /y 


T7 

r 


MU0055 826A:G04 


UC2-NormColon 


612 


363172 


160^ A07 cttAI 917787 
1U7J.AU / .gZ4j Zlj/o/ 


17 

r 


XIAAACCO 1 /"H A i f\ 

M0005 5 8 1 6D : A 1 0 


UC2-NormColon 


613 


674576 


160^ *Q08 r»TZl7 9178HQ 

iu7j.uuo.gz4j zijouv 


c 
r 


M00055823B:F02 


UC2-NormColon 


614 


481864 


16Q7 >J08 0-7/17 9i7ai*; 

lU7J.l>IUO.gZ*f J ZlJOlO 


17 

r 


MO0O55832A:A08 


UC2-NormColon 


615 


492242 


160^ AflO 91781Q 

lU7J.AU7.gZ4j Zl J0I7 


17 

r 


li/TAAACCOl /T\ T» 1 t 

M00055816D:B1 1 


UC2-NormCoIon 


616 


VJJ OOJ / 


1 6Q7 (P1 0 rr-747 917877 
1 utj . 1 U.gZ4 j ZIjoj/ 


T7 

r 


M00055818B:H03 


UC2-NormColon 


617 


J 1 f Z,Ou 


I^IH ct7/17 9178/1^ 
lU7J.lVlU.gZ4j Z1jo4j 


T7 

r 


M00055828B:E10 


UC2«NormColon 


618 


^ J O VJ J 


1 607 A 17 cttAI 91788*3 
1 U7 J ./\ 1 J .gZ4 J ZlJOOJ 


T7 

r 


M00055816D:E10 


UC2-NormColon 


619 


642967 


1 603 R1 7 W7Z17 917SGO 
iu7j.nij.gz4j ZJ.J07U 


T? 

r 


AJfAAACCO^m A 

M00055825D:A03 


UC2-NormColon 


620 


645746 


1 60^ T 1 7 otAI 9178Q1 

1 U7 J .1 1 J .gZ4 J Z 1 J07 1 


17 

r 


Mu0055826B:G07 


UC2-NormColon 


621 




160^ T IS ffyzl7 917Q96 
lU7J.LlJ.gZ4j Zl J7Z0 


T7 

r 


MOOO55829D:H10 


UC2-NormColon 


622 


644781 


1 60^ M 1 S a-7/17 91 7097 

1 U7J .1V11 J .gZ4 J Z1J7Z/ 


17 

r 


MUUUjJoj 1A:C06 


UC2-NormColon 


623 


643397 


169*3 A 17 07AI 917Q47 

1U7 J / .gZ4J Z 1 J 74 / 


r 


MUUUjjoI /A:ll05 


UC2-NormColon 


624 


645073 


1693.L17.gz43_213958 


F 


M00055830A:G10 


UC2-NormCo1on 

L^yji iiiv^\ji\jii 


625 


648580 


1693.A19.gz43_213979 


F 


M00055817A:H07 


UC2-NormColon 


626 


650487 


1693.I19.gz43 213987 


F 


M00055826C:G06 


UC2-NormColon 


627 


455716 


1693.J19.gz43 213988 


F 


M00055827D:A01 


UC2-NormColon 


628 


551681 


1693.H20.gz43 214002 


F 


M00055825D:D11 


UC2-NormColon 


629 


642054 


1693.P21.gz43_2 14026 


F 


M00055835C.F08 


UC2-NormCoIon 


630 


556286 


1693J22.gz43_214036 


F 


M00055827D:C02 


UC2-NormColon 
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Table 2 



SEQ 
ID NC 


> CLUSTER 


SEQ NAME 


ORIE1S 
T 


r 

CLONE ID 


LIBRARY 


631 


449542 


1693.A23.gz43 214043 


F 


M00055817B.C04 


UC2-NormColon 


632 


517237 


1693.M23.gz43 214055 


F 


M00055831B:C04 


UC2-NormColon 


633 


647191 


1694.J01.gz43 214084 


F 


M00055846B:F11 


UC2-NormColon 


634 


644212 


1694.G04.gz43 214129 


F 


M00055843B.D10 


UC2-NormColon 


635 


549624 


1694.P04.gz43 214138 


F 


M00055854C.C07 


UC2-NormColon 


636 


645149 


1694.A05.gz43 214139 


F 


M00055836A:B12 


UC2-NormColon 


637 


981 


1694.F05.gz43 214144 


F 


M00055842B:A04 


UC2-NormColon 


638 


647448 


1694.I05.gz43 214147 


F 


M00055845C:A11 


UC2-NormColon 


639 


570939 


1694.P05.gz43 214154 


F 


M00055854C.E03 


UC2-NormColon 


640 


547841 


1694.L06.gz43 214166 


F 


M00055849C:G07 


UC2-NormColon 


641 


466265 


1694.M06.gz43 214167 


F 


M00055851A.C03 


UC2-NormColon 


642 


649842 


1694.M07.gz43 214183 


F 


M00055851A:C09 


UC2-NormColon 


643 


468222 


1694.I08.gz43 214195 


F 


M00055845C:C12 


UC2-NormColon 


644 


450949 


1694.L08.gz43 214198 


F 


M00055849D:B04 


UC2-NonnColon 


645 


603388 


1694.O08.gz43 214201 


F 


M00055853C:C12 


UC2-NormColon 


646 


485237 


1694.E09.gz43 214207 


F 


M00055841B:F09 


UC2-NormColon 


647 


644063 


1694.F09.gz43 214208 


F 


M00055842B:D04 


UC2-NormColon 


648 


639341 


1 694N09.gz43_2 142 1 6 


F 


M00055852A:C12 


UC2-NormColon 


649 


471364 


1694.P09.gz43 214218 


F 


M00055854C:H11 


UC2-NormColon 


650 


452735 


1694.A10.gz43 214219 


F 


M00055836C.D01 


UC2-NonnColon 


651 


503546 


1694.E10.gz43 214223 


F 


M00055841B.H03 


UC2-NormColon 


652 


646420 


1694.I10.gz43 214227 


F 


M00055845C:E02 


UC2-NormColon 


653 


647577 


1694.O10.gz43 214233 


F 


M00055853C:H03 


UC2-NormColon 


654 


504944 


1694.Bll.gz43 214236 


F 


M00055837D:G10 


UC2-NormColon 


655 


522869 


1694.Dll.gz43 214238 


F 


M00055840B:B02 


UC2-NormColon 


656 


644548 


1694.Fll.gz43 214240 


F 


M00055842C.A11 


UC2-NormColon 


657 


644314 


1694.Hll.gz43 214242 


F 


M00055844D:E12 


UC2-NormColon 


658 


650492 


1694.01 l.gz43 214249 


F 


M00055853D:A07 


UC2-NormColon 


659 


643800 


1694.B12.gz43 214252 


F 


M00055838A:A03 


UC2-NormColon 


660 


643843 


1694.E12.gz43 214255 


F 


M00055841C:A03 


UC2-NormColon 


661 


448450 


1694.B13.gz43 214268 


F 


M00055838A:B02 


UC2-NormColon 


662 


643804 


1694.C13.gz43 214269 


F 


M00055839B:A10 


UC2-NormColon 


663 


469511 


1694.L13.gz43 214278 


F 


M00055849D:H09 


UC2-NormColon 


664 


643130 


1694.013.gz43 214281 


F 


M00055853D:B04 


UC2-NormColon 


665 


640171 


1694.J14.gz43 214292 


F 


M00055846D:G11 


UC2-NormColon 


666 


647522 


1694.M14.gz43 214295 


F 


M00055851A:H10 


UC2-NormColon 


667 


642293 


1694.N14.gz43 214296 


F 


M00055852B:G09 


UC2-NormColon 


668 


404816 


1694P14.gz43_214298 


F 


M00055855A:B11 


UC2-NormColon 


669 


447597 


1694.A15.gz43 214299 


F 


M00055837A:B08 


UC2-NortnColon 


670 


467901 


1694.B15.gz43 214300 


F 


M00055838B:D06 


UC2-NormColon 


671 


648039 


1694.C15.gz43 214301 


F 


M00055839B:C07 


1 JC2-NormColon 


672 


607430 


1694.E15.gz43 214303 


F 


M00055841C:D05 


UC2-NormColon 



88 




Table 2 



SEQ 
ID NO 


1 CLUSTER 


SFO NAMF 


T 


pT /~WT"C TT\ 

LXUJNi} ID 


LIBRARY 


673 


OHO JUJ 


10y4.r 15.gz43 214304 


F 


M00055842D:C02 


UC2-NormColon 


61d 


477 1 1 Q 


10y4.LrlJ.gz4J 214305 


F 


M00055843D:H01 


UC2-NormColon 


675 


645 1 07 


1 AG/1 I^K rt—yi 0 Ti^-inn 
1 0y4.lv 1 3 .gZ4 J 2 143U9 


F 


M00055848C:A02 


UC2-NormColon 


616 


650? 1 7 


1 AQ/1 HK 1 ou^n 
10y4.LJ15.gz43 2143 13 


F 


M00055853D:C07 


UC2-NormColon 


fill 


644? 1 0 


10y4.A10.gZ43 214315 


F 


M00055837A.D09 


UC2-NormColon 


61% 

\J / o 


466607 


1 AO/1 I-T1A /vnr/l'S TUITl 

10y4.rll0.gz43 214322 


F 


M00055844D:H09 


UC2-NormColon 


679 


650976 


1 AO A T 1 A *3 OHno 
10y4.110.gZ43 214323 


F 


M00055845D:G11 


UC2-NormColon 


680 


569990 


1 AO/1 \yflA rr-^/l'J oiyim 
loy4.lVllO.gz43 21432 / 


F 


M00055851B:B09 


UC2-NormColon 


UO 1 


469650 


1 AQ/1 D 1 A rrrr/1 ^ 7 1/1 HA 

10y4.Jrl0.gz43 21433U 


F 


M00055855A:G05 


UC2-NormColon 


689 


644404 


1 AO/1 1317 r>-r/l Q Tt/im 

10y4.t$l /.gZ43 214332 


F 


M00055838B:H04 


UC2-NormColon 


6$n 


9473H 


1 AO/i nn ^/lo Ti/i*5iyt 
10y4.Lil /.gz43 214334 


F 


M00055840C:D06 


UC2-NormColon 


684 


S0075Q 


1 AO A T7 1 7 rtrrA 1 inoic 
10y4.iil /.gz43 214335 


F 


M00055841C:D11 


UC2-NormColon 


685 


1 5001 


1 AO/1 17 1 7 *vr»/l *i 

loy4.r 1 / .gz43 2 14336 


F 


M00055842D:D07 


UC2-NormColon 


6X6 


55XQ8 1 


1 AO/f Tf 1 7 sw/l "2 ti -iTyi 1 

10y4.Jvl /.gz43 214341 


F 


M00055848C:G07 


UC2-NormColon 


6X7 
uo / 


509199 

J7Z 1ZZ 


1 AO/1 O 1 7 rr-wA 1 O 1 

10y4.LJi /.gZ43 214345 


F 


M00055854A:B07 


UC2-NormColon 


6XX 


4&yz4y 


1 AO/1 A 1 Q r*~wAi '^lAiA'n 

loy4.Alo.gz43 214347 


F 


M00055837A:F02 


UC2-NormColon 


6RQ 


447nn? 


1 AO/1 0 10 **~A1 7 1/iO/fO 

loy4.jDlo.gz43 214348 


F 


M00055838C:A08 


UC2-NormColon 


600 


JJ007U 


1 AO /I Pi Q ^_/f 0 O 1 jiO>1f\ 

16y4.L,lo.gz43 214349 


F 


M00055839B:E07 


UC2-NormColon 


oy i 


A/Ll^/lft 
O44045 


loy4.Lllo.gz43 214350 


F 


M00055840C:H06 


UC2-NormColon 


AO? 




1 AO/1 PI O «._/l*5 11 ylOfl 

loy4.Lrlo.gz43 214353 


F 


M00055844A:D03 


UC2-NormColon 


6Q3 


04JZoy 


loy4.Aly.gz43 214363 


F 


M00055837A:H08 


UC2-NormColon 


604 


A43048 


1 AO/I 13 1 Q ,rv**/1 "3 T 1 /ll^yl 

10y4.i5iy.gz43 214364 


F 


M00055838C:B08 


UC2-NormColon 


605 


^7^»a 

Dd / joO 


1 AG /I rifl r*-,A 1 7 1/f3zTC 

loy4.L,19.gz43 214365 


F 


M00055839B:H09 


UC2-NormColon 


606 


4A?^7 


1 AO/1 17 1 O n^A 1 "t1A*)£LO 

loy4.r ly.gz43_2 14368 


F 


M00055842D:F07 


UC2-NormColon 


607 


A47^77 
04 ZD f f 


1 AO/1 PlIO r^r^-wA "3 T1 /f3£n 

1 oy4.Lrl y.gz43_2 14369 


F 


M00055844A:D07 


UC2-NormColon 


60X 


A430QQ 


1 AO/1 Ul Q r+~,A 1 TI/fJTA 

16y4.ril9.gz43 214370 


F 


M00055845A:C03 


UC2-NormColon 


600 


65 1 1 3 1 


10y4.J iy.gz43 2143/2 


F 


M00055847B:G12 


UC2-NormColon 


700 


49^Q?^ 


1 AO/1 AilO 7 1/f3"7C 

loy4.Mly.gz43 214375 


F 


M00055851B:G10 


UC2-NormColon 


701 


6^0^ 1 n 

OJ7J 1U 


1 AO/l Pi 1 O i Ti/iin 

10y4.Uly.gz43_2 14377 


F 


M00055854A:D01 


UC2-NormColon 


709 


643Q77 

0*t J7 / / 


1 AO/1 r*r,A1 11 /HOI 

loy4.L)2U.gz43__2 14382 


F 


M00055840D:B03 


UC2-NormColon 


703 


1 4064K 


1 AO/1 T7?A. rrrrAI TI/IIOI 

10y4.jb2U.gZ43 214383 


F 


M00055841C:H04 


UC2-NormColon 


704 


64006ft 


1 AO/1 f^7 n r»rr/1 'J 7 1 y1 *3 O C 

1 oy4 .Lr2U.gZ43_2 1 43 85 


F 


M00055844A:F11 


UC2-NormColon 


705 


64470X 

O a T*T / I/O 


1 AO/1 AJ7A rv-r/1 3 7 1 /I 3fl7 

1 Oy4 . JN ZV.gZ'tD 2 1 43 92 


17 

F 


M00055852D:B11 


UC2-NormColon 


706 


643000 


1 AO/1 I>?Pi rv^/1 3 7 1/1*30/1 

10y4.r2U.gZ43 214394 


F 


lk /AAA F €\ ^ ^ T\ T^fc -a i 

M00055855B:B11 


UC2-NormColon 


707 


65059X 


1AQ4T791 n-rAI 7 1/faoo 

10y4.riz1.gz4 J 2143yy 


r 


M0005584 1 D:C 1 1 


UC2-NormColon 


708 


644569 


1694.H2 1 .gz43_2 14402 


F 


M00055X45A'Hl 0 




709 


646459 


1694.J21.gz43 214404 


F 


M00055847C:A11 


UC2-NormColon 


710 


608873 


1694.L21.gz43 214406 


F 


M00055850C:D01 


UC2-NormColon 


711 


649611 


1694.N21.gz43 214408 


F 


M00055852D:G12 


UC2-NormColon 


712 


465576 


1694.P21.gz43 214410 


F 


M00055855B:D12 


UC2-NormColon 


713 


467521 


1694.C22.gz43 214413 


F 


M00055839C:B11 


UC2-NormColon 


714 


646810 


1694.J22.gz43_214420 


F 


M00055847C:C01 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 

A-(1AJ A. V/VX V X 


715 


647318 


1694.L22.gz43 214422 


F 


M00055850Pnns 


Uv^z-iNonncolon 


716 


638854 


1694.022.gz43 214425 


F 


1 T X \J \J \J *J J O +J *T J^Y , J_y \J *T 


TTP9 "Mrt***-*^^!^** 

u v^z-in ormvyOion 


717 


644987 


1694.A23.gz43_214427 


F 


M00055837BE07 


u v>z -in onni_/Oion 


718 


645948 


1694C23.gz43_214429 


F 


M00055839PT)Or; 


uv^z-ANonTiv>oion 


719 


419751 


1694.123 .gz43 214435 


F 


M00055846RR19 


u v^z-iN onuv^oion 


720 


643594 


1694M23.gz43_214439 


F 


M00055851CF12 


T TP 9 - "NJoT-tn Prtl ^« 
\j v^z, _ i>ui iiiv^oion 


721 


603388 


1694.P23.gz43_2 14442 


F 


M00055855CF11 

A ▼ M. V W »y *J \J *S . X X X 


T TP?-TVTnrmPri1nri 


722 


644075 


1694.G24.gz43_2 14449 


F 


M00055844BC12 


T TP?-r\Tnrmr i nlnn 


723 


542282 


1 694.124 .gz43_2 1445 1 


F 


M00055846RP 1 9 


VJVyZ,-INOIIIlV^t)10Il 


724 


452976 


1694.J24.gz43„2 14452 


F 


M0005 5 847C H09 

a. vy vy vy r / » j, A vy 


T IP ? -l\Tnrm Pnl on 
Vyz,~i > vji iiiV/Uiiyii 


725 - 


472801 


1694.M24.gz43 214455 


F 


M00055851CH05 


T TC 9 -Nnrm Pnl nn 


726 


646633 


1695.P03.gz43_214910 


F 


M00055875BE09 


VJ VyZ,~l N vll lllV^UliJll 


727 


509027 


1695.A04.gz43 21491 1 


F 


M00055855DD12 

iTiv/u vy «*y vy **y *_y x-' » j_ 


T TP9-T\TnrmPn1nn 
vj VvZ, ui in v>uivjn 


728 


596882 


1695.D05.gz43 214930 


F 


M0005 5 8 60D : E04 


T TP 9 -IMnrm Pnl nn 


729 


644442 


1695.A07.gz43_214959 


F 


M00055855D:G08 


T JP2-NnrmPn1nn 

UV/ 7 * 1 ^( Ul 111V-/ vJlvJll 


730 


644047 


1695F07.gz43_2 14964 


F 


M00055863DD09 


T IP? -Nnrm Pnl nn 

VJ V_^Z. 1 > vjl lllV_^vJI\jll 


731 


397399 


1695.H07.gz43_214966 


F 


M00055866AG10 

ATA Uv \/«y^ »y U \J\jn., VJ XV/ 


T IP 9 -Nnrm Pnl nn 

UV/Z, n 4J1 111 VyUlLIll 


732 


564854 


1695A08.gz43_2 14975 


F 


M00055856AC06 


T TP 9 -Nnrm Pnl nn 


733 


646372 


1695.B08.gz43 214976 


F 


M0005 5 85 7C • D09 


T TP 9 -Nnrm Pnl nn 

VJ V^Z. UI IIIV^UIUII 


734 


645848 


1695. D08.gz43 214978 


F 


M00055861BF04 


T TP 9 -Nnrm Pnl nn 

U vZ.li UI IllV^UIUIl 


735 


447035 


1695.M08.gz43_214987 


F 


M00055872DD 1 2 


T TP 9 -Nnrm Pnl nn 

vJ V^Z. 1 1 Ul 1 1 1 Vy U1UI 1 


736 


644149 


1695.A09.gz43 214991 


F 


M00055856AD 1 2 


T TP9-NnrmPrtlnn 

u v^z,~i> ui i ii v^uiun 


737 


640147 


1695.E09.gz43 214995 


F 


M00055862DB02 


T TP9-NnrmPn1nn 

UvZ ll UI 1 11 Vy UlUIl 


738 


524261 


1695.K09.gz43 215001 


F 


M00055871 AH06 


T TP9-NnrmPo1nn 

U Vy2.-1NU1 IllVyUIOIl 


739 


449500 


1695.Ell.gz43_215027 


F 


M0005 5 8 62D • D06 


T TP 9 -Nnrm Pnl nn 

VJ V>Z 1 ^Ul HlVyUlUll 


740 


648819 


1695Kll.gz43_2 15033 


F 


M00055871BB03 


T TP 9 -Nnrm P nl nn 

VJ vZ iiUl I11V_/U1UI1 


741 


644755 


1695.Lll.gz43 215034 


F 


M0005 5 8 72 A- C08 


T TP9-NnrmPn1on 


742 


516512 


1695.Pll.gz43 215038 


F 


M00055875CC07 

ITIVV/V JJ U / «JVy. VyV/ / 


T TP 9 -Nnrm Pnl rvn 

vj VyZ.-nui inv^uiun 


743 


380550 


1695.L12.gz43 215050 


F 


M0005 5 8 72 A • D08 


T TP9-NnrmPn1nn 

VJ V^Z, 1> UI 1I1V>U1UI1 


744 


645538 


1695D13.gz43_215058 


F 


M00055861C/G03 


T TP 9 -Nnrm Pnl nn 

VJ V-'Z.—l^l Ul IIIV^UIUII 


745 


644047 


1 695 .F 1 3 .gz43_2 15060 


F 


M00055864AC09 

a ▼ i. vy vy vy *y *y vy vy r,i . \j 


TTP2-NnrmPnlnn 

VJ Vy Z» 1 > Vjl lllVyvJlVJll 


746 


650773 


1695.F14.gz43 215076 


F 


M00055864AE11 


T TP9-NnrmPn1nn 

VJ V/^ I^IUl IIIV^UIUII 


747 


449936 


1695.J14.gz43 215080 


F 


M00055869C:G06 


TJC2-NormPn1nn 

VJ Vy*( 1 1 VI XtlVy VI1V/11 


748 


554646 


1695.014.gz43 215085 


F 


M00055875A:G05 


UC2-NorniColon 


749 


640534 


1695L15.gz43_2 15098 


F 


M00055872A:E11 


UC2-NormCoIon 

V-/ x ^ vy i. in v/ Vy i. vyx J. 


750 


642411 


1695J16.gz43_215112 


F 


M00055869D:A07 


UC2-NormColon 


751 


643451 


1695M16.gz43_215115 


F 


M00055873A:E03 


UC2-NormColon 


752 


644435 


1695H17.gz43_215126 


F 


M00055866C:G09 


UC2-NormColon 


753 


561877 


1695.F18.gz43 215140 


F 


M00055864B:C09 


UC2-NormColon 


754 


645497 


1695.N18.gz43_215148 


F 


M00055874A:F06 


UC2-NormColon 


755 


644030 


1695.E20.gz43_215171 


F 


M00055863B:C07 


UC2-NormColon 


756 


140224 


1695D21.gz43_215186 


F 


M00055862A:C01 


UC2-NormColon 
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Table 2 



SEQ 
I ID NO 



CLUSTER 



SEQ NAME 



lORIEN 
T 



CLONE ID 



LIBRARY 



643099 



1695.K21.gz43 215193 



M00055871C:C07 I UC2-NormColr,n 



643333 
649660 



1695.P22.gz43 215214 



M00055875D:D01 | UC2-NormColnn 



1695.G23.gz43 215221 



M00055865C:G11 | UC2-NormColnn 



645288 



1695.H23.gz43 215222 



M00055867A:B02 I UC2-NonnColon 



650517 



1695.G24.gz43 215237 



M00055865C:H06 I UC2-NormColon 



552201 



1696.J01.gz43 215256 



M00055885C:B07 UC2-NormColon 



643513 



1696.P02.gz43 215278 



M00055891B:F09 UC2-NormColon 



447807 



1696.D04.gz43 215298 



639256 



1696.M04.gz43 215307 



M00055880B:A06 UC2-NonnColon 



M00055888C:F07 UC2-NormColon 



179760 



1696.G05.gz43 215317 



M00055883A:C02 UC2-NormColon 



459274 



1696.G06.gz43 215333 



M00055883A:C10 UC2-NormColon 



345761 



1696.I06.gz43 215335 



640356 



1696.M06.gz43_215339 



M00055884D:F07 UC2-NonnColon 



M00055888C:G09 UC2-NormColon 



645638 



1696.A07.gz43 215343 



M00055877A:H04 UC2-NormColon 



464171 



1696.Nll.gz43 215420 



446225 



1696.E13.gz43 215443 



M00055890A:A06 UC2-NonnColon 



450559 



1696.F14.gz43 2 1 5460 



F_ 
F 



M00055881A:GO7 UC2-NormColon 



643248 



1696.L14.gz43 215466 



M00055882C:A06 UC2-NonnColon 



M00055887D:C11 UC2-NormColon 



549114 



1696.N14.gz43 215468 



M00055890A:D01 UC2-NormColon 



645215 



1696.D15.gz43 215474 



M00055880C:F07 UC2-NormColon 



555172 



1696.I15.gz43 215479 



M00055885B:A11 UC2-NormColon 



593715 



1696.B16.gz43 215488 



M00055878C:C02 UC2-NonnColon 



586992 



1696.116.gz43_215495 



451670 



1696.H17.gz43 215510 



M00055885B:B04 UC2-NormColon 



M00055884B:F10 UC2-NormColon 



552628 



1696.H18.gz43 215526 



M00055884B:H07 UC2-NormColon 



643233 



1696.A21.gz43_2 15567 



640400 



1696.F21.gz43 215572 



M00055877D:C05 UC2-NormColon 



M00055882D:B02 UC2-NormColon 



644468 



1696.G21.gz43_2 15573 



M00055883D:B06 UC2-NormColon 



643046 



1696.H21.gz43 215574 



M00055884C:B07 UC2-NormColon 



649068 



1696.N21.gz43 215580 



M00055890A:G12 UC2-NormColon 



648063 



1696.E22.gz43 215587 



M00055881D:A02 UC2-NormColon 



570248 



1696.M23.gz43_215611 



M00055889B:E12 UC2-NonnColon 



643724 



1696.N23.gz43 215612 



M00055890A:H11 UC2-NormColon 



649027 



1696.P23.gz43 215614 



M00055892A:F04 UC2-NormColon 



643366 



1696.P24.gz43 215630 



M00055892B:D02 UC2-NormColon 



526575 



1705.E01.gz43 215635 



M00055900D:D03 UC2-NonnColon 



449737 



1705.O02.gz43_215661 



M00055914A:A03 UC2-NormColon 



394373 



1705.B03.gz43 215664 



M00055894B:E09 UC2-NormColon 



557361 



1705.C03.gz43 2 1 5665 



M00055896D:G10 UC2-NormColon 



644364 



1705.K04.gz43 215689 



M00055908D:F09 UC2-NormColon 



644242 



705.M05.gz43_215707 



M00055911B:E06 UC2-NormColon 



644407 



1705.N05.gz43_215708 



M00055912D:F04 UC2-NonnColon 
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Table 2 



SEQ 
IIDNOI 



CLUSTER 



SEQ NAME 



lORIENI 
T 



CLONE ID 



LIBRARY 



799 
800 



643513 | 1705.F06.gz43 215716 



M00055902A:H11 I UC2-NormCnlnn 



639194 | 1705.H06.gz43 215718 



M00055905B:H01 | UC2-NormColon 



801 



802 
803 



454720 | 1705.I06.gz43 215719 



643300 | 1705.O09.gz43 215773 



M00055906D:G03 I UC2-NormCnlnn 



M00055914C:D12 I UC2-NormColnn 



805 



806 
807 



647522 | 1705.E10.gz43 215779 



804 648576 | 1705.Bll.gz43 2 1 5792 



M00055901B:A02 UC2-NormColon 



485237 | 1705.Cll.gz43 215793 



M00055896A:G01 I UC2-NormColon 



644108 | 1705.O1 l.gz43 215805 



M00055898B.E07 I UC2-NormColon 



M00055914C:G01 UC2-NonnColon 



643914 I 1705.G12.gz43 215813 



M00055904B:B01 UC2-NormColon 



808 
809 



641645 | 1705.O12.gz43 215821 



M00055914C:G11 UC2-NormColon 



605596 | 1705.H13.gz43 215830 



M00055905D:B06 UC2-NormColon 



810 
811 



564134 | 1705.O13.gz43 215837 



M00055914D:A08 UC2-NormColon 



643948 | 1705.B14.gz43 215840 



M00055896B:C06 UC2-NormColon 



812 
813 



557947 I 1705.F14.gz43 215844 



M00055902D:A07 UC2-NormColon 



15001 | 1705.G15.gz43 215861 



M00055904C:A07 UC2-NormColon 



814 649170 | 1705.P15.gz43 2 1 5 870 



M00055916B:C02 UC2-NormColon 



815 644390 | 1705.D16.gz43 215874 



M00055900A:F08 UC2-NormColon 



816 | 643089 | 1705.K16.gz43 215881 



817 553587 | 1705.O16.gz43 215885 



818 
819 



M00055909B:G10 UC2-NormColon 



550242 | 1705.H17.gz43 2 1 5 894 



M00055914D:G06 UC2-NormColon 



M00055905D.H02 UC2-NoimColon 



639413 | 1705.O17.gz43 215901 



M00055915A.A06 I UC2-NormColon 



820 
821 



645844 1705.C18.gz43 215905 



M00055899A:B03 UC2-NonnColon 



641087 | 1705.E18.gz43 215907 



M00055901C:C10 UC2-NormColon 



822 
823 



482163 | 1705.F18.gz43 215908 



M00055902D:H10 UC2-NonnColon 



643808 I 1705.G18.gz43 215909 



M00055904D:A06 UC2-NormColon 



824 
825 



468147 | 1705.L19.gz43 215930 



M00055910C:G01 UC2-NonnColon 



446728 | 1705.D20.gz43 215938 



M00055900B:E12 I UC2-NormColon 



826 447326 | 1705.K20.gz43 215945 



M00055909C:E08 UC2-NormColon 



827 685001 | 1705.L20.gz43 215946 



M00055910C:G04 UC2-NormColon 



828 



645781 | 1705.K21.gz43 215961 



M00055909C:G09 UC2-NonnColon 



829 643825 | 1705.L21.gz43 215962 



M00055910D:A03 UC2-NoimColon 



830 



374340 | 1705.A22.gz43 215967 



M00055894A:H08 UC2-NormColon 



831 640756 | 1705.M22.gz43 215979 



M00055912C:C08 UC2-NormColon 



832 | 400741 | 1705.B23.gz43 215984 



M00055896C:H10 UC2-NormColon 



833 | 644063 | 1705.F24.gz43 21 6004 



M00055903B:C10 UC2-NormColon 



834 644354 | 1705.G24.gz43 216005 



M00055905A:F11 UC2-NormColon 



835 | 89082 | 1705.K24.gz43 216009 



M00055909D:E01 UC2-NormColon 



836 643991 1705.O24.gz43 216013 



M00055915C:B09 UC2-NormColon 



837 | 459881 | 1706.B01.gz43 216028 



M00055918B:B10 UC2-NormColon 



838 I 644554 | 1706.C01.gz43 2 1 6029 



M00055919D:H07 UC2-NormColon 



839 I 557175 | 1706.B03.gz43 216060 



M00055918B:F10 UC2-NormCoIon 



840 | 641484 | 1706.F03.gz43 2 1 6064 



M00055924C:A09 UC2-NonnColon 
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Table 2 



SEQ 
IIDNOI 



CLUSTER 



841 
842 



SEQ NAME 



|ORIEN 
T 



651131 



1706.B05.gz43 2 1 6092 



CLONE ID 



M00055918C:C04 



LIBRARY 



UC2-NormColon 



644629 
419009 



1706.J05.gz43 216100 



M00055930A:G04 



UC2-NormColon 



843 



1706.E06.gz43 216111 



M00055922D:G10 



UC2-NormColon 



844 
845 



446795 



1706.M06.gz43 216119 



M00055934C.E06 



UC2-NormColon 



638808 



1706.L07.gz43 216134 



M00055933A.-E05 



UC2-NormColon 



846 
847 



646596 



1706.N07.gz43 216136 



M00055936C.B05 



UC2-NormColon 



639901 



1706.P07.gz43 216138 



M00055939B.A11 



UC2-NormColon 



848 



648379 



1706.N08.gz43_216152 



M00055936C:D03 



UC2-NormColon 



849 



650938 



1706.J09.gz43 216164 



850 643170 



M00055930C:D03 



1706.E10.gz43 216175 



UC2-NormColon 



M00055923B:C07 



UC2-NormColon 



851 



643239 



1706.H10.gz43 216178 



M00055927D:E11 



852 639596 



UC2-NormColon 



1706.J10.gz43 216180 



853 605114 



M00055930C:F06 



1706.L10.gz43 216182 



UC2-NormColon 



M00055933B.B11 



UC2-NormColon 



854 456920 



1706.Nll.gz43 216200 



M00055936D.F03 



UC2-NormColon 



855 



647952 



1706.B12.gz43 216204 



M00055919A:A06 



UC2-NormColon 



856 650470 



1706.M12.gz43 216215 



M00055934D:H09 



UC2-NormColon 



857 465576 



1706.D13.gz43 2 1 6222 



M00055922A:F05 



UC2-NormColon 



858 455413 



859 639038 



1706.P13.gz43_216234 



M00055939D.D07 



1706.H15.gz43_216258 



UC2-NormColon 



M00055928A:C06 



UC2-NormColon 



860 



556925 



1706.J15.gz43_216260 



M00055930C:H05 



UC2-NormColon 



861 



862 



644301 



1706.A16.gz43 216267 



M00055917D:E07 



447520 



UC2-NonnColon 



1706.J16.gz43_216276 



M00055930D:B05 



UC2-NormColon 



863 



447426 



1706.D17.gz43 216286 



M00055922B:A06 



UC2-NormColon 



864 610893 



1706.H17.gz43 216290 



M00055928A:E11 



UC2-NormCoIon 



865 613626 



1706.J17.gz43 216292 



M00055930D:B09 



UC2-NormColon 



866 645092 



1706.N17.gz43 2 1 6296 



M00055937A:F07 



UC2-NormColon 



867 



868 



649873 



1706.P19.gz43 216330 



M00055940B:A08 



645028 



1706.A20.gz43 216331 



UC2-NormColon 



M00055918A:F10 



UC2-NormColon 



869 639154 



1706.F20.gz43_216336 



M00055925B:D09 



UC2-NormColon 



870 649085 



1706.D21.gz43 216350 



M00055922B:G09 



UC2-NormColon 



871 



643332 



1706.F21.gz43 216352 



M00055925B.D10 



UC2-NormColon 



872 



643383 



1706.H21.gz43_2 16354 



M00055928B:E11 



UC2-NormColon 



873 



204 



1706.J21.gz43 216356 



M00055930D:F05 



' JC2-NormCoIon 



874 641716 



1706.N21.gz43 216360 



M00055937C:C08 



1 JC2-NormColon 



875 



643627 



1706.E22.gz43 216367 



M00055924B:D02 



UC2-NormColon 



876 640672 



1706.M23.gz43 216391 



M00055935C:E03 



UC2-NormColon 



877 550237 



1707.J02.gz43 216453 



M00055956D:B01 



UC2-NormColon 



878 646317 



1707.B04.gz43_216477 



M00055942B:H10 



UC2-NormColon 



879 643968 



1707.K05.gz43 216502 



M00055958C:E03 



1 JC2-NormColon 



880 



649549 



1707F06.gz43_216513 



M00055949D:B07 



UC2-NormColon 



881 



513619 



1707.O07.gz43_216538 



M00055967A:F11 



UC2-NormColon 



882 



460245 



1707.I08.gz43 216548 



M00055955B.CO6 



UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 




883 


646061 


1707 O08 ffz4"? ?1fiSS4 


r 


ivxUULOjyo /A.LrUl 


UC2-NormColon 


884 


645273 


1707 009^74^ 716S70 


r 




UCz-NormColon 


885 


524546 


1 707 A 1 2 crz4^ 7 1 6604 


x: 
r 


iVLUUU D J y4 1 d . AU4 


UC2-NormColon 


886 


648782 


1 707 B 1 2 ot&TK 7 1 660S 


p 


1V1U UU j j y4zu . AU 1 


UCz-NormColon 


887 


645900 


1 707 0 1 3 &743 9 1 66^4 


17 

r 


wa nn n ^ ^ o ^ *7 T5 • t?m 
iviuuuj jyo / rJ.r U / 


U Cz-N ormColon 


888 


648555 


1 707 D 1 4 £tt4^ 9 1 66^0 


V 
r 


JVLUUU3 3 J .Uvy 


U Cz-N ormColon 


889 


452316 


1 707 T 1 4 <*743 7 1 664 S 




1V1UUUJ jVj /X5.F 1 1 


UCz-JNormColon 


890 


461363 


1707D17ez43 216687 

X 1 \J 1 . X~r X / '£y' it -J X UVJO / 


rr 

X 


iv/f non ^ ^ 04 ^ n • n o i 


u i^z-iN orm^oion 


891 


644720 


1707 D18 2z43 216701 




lV/f 000 ^ Q4 ^ Fi ■ F O SI 


u cz-jn ormcoion 


892 


647312 


1707L18ez43 216711 


p 


iv/f n o o ^ ^ o 6 t r> • po ^ 


u cz-jn ormcoion 


893 - 


646190 


1707N18ez43 216713 


P 

X 


TV4000^^066R-HOQ 


u cz -in ormcoion 


894 


531529 


1 707 H 1 9 2 z43 2 1 6723 


p 




u cz-jn ormcoion 


895 


393599 


1707 E20 ez43 216736 


p 


MOOO^ S 0471^* POO 


u cz-jn ormcoion 


896 


645508 


1 707 N20 ffz43 2 1 674 S 


p 

X 


iv>rooo^^066r**r>n > i 


u c z -in ormcoion 


897 


383609 

W mJ \J \J +f 


1707 023 &743 7167Q4 


p 
r 


1^4000^^0671^*^0^ 


u cz-in ormcoion 


898 


447805 


1707B24az43 2167Q7 


F 

X 


\A OOO ^ ^ 04^ R . p n 1 

ivxuuu.} jy*f jxj.xiu i 


u c z -in ormcoion 


899 


694643 


1707P24ffz43 216811 


p 


lV/f000*^SQ6O A *Pn7 


u cz-in ormcoion 


900 


647924 


1708 101 ez43 716870 


p 


\4000-\SQ80Pl-'Rn7 


u cz -jn ormcoion 


901 


645920 


1708P01ez43 216877 


p 


TV4000 ^ ^ QQO A • P07 


u c z - jn ormc oion 


902 


641484 


1 708 M02 ar43 7 1 6840 


F 


lV/fooos s os ^ r> • pno 


u cz-jn ormcoion 


903 


640672 


1708 A04 az43 216860 


p 

F 


TV/f OOO ^ ^ Q60T> * RO *\ 


u cz-jn ormcoion 


904 


644226 


1708 K04 a 743 716870 


P 

X 


Ayrooo ^ ^ 0 r 7 c • un 1 


u cz - jn ormcoion 


905 


557852 


1 708 C06 {X743 7 1 68Q4 


P 


TV/f OOO *\ Q77 f -r* no 


u cz-jn ormColon 


906 


558052 


1708C07ffz43 716Q10 


F 


^000^*1077^^07 
IVIUUUj J7 /Zl^.FUZ 


u cz - jn ormcoion 


907 


644354 


1708 108 ff743 716Q37 


p 


7V4000^QROr'-m 7 


u cz-jn ormcoion 


908 


645746 


1708 T 08 £7741 71601S 


p 


IVyTOO O *n O ft /I r\ • T7 Ci A 
1VIUU U 3 D y 0 U . Ct U4 


UCz-JN ormColon 


909 


647539 


1 708 M09 a741 7 1 60^7 


P 
F 


lV/f OOO^ ^Q»6 A -PO-n 


u cz -jn ormcoion 


910 


645288 


1708 1,10^743 716Q67 


P 
F 


TV4000^-^Qft^ A 'RHA 


ucz-iNormColon 


911 


645289 


1708 A1 1 ^743 716077 


P 

r 


TV>f OOO*^ ^Q^QPi'TinQ 


UCZ-JNormcolon 


912 


642058 


1708 Gil az43 216Q78 


p 

F 


TV^OOOS ^Q7ft A •HO'? 


u cz -in ormcoion 


913 


642955 


1708 A12 ez43 216988 


F 

F 


A/T000SS06Qn-FOS 


ucz-jNormcoion 


914 


640092 


1708H12ez43 21699^ 


F 

X 


M000^^07QTir^O7 


u cz-in ormcoion 


915 


558452 


1708P12ffz43 217003 


F 

F 


M000S S0Q0rVRO7 
1 VI U \J KJ J J y zf \J U . D u z 


ucz-iNormcoion 


916 


645427 


1708 L13 ez43 217015 


F 

X 


M0O0SS08SPIPO7 


u cz -in onricoion 


917 


649082 


1708B14ez43 217021 


F 


M000S^971RA1 1 




918 


451709 


1708.K14.gz43_2 17030 


F 


M00055983C:C09 


UC2-NormColon 


919 


640204 


1708.H15.gz43_2 17043 


F 


M00055979C:B07 


UC2-NormColon 


920 


640179 


1708.N15.gz43_217049 


F 


M00055988A:A12 


UC2-NormColon 


921 


582549 


1708.P15.gz43 217051 


F 


M00055990D:G09 


UC2-NormColon 


922 


484355 


1708.G16.gz43_217058 


F 


M00055978B:F01 


UC2-NormColon 


923 


646013 


1708.J17.gz43_2 17077 


F 


M00055982AG06 


UC2-NormColon 


924 


649309 


1708.P18.gz43_2 17099 


F 


M00055991A:D10 


UC2-NormColon 



94 




Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


925 


647333 


1708.F19.gz43 217105 


F 


M00055977A:G09 


UC2-NormColon 


926 


647058 


1708.A21.gz43_217132 


F 


M00055970C:A05 


UC2-NormColon 


927 


643723 


1708.F21.gz43 217137 


F 


M00055977B:F06 


UC2-NormColon 


928 


639359 


1708.N21.gz43_217145 


F 


M00055988A:E05 


UC2-NormColon 


929 


647539 


1708.H23.gz43_217171 


F 


M00055980A:H06 


UC2-NormColon 


930 


647211 


1708.C24.gz43_217182 


F 


M00055973C:F10 


UC2-NormColon 


931 


650018 


1708.J24.gz43_217189 


F 


M00055982C:A12 


UC2-NormColon 


932 


645848 


1708.K24.gz43 217190 


F 


M00055984A:F05 


UC2-NonnColon 


933 


646212 


1708.P24.gz43_217195 


F 


M00055991C:H11 


UC2-NormColon 


934 


406436 


1588.N01.gz43_217262 


F 


M00055553A:D08 


UC2-NormColon 


935 


553651 


1588.E02.gz43_2 17269 


F 


M00055542C:H05 


UC2-NormColon 


936 


466092 


1588.H02.gz43_217272 


F 


M00055546B:H03 


UC2-NormColon 


937 


645891 


1588.N02.gz43_2 17278 


F 


M00055553A:F11 


UC2-NormColon 


938 


649390 


1588.A03.gz43_2 17281 


F 


M00055536B:H1 1 


UC2-NormColon 


939 


448358 


1588.H04.gz43_2 17304 


F 


M00055546C:B01 


UC2-NormColon 


940 


650982 


1588.104.gz43_2 17305 


F 


M00055547C:B07 


UC2-NormColon 


941 


644012 


1588.O05.gz43_217327 


F 


M00055554C:B04 


UC2-NormColon 


942 


649099 


1588.P06.gz43_217344 


F 


M00055556A:E07 


UC2-NormColon 


943 


551662 


1588.M07.gz43_217357 


F 


M00055551C:C08 


UC2-NormColon 


944 


639316 


1588.K08.gz43_217371 


F 


M00055549C:E10 


UC2-NormColon 


945 


639480 


1588.J09.gz43_217386 


F 


M00055548B:G06 


UC2-NormColon 


946 


1318 


1588All.gz43_217409 


F 


M00055537C:A01 


UC2-NormColon 


947 


457146 


1588.Lll.gz43_217420 


F 


M00055550D:D02 


UC2-NormColon 


948 


451671 


1588.A15.gz43_217473 


F 


M00055537C:E04 


UC2-NormColon 


949 


641401 


1588.115.gz43_217481 


F 


M00055547D:G06 


UC2-NormColon 


950 


453606 


1588.D17.gz43_217508 


F 


M00055542B:B11 


UC2-NormColon 


951 


642414 


1588.G17.gz43_217511 


F 


M00055545D:C03 


UC2-NormColon 


952 


649564 


1588.H18.gz43_217528 


F 


M00055547A.H11 


UC2-NormColon 


953 


529799 


1588.118.gz43_2 17529 


F 


M00055548A:D07 


UC2-NormColon 


954 


645781 


1588.N18.gz43_217534 


F 


M00055553D:E06 


UC2-NormColon 


955 


447379 


1588.M20.gz43_2 17565 


F 


M00055552C:G03 


UC2-NormColon 


956 


639461 


1588.A21.gz43 217569 


F 


M00055538B:G09 


UC2-NormColon 


957 


451361 


1588.D21.gz43_217572 


F 


M00055542C:B02 


UC2-NormColon 


958 


449891 


1588.122.gz43_217593 


F 


M00055548A:F04 


UC2-NormColon 


959 


639395 


1588.022.gz43_217599 


F 


M00055555D:B05 


UC2-NormColon 


960 


644173 


1588.F23.gz43_217606 


F 


M00055545A:C01 


UC2-NormColon 


961 


648494 


1588.H24.gz43 217624 


F 


M00055547B:G09 


UC2-NormColon 


962 


477064 


1588.P24.gz43_2 17632 


F 


M00055557B:B10 


UC2-NormColon 


963 


535436 


1729.L02.gz43_2 17660 


F 


M00056092B:D10 


UC2-NormColon 


964 


650900 


1729.B04.gz43_217682 


F 


M00056078B:G02 


UC2-NormColon 


965 


649149 


1729.D04.gz43_217684 


F 


M00056081A:C05 


UC2-NormColon 


966 


644054 


1729.K04.gz43_2 17691 


F 


M00056091A:E04 


UC2-NormCoIon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


967 


649082 


1729.P04.gz43 217696 


F 


M00056097BC04 


\j v^-in oxiiiv^oion 


968 


454176 


1 729 J05 .gz43_2 1 7706 


F 


M00056089DA09 


-in orm v^oion 


969 


448485 


1729.A06.gz43 217713 


F 


M00056077B:E03 


wv^z,-inuiiiiv>ljioia 


970 


643440 


1729.N06.gz43 217726 


F 


M00056095B:A07 


U V/Z, —IN UI liiV/UiUIl 


971 


649356 


1729.J08.gz43 217754 


F 


M0005 6089D : E07 


T IP ? -Norm Crd nn 


972 


710155 


1729.L08.gz43 217756 


F 


M00056092C-E12 


T JC2-NormCn1rm 

U V-/ —IN tJI I 1 1 V^UlUIl 


973 


560054 


1729.Ell.gz43 217797 


F 


M00056082A:E12 


UC 2 -Norm C ol nn 


974 


644712 


1729.Kll.gz43 217803 


F 


M00056091C:C06 


UC2-NorrriCo1on 


975 


646696 


1729.A12.gz43 217809 


F 


M00056077C:H02 


I JC2-NormCo1on 

w v>^.— 1 tUJI III V^i_llUll 


976 


644715 


1729.H12.gz43_217816 


F 


M00056087B:G12 


t JC 2 -Norm Cnlrm 


977 


644965 


1 729.N 12.gz43_2 1 7822 


F 


M0005 6095 C * E02 




978 


645050 


1 729.K 1 3 .gz43_2 17835 


F 


M00056091C:D09 


I JC 2 -Norm Col on 


979 


649514 


1729.N14.gz43 217854 


F 


M00056095CG07 


T TC?-T\TormCn1nn 


980 


570939 


1729.E15.gz43 217861 


F 


M00056082RG07 




981 


647437 


1729.F16.gz43 217878 


F 


M00056083DD1 1 




982 


650547 


1729.L16.gz43 217884 


F 


M00056093 AB 1 2 




983 


649054 


1729.P16.gz43_2 17888 


F 


M0005 6098B • C04 


T TC2-r\IormCrk1nn 


984 


645409 


1729.K17.gz43 217899 


F 


M00056091D:C12 


T JC2-Norm Colon 


985 


613722 


1729.B18.gz43 217906 


F 


M00056079AB0 1 


T TC 2 -Norm Col on 

\J liUl 111V>VJI(JII 


986 


418763 


1729.L18.gz43_217916 


F 


M00056093AF08 

-L ▼ jl vy vy vy \J \J *s x X . X \J\3 


T JC2 -Norm Colon 


987 


619635 


1729.K20.gz43_217947 


F 


M00056091DE10 


I JC2-NormCo1on 


988 


645131 


1729.L20.gz43_2 17948 


F 


M00056093BD03 


T IC 2 -Norm Col on 


989 


451899 


1729.M22.gz43 217981 


F 


M0005 6095 A * C 02 

a. »iv vy \y *y vy vy >^ b/ii t v_y \j ^ 


1 JC2-Norm Colon 


990 


597647 


1 729.P22.gz43_2 1 7984 


F 


M00056098DA08 

a ▼ x vy vy vy ^y vy vy ^ km x-^ . u 


T JC2-NormCo1on 


991 


560860 


1717.F01.gz43_218406 


F 


M00055998BD06 

x tav/ vy vy ^y ^ ^ jl# * X-^ vy \j 


I JC2-NormCo1on 


992 


640709 


1717.P01.gz43_218416 


F 


M00056009DC12 

-*» »a v vy vy vy vy vy *r m^f + x 


UC2-NormColon 


993 


447802 


1717.O02.gz43 218431 


F 


M00056008B:G05 


UC2-NormCo1on 

\^v>-z> i vyi iil\_^wiUll 


994 


641736 


1717.103.gz43_218441 


F 


M00056001D:B06 


UC2-Norm Colon 


995 


464498 


1717.J03.gz43_218442 


F 


M00056003B:G11 


TJC2-NormColon 


996 


452775 


17 17.K03.gz43_2 18443 


F 


M00056004D:F03 


UC2-NormCoIon 


997 


561836 


1717.L03.gz43_218444 


F 


M00056005C:F11 


UC2-NormCo1on 


998 


444454 


1717.K04.gz43 218459 


F 


M00056004D;F12 


UC2-NormColon 

V-y' 1 1 vl 111 v/Ull/11 


999 


639178 


1717.O04.gz43 218463 


F 


M00056008CD04 


I JC2-NormColon 


1000 


513888 


17 17.F05.gz43_2 18470 


F 


M00055998CD04 


T JC2-NormCo1on 

\J\^r£* IlUl 111V_^C1U11 


1001 


643971 


17 17.105.gz43_2 18473 


F 


M00056001DG12 

a ▼ x vy vy vy vy vy vy x x_x . \j x ^ 


UC2-NormColon 


1002 


644824 


1717.J05.gz43_218474 


F 


M00056003C:C09 


UC2-NormColon 


1003 


448418 


1717.N05.gz43 218478 


F 


M00056007C:A03 


UC2-NormColon 


1004 


644030 


1717.O05.gz43 218479 


F 


M00056008C:D08 


UC2-NormColon 


1005 


644972 


17 17.P05.gz43_2 18480 


F 


M00056010A:E05 


UC2-NormColon 


1006 


649556 


1 71 7.B06.gz43_2 18482 


F 


M00055993D:F12 


UC2-NormColon 


1007 


644611 


17 17.106.gz43_2 18489 


F 


M00056002A:A03 


UC2-NormColon 


1008 


645162 


1717.J07.gz43_218506 


F 


M00056003C:H10 


UC2-NormColon 



96 




Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1009 


470462 


1717.L07.gz43 218508 


F 




u c z -in ormColon 


1010 


645662 


1717.A08.gz43 218513 


F 


M000S ^909 A -HOB 


ulz -in ormcolon 


1011 


649259 


1717.H08.gz43 218520 


F 


M00056000P DOQ 


u v>z-in ormL^oion 


1012 


650018 


1717.D09.gz43 218532 


F 


M00055996CA02 


u v^z-in onrii_/Oion 


1013 


640645 


1717.D10.gz43 218548 


F 


M00055996CA08 


vJv^Z-xNUIIIlv^UlOIl 


1014 


643477 


1717.M10.gz43 218557 


F 


M00056006D:E03 


T TP9-r\Torn-iPolrm 
uv»/Z, i> ui uiv^uiun 


1015 


541214 


1717.Ell.gz43 218565 


F 


M00055997DA11 


T TP 9 -NJnrm PrJ r»n 


1016 


651083 


1717.Mll.gz43 218573 


F 


M00056006D:F05 


VJ V^^. 1> VJl lllv^UlVJll 


1017 


549889 


1717.G12.gz43 218583 


F 


M00055999DC05 


T TP?-l\JnrmPrilnn 


1018 


558856 


1717.D13.gz43 218596 


F 


M00055996DD07 


T lC7-Nr\rmCrAr%n 
\J -IN Ui IIll^UIvJIl 


1019. 


644022 


1717.N13.gz43 218606 


F 


M00056007DF07 


u \_/Z -in ui iiiv^uiun 


1020 


644612 


1717.013.gz43 218607 


F 


M00056008DD09 


T ir?-Mnrmrnlnn 

uv^'iN ui mv^oxun 


1021 


644949 


1717.P13.gz43 218608 


F 


M00056011A CI 1 


T Tr , ?-T\Tnrmr , n1r»n 

u v^-Zr-iN uirnv^oiuii 


1022 


643808 


1717.C14.gz43_218611 


F 


M00055995AG09 


v^z-in uniiv^uion 


1023 


642260 


1717.014.gz43 218623 


F 


M00056008DD03 


U V_/Z.~1N UI IIlv^UlUIl 


1024 


651119 


1717.B15.gz43 218626 


F 


M00055994BB 12 


u v^z-in uniiv^uiun 


1025 


477521 


1717.C15.gz43_218627 


F 


M00055995BA02 


\j v_/Z -in ui in uiun 


1026 


587106 


1717D15.gz43_2 18628 


F 


M0005 5 996D G02 




1027 


419443 


1717.I15.gz43 218633 


F 


M00056002DF09 




1028 


643440 


1717.L15.gz43 218636 


F 


M0005 6006 A E04 

i. ▼ a vy vy v/ «y vy \y \y v/.iV . i /\ 


UvZ n UIIIl\_/UiUIl 


1029 


555736 


171 7.M16.gz43_2 18653 


F 


M00056007AB02 


KJx^Z, -IN UI in v^oiun 


1030 


644210 


1717.G17.gz43_218663 


F 


M0005 5 999D *H04 


vJ 1 > UI 1 1 1\_* UiUll 


1031 


449261 


1717.I17.gz43 218665 


F 


M00056003AA08 


T TP9-r\Tnrmr , n1nn 


1032 


643498 


1717.J17.gz43 218666 


F 


M00056004B:F04 


1 1(^2^01-111(^01011 
vy V>^. UllllVvUIUll 


1033 


517274 


1717.P17.gz43 218672 


F 


M00056011B:A06 


T ir , 9-l\TnrmPr»1nn 

W ^ i UI 1 1 IV/UIUH 


1034 


532307 


1717.L18.gz43 218684 


F 


M00056006AG09 


T jr2-Normro1on 


1035 


643288 


17 17.A19.gz43_2 18689 


F 


M00055993AE02 


T JC2-Normro1on 


1036 


478025 


1717.F19.gz43 218694 


F 


M0005 5 999 A: A04 


T jr , 2-Normro1on 

vy V_/^. i ^ vyl lUVw^vylUll 


1037 


447822 


1717.G19.gz43 218695 


F 


M00056000AB02 


T IP 9 -Norm Pol nn 


1038 


641580 


1717.I19.gz43 218697 


F 


M00056003A:B07 


T IP2-NormPo1on 


1039 


489426 


1717.P19.gz43_218704 


F 


M00056011BE10 


T JP2-NormPo1on 

kj y_ —j. ^ ui iiiv^uiuil 


1040 


650534 


1717.C20.gz43 218707 


F 


M00055995CF05 


T IP 9 -Norm Pnl on 

vJ V/A. liUl Iiiv>UiUll 


1041 


644990 


1717.E20.gz43 218709 


F 


M00055998A.F06 


UC2-NormPo1on 


1042 


207530 


1717.F20.gz43 218710 


F 


M00055999A:B07 


UC2-NormPolon 


1043 


645197 


1717.120.gz43_218713 


F 


M00056003A:C01 


UC2-NormColon 


1044 


414739 


1717.K20.gz43_218715 


F 


M00056005B:H08 


UC2-NormColon 


1045 


592941 


1717.L20.gz43_218716 


F 


M00056006B:B05 


UC2-NormColon 


1046 


447556 


1717.H21.gz43_218728 


F 


M00056001C:F07 


UC2-NormColon 


1047 


238586 


1717.N21.gz43_2 18734 


F 


M00056008B:E04 


UC2-NormColon 


1048 


644612 


1717.E22.gz43_218741 


F 


M00055998B:A04 


UC2-NormColon 


1049 


646387 


1717.F22.gz43_2 18742 


F 


M00055999A:H10 


UC2-NormColon 


1050 


647443 


1717.G22.gz43_2 18743 


F 


M00056000A:E1 1 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


1 CLUSTER 


SEQ NAME 


ORIEN 
T 


[ 

CLONE ID 


LIBRARY" 


1051 


555172 


1717.P22.gz43 218752 


F 


M0005601 ip-nnd 


UCz-iNormColon 


1052 


552846 


1717.D23.gz43 218756 


F 


M00055QQ7A -Fnd. 

1T1WUJJ77 //\.rU*T 


UCz-JNormColon 


1053 


639377 


1717.B24.gz43 218770 


F 


M00055994PFOS 


t tpo \T/\««r A u« 
ui^z-iNontii^olon 


1054 


449713 


1717.M24.gz43 218781 


F 


M00056007RP 1 0 

muuv/juuu / J_> , v^< 1 \j 


u -in orm v^oiori 


1055 


456627 


1717.N24.gz43 218782 


F 




u^z-iNornicolon 


1056 


646143 


1717.024.gz43 218783 


F 


M00O56OOQD' AO? 


ui^z-i\joniiVyOion 


1057 


639629 


1718.H03.gz43 218827 


F 


M0005602 1 C*H08 


ulz-in 01111 v^oion 


1058 


643744 


1718.D04.gz43 218839 


F 


M00056016CF1 1 


UL/Z"i> ui iiii^uion 


1059 


397399 


1718.P04.gz43 218851 


F 


M00056032B-H08 


T Tr , 7-"NfnrmPrt1nri 


1060 


650391 


1718.G06.gz43 218874 


F 


M0005 6020C HO^ 


UV/Z"i>l onrii^oion 


1061. 


447659 


1718.N06.gz43 218881 


F 


M00056029D *D08 

WJL vy vy vy — f ^ u . XWO 


UV^Z-lNOlIIll^OlOn 


1062 


644967 


1718.D07.gz43 218887 


F 


M0005 60 1 6D E02 


T TP?-T\Tormr'n1nTi 

uv/Z-i>i uiiiiL/ oion 


1063 


545509 


1718.G08.gz43 218906 


F 


M00056020D:D03 


T IP7-Normr"n1rin 


1064 


643975 


1718.D09.gz43 218919 


F 


M00056017AB08 

x. ti \j vy v/»y v/ vy jl / / i ^ \j (j 


vj \^z.*i>uiiiiv>oivjri 


1065 


645883 


1718.Mll.gz43 218960 


F 


M00056028DF06 


u Vy z -in uiinv_/Oion 


1066 


538808 


1718.B12.gz43 218965 


F 


M000560 1 3AC09 

I'l" Vy Y/«y Vy vy A «y./ri. V/ 


T TP 7 -Nftrm Pr»l rin 


1067 


425455 


1718.P13.gz43 218995 


F 


M00056033AC1 1 


T IP7-NTnrmPr»1nn 

vj VxZ~in ur mv-'Oion 


1068 


467057 


1718.M20.gz43 219104 


F 


M00056029BH03 


u VyZ ~in ui inv^oion 


1069 


640181 


1718.B21.gz43 219109 


F 


M0005 60 1 3D • D07 

J.TAV/V/V7«^ VJV7 X «y XV . i/U / 


UV-'Z-lNOlIIlVyOlOIl 


1070 


650348 


1718.F23.gz43 219145 


F 


M00056070BPOQ 

iTlv Xy v/ Vy Vy vy XJ . Vw Vy J' 


yj V/ z -in orni v^Oion 


1071 


451993 


1718.P24.gz43 219171 


F 


M00056033PHOQ 

IflUUV/J VJV/ J J . X IV/ Zs 


TTP7 XTr*rrMP/\1rt« 


1072 


647976 


1719.E01.gz43 219176 


F 


M0005 603 9 A • F04 


T TP7-T\TnrmPntnri 

u Vy z -in ui iiiv_/OXOH 


1073 


557676 


1719.I01.gz43 219180 


F 


M00056045AD03 


T TP7-TvTnrmPn1nn 

Uv/Z -in uiiiiVsOion 


1074 


644639 


1719.N01.gz43 219185 


F 


M00056050C ■ A09 


T TP7-\Jr*rmPn1r\n 

u V/Z-in uiiiiVxOion 


1075 


644738 


1719.J02.gz43 219197 


F 


M0005 6046 A * B04 


T TP7-r\TnrmPr»lr»ri 
U VyZ -in uniiVyOion 


1076 


645973 


1719.A03.gz43 219204 


F 


M00056033DF06 


T TP7->Jr*TmPrt1rm 
KJ V/Z -IN UI lIIx_/UlOIl 


1077 


550376 


1719.B03.gz43 219205 


F 


M00056035BA04 

a ▼ 1 vy vy vy „y vy vy »y «y j_# . ^tlv ™ 


T TP7-lvTr*rmPn1r*n 

UL/Z -in unnVy ui on 


1078 


646088 


1719.D03.gz43_2 19207 


F 


M0005 603 7B • G02 


T TP 7 _7SJ r»rm Pnl rm 

uvz _ in uiiiiVyOion 


1079 


506920 


1719.P03.gz43 219219 


F 


M00056053AFO1 


T TP 7 -Nnrm Pi-»l/-in 
U L/Z -IN Olllll^OIOn 


1080 


642198 


1719.A04.gz43 219220 


F 


M00056033DG07 


T TP7-TvTnrmPn1r»n 

u wz-in uniiVyOion 


1081 


649293 


1719.E04.gz43 219224 


F 


M00056039BC03 


T TP 7 _AJr*rm P/-»1 r»n 
U V./Z-1NUI IIlVsOlOIl 


1082 


641193 


1719.H04.gz43 219227 


F 


M00056043DE03 

x ▼ jl vy vy vy *y vy vy t ju^ • * > » * J 


T IP7-l\TnnnPn1rin 

V>Z- 1> vJl 11 1 U1UI1 


1083 


461517 


1719.J04.gz43 219229 


F 


M00056046A:B12 


UP2-NormPn1nn 


1084 


645070 


1719.K04.gz43 219230 


F 


M00056046D:C11 


UC2-NortTiPn1on 


1085 


645942 


1719.M04.gz43 219232 


F 


M00056049B:E11 


UC2-NormPolon 


1086 


645151 


1719.O05.gz43 219250 


F 


M00056051D:H02 


UC2-NormColon 


1087 


566745 


1719.B06.gz43 219253 


F 


M00056035B:D11 


UC2-NormCoIon 


1088 


642184 


1719.M06.gz43_2 19264 


F 


M00056049C:A08 


UC2-NormColon 


1089 


649429 


1719.O06.gz43 219266 


F 


M00056052A:A11 


UC2-NormCoIon 


1090 


549786 


1719.B07.gz43 219269 


F 


M00056035B:E10 


UC2-NormColon 


1091 


452102 


1719J07.gz43_219277 


F 


M00056046A:G08 


UC2-NormColon 


1092 


644819 


1719O07.gz43_2 19282 


F 


M00056052A:C07 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 




T TDD A DV 

LIBRARY 


1093 


453804 


1710 FOR otA'X 9 1 Q98fi 


x? 
r 


M00056039C:D05 


UC2-NormColon 


1094 


650348 


1 7 1 Q TC0Q cr741 9 1 0^ 1 ft 

1/1 y .IVV/l/.gZ^TO Z I 7J 1U 


r 


M00056047A:E09 


UC2-NormColon 


1095 


644894 


1710 100 07-4^ 910^11 


T? 

r 


MU0056048A:D12 


UC2-NormColon 


1096 


646048 


1710 A 1 0 9 1 1 A 


T7 

r 


A Jf AAAC^AO j1 A /^i i r\ 

M00056034A:G10 


UC2-NormColon 


1097 


638912 


1 7i o n i n f»-7zi^ 910^10 


c 
r 


M00056037C:G1 1 


UC2-NormColon 


1098 


644817 


1710 F10 rr^d.^ 910^91 


17 
r 


MU0056041B:F03 


T Tsf*%^% ^ T a+**S * 

UC2-NormColon 


1099 


645973 


1710 F11 crr&l 910^^/; 
1 / 1 y .E* 1 1 .gZHO Z 1 7JJO 


r 


M0U056039C:G05 


UC2-NormColon 


1100 


643748 

UtJ / to 


1710 Mil c*'7Zl'? 91Q'*/M 

1 / 1 f.lYll 1 .gz^fo ziy.544 


r? 
r 


M0005 6049C :H04 


UC2-NormColon 


1101 




1710 >J11 rr^AI 91Q7/f^ 


T7 

r 


AlAAAC^Af 1 A .r'A') 

M0005605 1 A:C03 


UC2-NormColon 


1102 


647859 


1710 F 1 9 rr^Zl'* 9101^9 

i / 1 y.n iz.gz^t j> ziyjjz 


17 
r 


M00056039C:H01 


UC2-NormColon 


1103 


451544 


1 710 1 1 9 t*74^ 91Q^G 


17 
r 


MUUU!>oU4oi5: J b0 1 


UC2-NonnColon 


1104 


528616 


1 71 0 N1 9 a-74^ 91Q^A1 

1 / !7.1>lZ.gZHj Z17JD1 


T7 

r 


MUUU3oU3 1 A:ril 1 


UC2-NormColon 


1105 


460727 


1719 G13 a74? '910370 


17 

r 


MUUU3oU4JA:AU9 


UC2-NormColon 


1106 


468689 


1710P14»74^ 91Q389 

1 / l7.Vyi*t,gZHj Z17JOZ 


17 
r 


MUUU3oUJoU:r506 


UC2-NormColon 


1107 


452729 


1710 PIS otAI 91Q7Q8 


r 


MUUU3oU3oLi:DUl 


UC2-NormColon 


1108 


481614 


1710 D 1 5 cr-rdl 9 1 Q3QQ 


17 


lVlUUUjoUjorJ.Cjuo 


UC2-NormColon 


1109 


649900 


1719H15cr74^ 910403 
1/ i7.ni j.gztj zi"*+Uj 


17 




UCz-NormColon 


1110 


645375 


1 71 0 "Ml 5 ari^ 91QzlOQ 


r 


MUUUjou3 IxiioOi 


XT 1 

UC2-NormColon 


1111 


497101 

t7 / 1 \J 1 


1 71 0 PI 5 citA'X 910411 


T7 

r 


MUUUjoO!)3L):Du7 


UC2-NormColon 


1112 


55639S 

-J -7 VJ 


1710 fr 1 f\ avA'X 9 1 OA 1 ft 

1 / 1 y . \j i o .gzH-? z l V4 l o 


T7 

r 


MU0U56U43A:H12 


UC2-NormColon 


1113 


64543 1 


1710 M 1 6 o-t-4^ 9 1 Q494 

1 / x y .1V1 lU.gZHO Z17*tZt' 


17 


MUUU j oU4yD :L)03 


T T/""* ^ XT y 1 

UC2-NormColon 


1114 


452325 


1 710 A 1 7 <*741 91049ft 

1 / 17.A1 / .gZ*T D ZI7HZ0 


p 
r 


lVyfnnn^^ni /ir.r\A7 


UC2-NormColon 


1115 


447645 


1710 R17 07^ 91Qzl9Q 

1 / 17.D1 / .gZtJ Zl7*fZ7 


p 


MUUUjoU3 jLI:CUo 


¥ T/^^ XT 1 

UC2-NormColon 


1116 


644054 


1 710 P1 7 0^41 91043ft 


p 


"\ >f n n n ^ f\ i a t\ . "c r\ « 


UC2-NormColon 


1117 


64 1 7 1 5 


1710 A/117 9 1 QAAO 

1 / 1 ".1V11 / .gZ*TJ Z 1 7M-HU 


p 
r 


MUUU3oU49LI:Jb0 1 


T T/^*^ XT /~1 f 

UC2-NormColon 


1118 




1710 P 1 7 oryA'X 9 1 QAA1 


p 
r 


MUUU5o054A:B06 


T T/™*^ XT i 

UC2-NormColon 


11 19 

X X X .7 


465580 


1 71 0 PI ft crvAl 910/1^0 
1 / l7.r 1 o.gZ4 D Z1V4J7 


p 
r 


H1AAACZACH A . T~\n n 

MU0U56054A:D09 


UC2 -NormColon 


1 120 


6438Q7 


1710 fr 1 Q rr-yZf* 9 1 


p 
r 


MU0U56043B:E03 


UC2-NormColon 


1171 




171Q 1 1Q fY-yzl^ 9 10/191 
1 / lV. J Lfl5'.gZ'U_z 1^4 / 1 


f 

r 


M00056048D:B09 


UC2-NormColon 




4679 


1710 P 1 Q ct'rA'X 9 1 

1 / Ly. r Ly .gZ^D Z, Ly*} fj 


p 
r 


X jffAAAC ZT AC A A rni 

MO0O56054A.E03 


UC2-NormColon 


1123 


645530 


1710 P90 cr-7-4^ 9 1 Q47ft 


p 
r 


MUUU3oU37A:Dl 1 


T T/Tl XT y 1 

UC2-NormColon 


1124 


648390 


1710 A91 ovA'X 91QZIQ9 

X / 1 37./\Z 1 .gZ*TJ Z1747Z 


p 


X JAAAC Hf\n A /~~* . T TAf> 

MUUU3oU34C:rl09 


UC2-NormColon 


1125 


467597 


1719 R9 1 07-43 9 1 0403 

1/ 1 37.JOZ 1 .gZt J Zl7 t T7J 


F 
r 


1V1UUU j ovJo DU.LrlU 


T T/^O XT - t 

U C2-N ormColon 


1126 


468330 

■ W YJ —J V/ 


1719 D91 a741 910405 

1 / 17.L/Zl.gZ 4 tJ Z 1 ^H7 J 


p 


n ^ n *2 ft f% . i7n i 
jYlUUU3oU.3cSLJ.rU3 


T T/^O "XT /•> i 

UCz-iNormColon 


1127 


454438 


1719F21&743 910406 


p 
r 


lVlUUU30U4Ul^.r>Uj 


U -N ormColon 


1128 


402476 


1719.121.gz43_219500 


F 


M00056045D.C09 


UC2-NormColon 


1129 


558439 


1719.M21.gz43 219504 


F 


M00056050A:D02 


UC2-NormColon 


1130 


595066 


1719.N21.gz43 219505 


F 


M00056051C:C09 


UC2-NormColon 


1131 


646687 


1719.E22.gz43 219512 


F 


M00056040C:C12 


UC2-NormColon 


1132 


452026 


1719,H22.gz43_219515 


F 


M00056044D:F01 


UC2-NormColon 


1133 


639255 


17I9.K22.gz43_219518 


F 


M00056047C:E03 


UC2-NormColon 


1134 


612572 


1719.L22.gz43 219519 


F 


M00056049A:C11 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1135 


641786 


1719.022.gz43_219522 


F 


M00056052D:G01 


UC2-NormColon 


1136 


647306 


1719.P22.gz43_219523 


F 


M00056054A:G07 


UC2-NonnColon 


1137 


640559 


17 19.A23.gz43_2 19524 


F 


M00056034D:E09 


UC2-NormColon 


1138 


550365 


1719.D23.gz43 219527 


F 


M00056039A:A02 


UC2-NormColon 


1139 


462293 


1719.123.gz43_219532 


F 


M00056045D:E06 


UC2-NonnColon 


1140 


587106 


1719.N23.gz43_2 19537 


F 


M00056051C:H09 


UC2-NormColon 


1141 


483266 


1719.F24.gz43_219545 


F 


M00056042B:B05 


UC2-NormColon 


1142 


490898 


1719G24.gz43_219546 


F 


M00056043C:G03 


UC2-NormColon 


1143 


640603 


1719.N24.gz43_219553 


F 


M00056051D:A07 


UC2-NormCoion 


1144 


643510 


1720G01.gz43_2 19562 


F 


M00056062D:F12 


UC2-NormColon 


1145 


645049 


1720.A02.gz43_2 19572 


F 


M00056054B:G05 


UC2-NormColon 


1146 


466795 


1720.C02.gz43_219574 


F 


M00056057B:D01 


UC2-NormColon 


1147 


14157 


1720.102.gz43_2 19580 


F 


M00056066B:H04 


UC2-NormColon 


1148 


558494 


1720.A04.gz43_2 19604 


F 


M00056054C:C09 


UC2-NormColon 


1149 


644927 


1720,C04.gz43_219606 


F 


M00056057B:E12 


UC2-NormColon 


1150 


648688 


1720.C05.gz43_2 19622 


F 


M00056057C:B02 


UC2-NormColon 


1151 


454906 


1720.A06.gz43_2 19636 


F 


M00056054C:E12 


UC2-NormColon 


1152 


644597 


1720.D06.gz43_219639 


F 


M00056059A:A02 


UC2-NormColon 


1153 


460047 


1720K06.gz43_2 19646 


F 


M00056069A:F01 


UC2-NormColon 


1154 


62053 


1720.C08.gz43_2 19670 


F 


M00056057C:D06 


UC2-NormColon 


1155 


649054 


1720.D08.gz43_2 19671 


F 


M00056059A:B01 


UC2-NormColon 


1156 


644225 


1720.K08.gz43_219678 


F 


M00056069A:F06 


UC2-NormColon 


1157 


447635 


1720.C09.gz43_2 19686 


F 


M00056057C:E01 


UC2-NormColon 


1158 


642166 


1720.E10.gz43_219704 


F 


M00056060D:H11 


UC2-NormColon 


1159 


473007 


1720Dll.gz43_219719 


F 


M00056059B:D10 


UC2-NormColon 


1160 


551181 


1720.Mll.gz43_2 19728 


F 


M00056072B:C06 


UC2-NormColon 


1161 


644928 


1720.C12.gz43_2 19734 


F 


M00056057C:E12 


UC2-NormColon 


1162 


644916 


1720.E12.gz43_219736 


F 


M00056061A:E05 


UC2-NormColon 


1163 


644701 


1720.L12.gz43_219743 


F 


M00056070C:B09 


UC2-NormColon 


1164 


487567 


1720O12.gz43_219746 


F 


M00056074C:H08 


UC2-NormColon 


1165 


641801 


1720.E13.gz43_219752 


F 


M00056061B:A03 


UC2-NormColon 


1166 


651108 


1720.013.gz43_219762 


F 


M00056074C:H09 


UC2-NormColon 


1167 


447676 


1720A14.gz43_2 19764 


F 


M00056055A:E04 


UC2-NormColon 


1168 


645076 


1720.K14.gz43_2 19774 


F 


M00056069B:E04 


UC2-NormColon 


1169 


641469 


1720.C15.gz43_219782 


F 


M00056057D:E11 


UC2-NormColon 


1170 


375577 


1720.G15.gz43_219786 


F 


M00056063C:F10 


UC2-NormColon 


1171 


558154 


1720.C16.gz43_219798 


F 


M00056058A:B07 


UC2-NormColon 


1172 


650549 


1720.G16.gz43_219802 


F 


M00056063C:G11 


UC2-NormColon 


1173 


461351 


1 720.1 16gz43_2 19804 


F 


M00056066D:G04 


UC2-NormColon 


1174 


640818 


1720.A18.gz43_219828 


F 


M00056055B:B06 


UC2-NormColon 


1175 


455716 


1720.118.gz43_219836 


F 


M00056067A:A12 


UC2-NormColon 


1176 


644822 


1720.J18.gz43_219837 


F 


M00056068D:A06 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


mi 


467780 


1720.C19.gz43 219846 


F 


iTXVy v/Vy_J UuJ O/i. r UO 




1178 


446415 


1720.F20.gz43 219865 


F 




uL/Z -in orm v^oion 


1179 


553923 


1720.H20.gz43 219867 


F 




uLz -in orm oion 


1180 


645900 


1720.I21.gz43 219884 


F 




u -in onTH^oion 


1181 


649364 


1720.M21.gz43 219888 


F 


M0005607?i>FOR 


Ul^Z-lNOlTxlVyOlOn 


1182 


643348 


1720.N23.gz43 219921 


F 


M00056074R-n 19 


TTP9 Mnrmrnlnn 


1183 


648654 


1731.A01.gz43 219940 


F 


M00056 1 20C H04 


u v^z "IN ui 1 1 iv^oion 


1184 


648547 


1731.C01.gz43 219942 


F 


M00056123BG05 




1185 


463824 


1731.I01.gz43 219948 


F 


M00056 1 ^D-DOO 


u v,' z -in on n v> uiuii 


1186 


648721 


1731.N01.gz43 219953 


F 


M00056 1 40C F04 


T TP9-MrirrnfYi1riri 
U V^Z IN UI IIlv>U10iX 


1187- 


648481 


1731.O01.gz43 219954 


F 


M00056 1 47 A F01 




1188 


644037 


1731.A02.gz43 219956 


F 


M000S6 1 ?0DFn 1 


ui^z -in ui 1 1 1 v_/ uiun 


1189 


635354 


1731.H02.gz43 219963 


F 


M00056132RC1? 

IV 1\J\J\J ~J \J lJZ.U.V' J. A. 


U \-/Z~lN UI IIIV^UIUII 


1190 


711493 


1731 J02 ez43 219965 


F 


IViUUUJOl JtL/.VJU / 


uv^z-fN onni^oion 


1191 


648742 


1731L02sz43 219967 


F 


M000S6 1 ^7DH05 


U V^Z-INUI ini^oion 


1192 


476380 


1731.F03.gz43 219977 


F 


M000561 29PF09 


\J V^Z—1N UI IIl^UlUIl 


1193 


649617 


1731 J03 ez43 219981 


F 


MOOO^l ^4D PM 1 

lVXV/W^ VJ X JtL/.\Jl 1 


uv^z -in ui n i v^o ion 


1194 


639555 


1731 B04 sz43 219989 


F 


M000S6197RA07 

IVIWUVyJ \J X Z.Z.XJ ,Au / 


T TP 9 -Mnrm P nl r»n 

vj v_/Z -in urrnv^oiuii 


1195 


465127 


1731 D04 ez43 219991 


F 


M000S61 9^R*F0 1 


T TP 9 -Mnrm Pr»1 r»n 

\j i^z-iN orniv-/Uion 


1196 


458938 


1731 L04 sz43 219999 


F 

X 




TTP9 XTon-nPnlnn 

U v>Z -In oiiiiv^oion 


1197 


453726 


173LG05.gz43 220010 


F 


M00056 1 11 A FOQ 


T TP9-t»JrirfnPrilrin 
Wv>Z-lNUIIIlV^UIUIl 


1198 


557026 


1731.H06.gz43 220027 


F 




T TP9-"NTr*rmPn1r»n 
U V^Z-1N UIIIIV^UIUIl 


1199 


422242 


1731.D07.gz43 220039 


F 


M000561 27A* A 1 0 


T TP 9 - IMorm P r%\ r\n 
U V-/Z -IN UI IIlv^UlUIl 


1200 


456985 


1731.E07.gz43 220040 


F 


M000S6128RA07 


T TP9-\Tr»rmPn1rm 
U V^Z"1N UI IIIV_/U1UI1 


1201 


640695 


1731 107 ez43 220044 


F 


M00056H4AD04 


T TP9->JrirmPri1rin 
%j vz-in uriin^uiun 


1202 


627515 


1731.B08.gz43 220053 


F 


M000561 22RG0Q 


T TP9-fsJnrmPr*lr»n 
U V^Z"IN UIIIIl^/UlUIl 


1203 


648341 


1731.M08.gz43 220064 


F 


M00056 1 19D-F0S 


T TP9-'NTnrmPo1r»n 
U \^Z -IN UHIlV^UlUll 


1204 


464990 


1731.A09.gz43 220068 


F 


M000^61?l A FO^S 

IVlv/UV/JU X ^> 1 z*. . U/KJ ^} 


T TP9-TSJormPn1nn 


1205 


459521 


1731 J10 ez43 220093 


F 


M000S61 liRFOl 

1V1V/UWJU X - '-Hi If! i » 


T TP9_"MnrmPn1nn 

w v_^z -in urinv'Uiuii 


1206 


549691 


1731.L10.gz43 220095 


F 


M0005 6 1 3 8B • F05 


T TP9-MnrmPn1rin 

VJV^Z. l^iUllIi^/UIUIl 


1207 


514838 


1731.M10.gz43 220096 


F 


M0005 6 1 3 9D H04 


I TP2-NfirmPn1nn 


1208 


452738 


1731.O10.gz43 220098 


F 


M00056 142CA09 


T TP9-MnrmPnlnn 


1209 


644928 


1731.Dll.gz43 220103 


F 


M00056127AH03 


T TP9-NnrmPn1nn 


1210 


460190 


1731.B12.gz43 220117 


F 


M00056122CH12 


T TP ? -Nnrm Pnlnn 


1211 


562378 


1731.G12.gz43 220122 


F 


M00056131BE01 


I TP2-NormPo1on 


1212 


647949 


1731.P12.gz43_220131 


F 


M00056144D:C05 


UC2-NormColon 


1213 


557419 


1731.B13.gz43_220133 


F 


M00056122D:B07 


UC2-NormColon 


1214 


638934 


173LD13.gz43_220135 


F 


M00056127B:E11 


UC2-NormColon 


1215 


646785 


1731.F13.gz43 220137 


F 


M00056130B:E11 


UC2-NormColon 


1216 


647248 


1731.E14.gz43_220152 


F 


M00056128C:F02 


UC2-NormColon 


1217 


470006 


1731.114.gz43_220156 


F 


M00056134C:C06 


UC2-NormColon 


1218 


648311 


1731.K14.gz43J220158 


F 


M00056137B:F07 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


> CLUSTER 


SEQ NAME 


ORIEN 
T 


[ 

CLONE ID 


T TRRAPV 


1219 


649299 


1731.F15az43 220169 


F 

X 


Kf ooo s f. i ^ op -r* i 7 


UCz-NormColon 


1220 


478393 


1731 K15 ez43 220174 


F 


IVlwUwOOlO /Jj.F 1 1 


T T/^O XT i 

UC2-NormColon 


1221 


647290 


1731 F16 sz43 220185 


F 

X 


mooo^i ^op-i^nc 

iVJ.WWW.JO 1 jUV^.rUo 


UC2-NormCoIon 


1222 


649735 


1731 G16 ez43 220186 


x 


ivxwwwjoi.5 ii^.riUj 


T T/^^ "XT i 

UCz-JNormCoIon 


1223 


646039 


1731.J16.gz43 220189 


p 


mooos^i ^p-ph^ 

ivxwww J \j l j j . v^U J 


UCz-lNormColon 


1224 


646581 


1731.M16ez43 220192 


F 


MOOOS^I AHA -PI 7 


UL/Z-lNormColon 


1225 


470667 


1731N16sz43 220193 


F 

X 


mooos^ i a i r*-HYi i 


U -IN ormColon 


1226 


642650 


1731J17.gz43 220205 


F 


Mooosfii ^sp-pox 


u z -in ormc olon 


1227 


642275 


173LK17.gz43 220206 


F 


M000S61 ^7P*A01 


u i^z -in orm^oxon 


1228 


651015 


1731.L17.gz43 220207 


F 


J.VXV/WW.JW 1 JOL/ . vjvjo 


u cz-in orm^oion 


1229 


642936 


1731.H18.gz43 220219 


F 


IVIUUUJ \J IJJ XJ . X V/ 1 


u\^z-iNormL,oion 


1230 


642204 


1731.A19.gz43 220228 


F 




TT/^9 \UrmrAUn 


1231 


557867 


1731J19.gz43 220237 


F 




u i^z -in ormuoion 


1232 


639395 


1731.K19.gz43 220238 


F 


1V1UV/UJ OlJ / . 13 V/ 1 


ui^z-JNoriiiL>oion 


1233 


645367 


1731.K20.gz43 220254 


F 


mooo 5 6 1 ^ 7r • no ? 


u ^ z -in onni^oion 


1234 


647232 


1731L20gz43 220255 


F 


IVlUUvJD 1 J7A.L/1Z 


u l>z -in ormL-oion 


1235 


647431 


1731 E21 ez43 220264 


F 

X 


1V1UUUJO 1 Z,!7/\. vJU 1 


u l^z -in ormi^olon 


1236 


708175 


1731.I21.gz43 220268 


F 

X 


Moons f% m.n-RH7 


u L.Z-JN ormcoion 


1237 


503122 


1731.A22.gz43 220276 


F 


M000S61 11 A- AOS 


u i^z-jn ormL/Oion 


1238 


515350 


1731.C22.gz43 220278 


F 


M00OS6 1 74DF06 


T T/^7 \TrtrtnPrt1/>« 


1239 


556336 


1731 F22 ez43 220281 


F 


MOOOS^l ^On-FOA 


u ^z -jn ormcoion 


1240 


589098 


1731A23ez43 220292 


F 

X 


MOOOS^ 1 11 A ROS 


u L^z -IN orm v^olon 


1241 


531461 


1731B23ez43 220293 


F 


MOOOS^ 1 7^R*F07 


u^z-JNonTH^oion 


1242 


455808 


1731.C23.gz43 220294 


F 


MOOOSfi 1 ?4.n-O0'* 

1V1 WWW-JO lXHJLr.VJUj 


u -in omn^oion 


1243 


643510 


1731.D23.gz43 220295 


F 


MOOOSfi 1 97TYni O 

1V1WWWJO 1Z, / J-/.VJ1U 


u L/Z-jn orniL,oion 


1244 


648996 


1731 J23 ez43 220301 


F 

X 


MOOOS^I A R1 1 

IVIWUWJO I-50/\..x3 1 1 


ucz-lNormColon 


1245 


446164 


1731N23ez43 220305 


F 
i 


iviwwwjo i*+ ii^.riUo 


u L,z -in ormcolon 


1246 


419114 


1731.023.gz43 220306 


F 


M000S6 1 4.^ A HOR 
iviwwwjo i tjA.nuo 


u i^z-in ornn^oion 


1247 


648076 


1731.A24.gz43 220308 


F 


M000S6 1 11 A no? 

1V1WWWJO lZ-4fc/\.l-IWZ 


ui^z-iNonTicoion 


1248 


649482 


1731.I24.gz43 220316 


F 


MOOOSfi 1 ^4DFO? 


uv_.z-in ormi^oion 


1249 


646323 


1732.H01 gz43 22033 1 


F 


M000S61 SSR- AO? 


u i^z-in orniv^oioii 


1250 


550714 


1732.A05.gz43 220388 


F 


M00OS6146AAOO 


u i-/Z -in orni v^oion 


1251 


649482 


1732J05.gz43_220397 


F 


M00056158CD1 1 


T TP? _Mr»rm C^oAnn 


1252 


650987 


1732.J06.gz43 220413 


F 


M00056158CF09 




1253 


644224 


1732.F07.gz43_220425 


F 


M00056 1 53 A*G04 




1254 


647412 


1732.M07.gz43 220432 


F 


M00056162C:F02 


UC2-NormColon 


1255 


416624 


1732.108.gz43_220444 


F 


M00056157A:B11 


UC2-NormColon 


1256 


644801 


1732F09.gz43_220457 


F 


M00056153A:H07 


UC2-NormColon 


1257 


650249 


1732.G10.gz43_220474 


F 


M00056154B:F11 


UC2-NormCoIon 


1258 


465734 


1732.Cll.gz43 220486 


F 


M00056149C:A02 


UC2-NormColon 


1259 


605761 


1732.Ell.gz43 220488 


F 


M00056152A:B11 


UC2-NormColon 


1260 


649782 


1732Mll.gz43_220496 


F 


M00056162D:A01 


UC2-NormColon 
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Table 2 



SEQ 


cluster! 




ORIENI 






|IDNO 




SEQ NAME 


T 1 


CLONE ID 


LIBRARY 



449956 | 1732.A12.gz43 220500 



M00056146A.H09 UC2-NormColon 



1262 
1263 



642244 I 1732.I12.gz43 220508 



M00056157B:A08 | UC2-NormColon 



650348 | 1732.L12.gz43 220511 



M00056161C:G06 UC2-NonnColon 



1264 
1265 



461135 I 1732.E13.gz43 220520 



M00056152A.D07 | UC2-NonnColon 



16092 | 1732.L13.gz43 220527 



M00056161C:H10 UC2-NormColon 



1266 650116 | 1732.M13.gz43 220528 



1267 650534 | 1732.013.gz43 220530 



M00056162D.D03 I UC2-NormColon 



M00056165C:B11 UC2-NormColon 



1268 
1269 



640634 | 1732.A14.gz43 220532 



M00056146B:E05 I UC2-NormColon 



446184 | 1732.114:gz43 220540 



M00056157B:H06 UC2-NonnColon 



1270 
1271- 



380477 1732.M14.gz43_220544 



M00056162D:D06 UC2-NormColon 



463966 I 1732.K15.gz43 220558 



M00056160C:F12 UC2-NormColon 



1272 
1273 



649578 I 1732.015.gz43 220562 



M00056165D:D09 UC2-NormColon 



650877 | 1732.C16.gz43 220566 



M00056149C:E10 UC2-NonnColon 



1274 449882 | 1732.F16.gz43 220569 



M00056153B:F11 UC2-NormColon 



1275 650909 | 1732.J16.gz43 220573 



1276 | 645100 | 1732.K16.gz43 2205 74 



M00056159A:E03 UC2-NormColon 



M00056160C:G01 UC2-NormColon 



1277 649883 | 1732.M17.gz43 220592 



1278 554581 | 1732.I19.gz43 220620 



_F_ 
F 



M00056163A:B10 UC2-NormColon 



M00056157C:E10 UC2-NormColon 



1279 467972 | 1732.L19.gz43 220623 



1280 | 488592 | 1732.A20.gz43 220628 



M00056162A.B06 UC2-NonnColon 



M00056146D:D04 UC2-NonnColon 



1281 651075 | 1732.E20.gz43 220632 



M00056152C:G08 UC2-NormColon 



1282 | 649991 | 1732.K20.gz43 220638 



M00056160D:C06 UC2-NonnColon 



1283 649947 | 1732.L20.gz43 220639 



M00056162A:C09 UC2-NoimColon 



1284 224092 | 1732.N20.gz43 220641 



M00056164A:H03 UC2-NormColon 



1285 555634 | 1732.L21.gz43 220655 



1286 639485 | 1732.C22.gz43 220662 



M00056162A:E09 UC2-NormColon 
M00056149D:F06 | UC2-NormColon 



1287 | 646596 | 1 732.123 .gz43 220684 



M00056157D:H08 UC2-NormColon 



1288 459260 | 1732.J23.gz43 220685 



M00056159C:C11 UC2-NormColon 



1289 634409 | 1732.L23.gz43 220687 



M00056162A:F01 UC2-NormColon 



1290 650820 1732.C24.gz43 220694 



M00056150A:E04 UC2-NormColon 



1291 559349 | 1741.M01.gz43 220720 



M00056184C:C07 UC2-NormColon 



1292 650063 | 1741.D02.gz43 220727 



1293 492982 | 1741.J02.gz43 220733 



M00056171C:A08 UC2-NonnColon 



M00056180B:D05 UC2-NormColon 



1294 557353 | 1741.K02.gz43 220734 



M00056182A.B04 UC2-NormColon 



1295 | 649299 | 1741.G03.gz43 220746 



M00056175D.E05 UC2-NormColon 



1296 648063 | 1741.F04.gz43 220761 



M00056174C.H09 UC2-NormColon 



1297 641193 | 1741.J04.gz43 220765 



M00056180B.F11 UC2-NormColon 



1298 | 561856 | 1741.L04.gz43 220767 



M00056183B:G11 UC2-NonnColon 



1299 642631 | 1741.M04.gz43 220768 



M00056184C.E08 UC2-NormColon 



1300 | 648344 | 1741.P04.gz43 22077 1 



M00056188B:E07 UC2-NormColon 



1301 | 458085 | 1741.B05.gz43 220773 



M00056169C:F07 UC2-NonnColon 



1302 I 394772 | 1741.C06.gz43 220790 



M00056170C:C09 UC2-NormColon 
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Table 2 



SEQ 
lIDNOl 



1303 
1304 



1306 
1307 



1310 
1311 



CLUSTER | 
646699 



SEQ NAME 



ORIEN 
T 



1741.E07.gz43 220808 



593173 | 1741.G07.gz43 2208 1 0 



1305 643206 | 1741.K07.gz43 220814 



646309 I 1741.M07.gz43 2208 1 6 



553023 | 1741.N07.gz43 220817 



1308 649705 | 1741.O07.gz43 220818 



1309 555771 | 1741.B08.gz43 220821 



379805 | 1741.I08.gz43 220828 



710362 I 1741.L08.gz43 220831 



1312 650188 I 1741.P08.gz43 220835 



1313. 649795 1741.A09.gz43 220836 



1314 | 460506 | 1741.K09.gz43 220846 



1315 502683 | 1741.N09.gz43 220849 



1316 | 453768 | 1741.F10.gz43 220857 



1317 | 457272 | 1741.G10.gz43 220858 



1318 378694 | 1741.J10.gz43 220861 



1319 649136 | 1741.Fll.gz43 220873 



1320 | 555655 | 1741.Mll.gz43 220880 



1321 648777 | 1741.B12.gz43 220885 



1322 | 446289 | 1741.G12.gz43 220890 



1323 | 449452 | 1741.J12.gz43 220893 



1324 I 648665 | 1741.P12.gz43 220899 



1325 I 561747 | 1741.D14.gz43 2209 1 9 



1326 I 646915 | 1741.E14.gz43 220920 



1327 I 648865 | 1741.G14.gz43 220922 



1328 639886 | 1741.I14.gz43 220924 



1329 | 183842 | 1741.J14.gz43 220925 



1330 I 469409 | 1741.K14.gz43 220926 



1331 | 649702 | 1741.L14.gz43 220927 



1332 646581 | 1741.N14.gz43 220929 



1333 638730 | 1741.H15.gz43 220939 



1334 641525 | 1741.K15.gz43 220942 



1335 | 650800 | 1741.B16.gz43 220949 



1336 647086 | 1741.E16.gz43 220952 



1337 471836 | 1741.J16.gz43 220957 



1338 | 640285 | 1741.E17.gz43 220968 



1339 459923 | 1741.H17.gz43 220971 



1340 I 452729 | 1741.J17.gz43 220973 



1341 209414 | 1741.H18.gz43 220987 



1342 | 463487 | 1741.L18.gz43 22099 1 



1343 | 645288 | 1741.F19.gz43 22 1001 



1344 | 640514 | 1741.J19.gz43 221005 



CLONE ID 



LIBRARY 



M00056173D:C05 I UC2-NormColon 



M00056176A.D06 \ UC2-NormColon 



M00056182A.G11 I UC2-NormColnn 



M00056184C.H03 I UC2-NormColon 



M00056185D:C01 I UC2-NormColon 



M00056186D:G05 I UC2-NormColon 



M00056169D:A03 UC2-NormColon 



M00056179A:D12 UC2-NonnColon 



Mp0056183C:F01 UC2-NormColon 



M00056188C.D02 UC2-NormColon 



M00056168C:A05 UC2-NormColon 



M00056182B:G12 UC2-NormCoIon 



M00056185D:D06 UC2-NormColon 



M00056175A.A06 UC2-NormColon 



M00056176B:B05 UC2-NormColon 



M00056180D;E06 UC2-NormColon 



M00056175A.C04 UC2-NormColon 



M00056184D.D06 UC2-NormColon 



M00056169D.D05 UC2-NormColon 



M00056176B:E10 UC2-NormColon 



M00056180D:F05 UC2-NormColon 



M00056188C:H04 UC2-NormColon 



M00056172A:H06 UC2-NormColon 



M00056174B:D02 UC2-NormColon 



M00056176C:B11 UC2-NormColon 



M00056179B:H12 UC2-NormColon 



M00056180D:G07 UC2-NonnColon 



M00056182C:F08 UC2-NormColon 



M00056184A:B11 UC2-NormColon 



M00056186A:B09 UC2-NormColon 



M00056178B:A11 UC2-NormColon 



M00056182D:A07 UC2-NormColon 



M00056169D:H05 UC2-NormColon 



M00056174B:E02 UC2-NormColon 



M00056181A:B05 UC2-NormColon 



M00056174BE04 UC2-NoimColon 



M00056178B:H08 UC2-NormCoIon 



M00056181A:E07 UC2-NormColon 



M00056178C:C02 UC2-NonnColon 



M00056184B.D08 UC2-NormColon 



M00056175C:A10 UC2-NormColon 



M00056181B:B02 UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1345 


459974 


1741.K19.gz43 221006 


F 


M00056183A:B09 


UC2-NormColon 


1346 


465594 


1741.P19.gz43 221011 


F 


M00056189B:A01 


UC2-NormColon 


1347 


646173 


1741C20.gz43_221014 


F 


M00056171A:H01 


UC2-NormColon 


1348 


557010 


1741.E20.gz43_221016 


F 


M00056174B.H11 


UC2-NormColon 


1349 


650146 


1741.F20.gz43 221017 


F 


M00056175C:B11 


UC2-NormColon 


1350 


499690 


1741.G20.gz43 221018 


F 


M00056177A:D12 


UC2-NormColon 


1351 


584745 


1741.H20.gz43 221019 


F 


M00056178C:E12 


UC2-NormColon 


1352 


446987 


1741L20.gz43_221023 


F 


M00056184B:H08 


UC2-NormColon 


1353 


458683 


1741.A21.gz43 221028 


F 


M00056169B:C02 


UC2-NonnColon 


1354 


640922 


1741.H21.gz43 221035 


F 


M00056178C:F02 


UC2-NormColon 


1355. 


561793 


1741.J21.gz43 221037 


F 


M00056181B:E11 


UC2-NormColon 


1356 


641945 


1741. 02 l.gz43_22 1042 


F 


M00056188A:E05 


UC2-NormColon 


1357 


645578 


1741.J22.gz43 221053 


F 


M00056181C:D02 


UC2-NormColon 


1358 


647688 


1741L22.gz43_221055 


F 


M00056184C:A09 


UC2-NormColon 


1359 


610986 


1741.G23.gz43 221066 


F 


M00056177A:H01 


UC2-NormColon 


1360 


644053 


1741.123.gz43_221068 


F 


M00056180B:A06 


UC2-NormColon 


1361 


639132 


1741.M23.gz43 221072 


F 


M00056185B.B06 


UC2-NormColon 


1362 


642256 


1 74 1 B24.gz43_22 1 077 


F 


M00056170B:B09 


UC2-NormColon 


1363 


648820 


1741F24.gz43_221081 


F 


M00056175D:A06 


UC2-NormColon 


1364 


650975 


1741.L24.gz43 221087 


F 


M00056184C:C03 


UC2-NormColon 


1365 


644801 


1741.N24.gz43_221089 


F 


M00056186B:H09 


UC2-NormColon 


1366 


637387 


1742 G01.gz43_221098 


F 


M00056198A:B05 


UC2-NormColon 


1367 


642118 


1742.P01.gz43_221107 


F 


M00056212C:C09 


UC2-NormColon 


1368 


397363 


1742.F03.gz43_221129 


F 


M00056196C:B03 


UC2-NormColon 


1369 


463168 


1742.J03.gz43 221133 


F 


M00056203D:F05 


UC2-NormColon 


1370 


172843 


1742.M03.gz43_221136 


F 


M00056208B:C10 


UC2-NormColon 


1371 


570939 


1742.J04.gz43_221149 


F 


M00056203D:H03 


UC2-NormColon 


1372 


463545 


1742 L04.gz43_221151 


F 


M00056206D:F07 


UC2-NormColon 


1373 


456920 


1742.J05.gz43_221165 


F 


M00056204A:B08 


UC2-NormColon 


1374 


644916 


1742.E06.gz43_221176 


F 


M00056195C:F12 


UC2-NormColon 


1375 


559955 


1742.M06.gz43_221184 


F 


M00056208C:A10 


UC2-NormColon 


1376 


453762 


1742.B08.gz43_221205 


F 


M00056191B:E08 


UC2-NonnColon 


1377 


640913 


1742I08.gz43_221212 


F 


M00056201C:C03 


UC2-NonnColon 


1378 


648590 


1742.F09.gz43_221225 


F 


M00056196C.G12 


UC2-NormColon 


1379 


470801 


1742.N09.gz43_221233 


F 


M00056210B:C11 


UC2-NormColon 


1380 


645223 


1742.Mll.gz43_221264 


F 


M00056208D:A12 


UC2-NormColon 


1381 


649402 


1742.E12.gz43_221272 


F 


M00056195D:D07 


UC2-NormColon 


1382 


649354 


1742.N12.gz43_221281 


F 


M00056210B:E03 


UC2-NormColon 


1383 


650756 


1742.K15.gz43_221326 


F 


M00056205D:G04 


UC2-NormColon 


1384 


648754 


1742.J18.gz43_221373 


F 


M00056204C:H12 


UC2-NormCoIon 


1385 


645177 


1742.K19.gz43_221390 


F 


M00056206B:A10 


UC2-NormColon 


1386 


507050 


1742.M19.gz43_221392 


F 


M00056209B:D02 


UC2-NormColon 



105 




Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1387 


646275 


1742.P19.gz43 221395 


F 


M00056213C:C01 




1388 


574166 


1743.H01.gz43 221483 


F 


M00056223A:D10 




1389 


649709 


1743.H03.gz43 221515 


F 


M00056223A:H07 




1390 


648155 


1743.C04.gz43 221526 


F 


M00056217A:H11 




1391 


578718 


1743.K04.gz43 221534 


F 


M00056227D:A02 


UC2-Normf'r>1nn 


1392 


649272 


1743.N04.gz43 221537 


F 


M00056232C:E06 


UC2 -Norm Col nn 


1393 


648351 


1743.D05.gz43 221543 


F 


M00056218C:C02 


UC2-NormColon 


1394 


618670 


1743.I06.gz43 221564 


F 


M00056225B:A11 


UC2-Normrolon 


1395 


706245 


1743.P07.gz43 221587 


F 


M00056236A:G12 


UC2-NormCo1on 


1396 


638807 


1743.A08.gz43 221588 


F 


M00056214B:H03 


UC2-NormCoIon 


1397- 


562359 


1743.C09.gz43 221606 


F 


M00056217B:G03 


UC2-NormCoIon 


1398 


447892 


1743.K09.gz43 221614 


F 


M00056228A:A02 


UC2-NormColon 


1399 


502413 


1743.N09.gz43 221617 


F 


M00056232D:C08 


UC2-NormColon 


1400 


499517 


1743.A10.gz43 221620 


F 


M00056214C:A04 


UC2-Normro1on 


1401 


570052 


1743.Gll.gz43 221642 


F 


M00056222A:E05 


UC2-NormCo1on 


1402 


649351 


1743.Mll.gz43 221648 


F 


M00056231B:E01 


UC2-NormCoIon 


1403 


650975 


1743.A16.gz43 221716 


F 


M00056214D.F05 


UC2-NormColon 

V_> ^ 1 ^ \J1 111V_< W1VJ11 


1404 


562625 


1743.C16.gz43 221718 


F 


M00056217D:B07 




1405 


644916 


1743.E16.gz43 221720 


F 


M0005 6220AH04 


VsV>^> 1 lUl 111\^U1V_J11 


1406 


33738 


1743.L16.gz43_221727 


F 


M00056229C:F05 


I JC 2 -Norm Colon 


1407 


645549 


1743.H17.gz43 221739 


F 


M00056224B:F01 


UC2-NormColon 


1408 


451487 


1743.M21.gz43 221808 


F 


M00056232B:A1 1 


UC2-NormColon 


1409 


448780 


1743.C22.gz43 221814 


F 


M00056218A:G11 


I JC 2 -Norm Colon 


1410 


648816 


1743.022.gz43 221826 


F 


M00056235A:C12 


UC2-NormColon 

\~J V>^» X. i \JX 111 V-/V/IVJ11 


1411 


648989 


1 743. G23.gz43_22 1834 


F 


M00056223A:B11 


UC2-NormColon 


1412 


453470 


1743.H23.gz43 221835 


F 


M00056224D:E08 


UC2-NormColon 

v/ V/** i. T \JX XXl\^\JX\Jkl 


1413 


649030 


1743.M23.gz43 221840 


F 


M00056232B:C04 


UC2-NormColon 

V-/ ^/^r x t V-^X 111 KJ X V/X X 


1414 


442347 


1743.G24.gz43 221850 


F 


M00056223A:C03 


UC2-NormColon 


1415 


645271 


1744 O01.gz43_221874 


F 


M00056256A:C02 


UC2-NormColon 

V-' V_^«<w 1 ^ Ul 1 llV> V_/ll^ll 


1416 


645872 


1744.J02.gz43 221885 


F 


M00056249A:H07 


UC2-NormColon 

x ^ vyx x x x v^v./xv^xx 


1417 


651051 


1744.N02.gz43 221889 


F 


M00056254A:F04 


UC2-NormCoIon 


1418 


650463 


1744.F03.gz43 221897 


F 


M00056244A:A01 


UC2-NormColon 


1419 


506719 


1744.J03.gz43 221901 


F 


M00056249B:C02 


UC2-NormColon 


1420 


648989 


1744.E04.gz43 221912 


F 


M00056242D:A02 


UC2-NormColon 


1421 


703978 


1 744. A06.gz43_22 1 940 


F 


M00056237B:D04 


UC2-NormColon 


1422 


650661 


1744.K07.gz43 221966 


F 


M00056250C:D05 


UC2-NormColon 


1423 


419255 


1744.H08.gz43 221979 


F 


M00056246C:G07 


UC2-NormColon 


1424 


648996 


1744.M08.gz43_221984 


F 


M00056253A:F12 


UC2-NormColon 


1425 


451361 


1744.N09.gz43_222001 


F 


M00056254D:C04 


UC2-NormColon 


1426 


15296 


1744.A10.gz43 222004 


F 


M00056237C:C05 


UC2-NormColon 


1427 


649396 


1744.C10.gz43 222006 


F 


M00056240B.F08 


UC2-NormColon 


1428 


650860 


1744.J10.gz43 222013 


F 


M00056249C:E06 


UC2-NormColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


T.TRRARY 


1429 


643327 


1744.K14sz43 222078 


F 


ivx \J\J\J J OZ J 1 D ,/\ 1 £ 


U Li -IN ormColon 


1430 


639119 


1744 M14 ez43 222080 


F 

J. 


1V1 W uu J OZ J .5 13 . iTAJ^f 


U Lz -IN ormColon 


1431 


447272 


1744 C15 ez43 222086 


F 

j. 


lVlUUUJDZHUJL#.rT.U / 


UCz-N ormColon 


1432 


452646 


1744 K15 ez43 222094 


F 


lV/f 000^^9^ 1 R-Rm 


UCz-NormColon 


1433 


637387 


1744 G17 ez43 222122 


F 


1V1U U U J O u/\ . XiU 1 


UC2-NormColon 


1434 


162981 


1744 H17 ez43 222123 


F 


Mnons^?47 a 'T^n^ 


u Lz -in ormColon 


1435 


457221 


1744 H18 ez43 222139 


F 


M0nn^^947 A 
IVll/UUJD^ t T / A.rU / 


u cz-jn ormColon 


1436 


703217 


1744 Ml 8 ez43 222144 


F 


1V1 W U \J J \J Z. J J \^ . ^ U'r 


u\^z -in ormcoion 


1437 


559324 


1744.N18.gz43 222145 


F 


lviuv/ujuzj jD.r 


u Lz -in ormcoion 


1438 


707483 


1744.A19.gz43 222148 


F 


1V1 \J \J \J J UZ. J O /A. . /TlU J 


u v>z-in ormv^oion 


1439- 


464091 


1744.P19.gz43 222163 


F 


M000^6?58AA1 1 


u i^z-in ormv^oion 


1440 


645472 


1744.N20.gz43 222177 


F 


MOoosfi?^spr , n4 

ItIV/VVJUZ, J J V> . V^Ut 


u i^z -iN onnv^oion 


1441 


650018 


1744.H21.gz43 222187 


F 




u\^z-iNoniii^oion 


1442 


651073 


1744.E22.gz43 222200 


F 




u i^z-iN ormcoion 


1443 


641874 


1744.M23.gz43 222224 


F 




u v^z -in ormi^oion 


1444 


610269 


1744 P23 ez43 222227 


F 


M000S67S8 A F 1 0 


u ^z-iN ormcoion 


1445 


650463 


1744.F24.gz43 222233 


F 


M00OS624SAD1 1 


kj v>z -in oiiiiiw/Oion 


1446 


650919 


1744.I24.gz43 222236 


F 


M000S624QAF1 1 


uvz-i> oi iiiVsOion 


1447 


586794 


1744.K24.gz43 222238 


F 


M00056251C-H04 


T TP?-TvJr»rmr"r»1r4n 

u\^z~i>(uiiiiv^oiun 


1448 


642200 


1753.K02.gz43 222270 


F 


M00056?7^A*A07 


kj v^z -in ormi^oion 


1449 


645271 


1753.C03.gz43 222278 


F 


M00056261 AF1? 


Uv/Z -in ormL oion 


1450 


528162 


1753 J04az43 222301 


F 


mooo^6?7i rrofi 


u v^z-iN ui iiHw/Oion 


1451 


648609 


1753.N05.gz43 222321 


F 


M00056777RO0S 


u ^z -in omiL/Oion 


1452 


213 


1753.D06.gz43 222327 


F 


M000S676?n*On8 

IViUl/UJ \JZ. VJZ. JU/ .VJVJO 


TTf*7 MnrmPnlnn 

u i^z-in ormi^oion 


1453 


648063 


1753 E06 ez43 222328 


F 


M000^67fi4R-Pn^ 


u \^z-in onri^oion 


1454 


647828 


1753 106 ez43 222332 


F 


M000^6?70 A ROO 

1VXWV/VJ JUZ> / V//A..XJ\J" 


u ^z-iN orrni^oion 


1455 


476380 


1753.L06.gz43 222335 


F 


M000^6?74n*FOX 


T TP 7 .Nnrm P rA nrt 

u v_/Z-in oimi^oion 


1456 


649262 


1753 D10 ez43 222391 


F 




T TP7-"MnrmPr»1r*« 

u v^z-in oi iiu^oion 


1457 


648590 


1753.A12.gz43 222420 


F 


M000^67SXn*H06 


T TP 7 _"NJr\rm P r» 1 rtn 

ULZ -in oi 1 1 oion 


1458 


557857 


1753 112 ez43 222428 


F 


M000S6970R*Fn? 


T TP 7 MArmP/xl/Mi 

uv^z-iNormv^oion 


1459 


649349 


1753 PI 3 ez43 222451 


F 


M00056780RF00 


T TP7-MnrmPn1nr» 
U\^Z-INOllIIl_/010n 


1460 


460245 


1753 B14ez43 222453 


F 


M000S6760r-Fl 1 


T TP7-"Mr*rmP^1r\rt 

u i^z -in ormv^oion 


1461 


648442 


1753 M14 ez43 222464 


F 


M00056976AFOS 


TTP7 "WnrmrnlAn 

u \^z-in orrni^oion 


1462 


646609 


1753 E15 ez43 222472 


F 


M000^676SA-F01 


%j i^z -in oiiiiv>oion 


1463 


562173 


1753.G15.gz43 222474 


F 


M00056267D • R09 


I TP7-"MnrmPrtlrtri 

vj v_^z -in uiiiiv^oion 


1464 


555021 


1753.A17.gz43 222500 


F 


M00056259A:E08 


UC2-NormColon 


1465 


486452 


1753.D17.gz43_222503 


F 


M00056263B:E10 


UC2-NormColon 


1466 


507066 


1753.M17.gz43_222512 


F 


M00056276D:B12 


UC2-NormColon 


1467 


649592 


1753.O18.gz43_222530 


F 


M00056279B:G06 


UC2-NormColon 


1468 


268336 


1753.C23.gz43 222598 


F 


M00056262B:B08 


UC2-NormColon 


1469 


509202 


1753.P23.gz43_222611 


F 


M00056280D:C06 


UC2-NormColon 


1470 


459914 


1753.A24.gz43_222612 


F 


M00056259C:E03 


UC2-NormColon 
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Table 2 



SEQ 
ID NC 


» CLUSTER 


SEQ NAME 


ORIETs 
T 


r 

CLONE ID 


LIBRARY 


1471 


466894 


1753.I24.gz43 222620 


F 


M00056271A:G04 


UC2-NormColon 


1472 


647839 


1754.D07.gz43 222727 


F 


M00056285B:B01 


UC2-NormColon 


1473 


644862 


1754.I07.gz43 222732 


F 


M00056292D:C06 


UC2-NormColon 


1474 


492587 


1754.K07.gz43 222734 


F 


M00056295C:F12 


UC2-NormColon 


1475 


651057 


1754.M07.gz43 222736 


F 


M00056298A:H02 


UC2-NormColon 


1476 


554276 


1754.E10.gz43 222776 


F 


M00056286C:H08 


UC2-NormColon 


1477 


643984 


1754.N10.gz43 222785 


F 


M00056299C:F06 


UC2-NormColon 


1478 


646060 


1754.H12.gz43 222811 


F 


M00056291B:G01 


UC2-NormColon 


1479 


646387 


1754.K12.gz43 222814 


F 


M00056296A:A07 


UC2-NormColon 


1480 


462986 


1754.N12.gz43 222817 


F 


M00056299D:A02 


UC2-NormColon 


1481. 


598471 


1754.H19.gz43 222923 


F 


M00056291C:C11 


UC2-NormColon 


1482 


449882 


1 754.1 19gz43 222924 


F 


M00056293B:F02 


UC2-NormColon 


1483 


555998 


1754.F20.gz43 222937 


F 


M00056289B:E05 


UC2-NormColon 


1484 


419801 


1754.J20.gz43 222941 


F 


M00056294B:G06 


UC2-NormColon 


1485 


647462 


1754.E21.gz43 222952 


F 


M00056287C:H08 


UC2-NormColon 


1486 


648524 


1754.F21.gz43 222953 


F 


M00056289B:F09 


UC2-NormColon 


1487 


462865 


1754.M22.gz43 222976 


F 


M00056298D:G01 


UC2-NormColon 


1488 


645844 


1754.022.gz43 222978 


F 


M00056301C:H08 


UC2-NormColon 


1489 


556497 


1754.C23.gz43 222982 


F 


M00056284C.A1 1 


UC2-NormColon 


1490 


467709 


1754.L23.gz43 222991 


F 


M00056297D:B10 


UC2-NormColon 


1491 


559857 


1754.D24.gz43 222999 


F 


M00056286B:A04 


UC2-NormColon 


1492 


651115 


1754.N24.gz43 223009 


F 


M00056300B:A1 1 


UC2-NormColon 


1493 


644660 


1755.D05.gz43 223079 


F 


M00056307A:F07 


UC2-NormColon 


1494 


647248 


1 755 .N05 .gz43_223089 


F 


M00056329C:B09 


UC2-NormColon 


1495 


524261 


1755.B06.gz43 223093 


F 


M00056305A:A02 


UC2-NormColon 


1496 


556444 


1755.K06.gz43 223102 


F 


M00056320C:E08 


UC2-NormColon 


1497 


639378 


1755.M08.gz43 223136 


F 


M00056326C:G07 


UC2-NonnColon 


1498 


460929 


1755.O09.gz43 223154 


F 


M00056333A:E09 


UC2-NormColon 


1499 


642260 


1755.K10.gz43 223166 


F 


M00056320D:F07 


UC2-NormColon 


1500 


635062 


1755.11 l.gz43 223180 


F 


M00056319A:A11 


UC2-NormColon 


1501 


644880 


1755.B14.gz43 223221 


F 


M00056305B:F09 


UC2-NormColon 


1502 


452075 


1755.N14.gz43_223233 


F 


M00056330B:C02 


UC2-NormColon 


1503 


557052 


1755.E15.gz43 223240 


F 


M00056310A:C10 


UC2-NormColon 


1504 


569195 


1755.K15.gz43 223246 


F 


M00056323A:G03 


UC2-NormColon 


1505 


444222 


1755.M16.gz43 223264 


F 


M00056328B:E07 


UC2-NormColon 


1506 


467364 


1755.A17.gz43 223268 


F 


M00056304A:H04 


UC2-NormColon 


1507 


446910 


1755.A18.gz43 223284 


F 


M00056304B:C05 


UC2-NormColon 


1508 


447126 


1755.M19.gz43 223312 


F 


M00056328D:B01 


UC2-NormColon 


1509 


650639 


1755.D20.gz43_223319 


F 


M00056308A:C01 


UC2-NormColon 


1510 


641681 


1755.B22.gz43 223349 


F 


M00056305D:E08 


UC2-NormColon 


1511 


606129 


1755.P24.gz43_223395 


F 


M00056338B:B08 


UC2-NormColon 


1512 


448412 


1766.A01.gz43_224099 


F 


M00042346A:B04 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1513 


485237 


1766.E01.gz43 224103 


F 


M00042432D:G02 


T 1C7 -Pri m CrA nn 


1514 


479131 


1766.G01.gz43 224105 


F 


M00042437B:A1 1 




1515 


481457 


1766.H03.gz43 224138 


F 


M0004243 9A- D09 


<j v_/Z, - x i iiuv_/Oion 


1516 


448892 


1766.A09.gz43 224227 


F 


M00042346C:G12 


T JC^-PrimPnloTi 


1517 


477797 


1766.E13.gz43 224295 


F 


M00042433C:H05 




1518 


450212 


1766.I19.gz43 224395 


F 


M00042462A:F12 


IJC2-PrirnCnlr»n 


1519 


3781 


1766.J21.gz43 224428 


F 


M00042464A:F06 


UC2-PrimColon 


1520 


447377 


1766.M22.gz43 224447 


F 


M00042470B:H1 1 


UC2-PrimColon 


1521 


453606 


1767.H01.gz43 224490 


F 


M00042764B:B10 


UC2-PrimColon 


1522 


472101 


1767.O02.gz43 224513 


F 


M00042777B:B05 


UC2-PrimColon 

X^/ V_^" A X MX ll\^Vl \JXX 


1523. 


284586 


1767.D03.gz43 224518 


F 


M00042756B:B01 


UC2-PrimColon 

\J\~s£* X 1 11 HV^wl Wll 


1524 


449689 


1767.M03.gz43 224527 


F 


M00042773A:H11 


UC2-PrimColon 


1525 


449215 


1767.A04.gz43 224531 


F 


M00042515D:B11 


I JC 2-Prim CcA on 

\J\^t£* X 1 111 1V_>V_/HJ11 


1526 


128749 


1767I04.gz43_224539 


F 


M00042766A:D07 


UC2-PrimCoIon 


1527 


451982 


1 767. A05 .gz43_224547 


F 


M00042515D:G01 


IJC2-PrirnCnlnn 


1528 


448251 


1767.H05.gz43 224554 


F 


M00042764C:B10 


UC2-PrimCo1rm 

X 1 llllV^VJUJll 


1529 


449437 


1767.J05.gz43_224556 


F 


M00042767D:D02 


UC2-Primrolon 

uv/ii x x ii ii v_/i\jii 


1530 


482876 


1767.K05.gz43 224557 


F 


M00042769B:E12 


UC2-Primro1on 


1531 


481614 


1767.C06.gz43_224565 


F 


M00042755A:D09 


UC 2 -Prim Col on 

v_^z* x 1 iinv^uiuii 


1532 


486547 


1767.M06.gz43_224575 


F 


M00042773BH08 


IJC2-PriTTtCnlrm 

WV^i. X 1 1111 V^VJlWl I 


1533 


179760 


1767.A07.gz43_224579 


F 


M00042751B:C07 


UC2-PrimColon 

KJ\~s£* X 1 llllV^-WlUll 


1534 


484633 


1767.I08.gz43 224603 


F 


M00042766B:G11 


TJC2-PrimCoIon 


1535 


479851 


1767.K08.gz43_224605 


F 


M00042769C:B01 


UC2-Prirn Colon 

\J\^£i J. 1 111 1V> WiUll 


1536 


470801 


1 767.G09.gz43_2246 1 7 


F 


M00042763B:A03 


Ur2-Primrolon 


1537 


486452 


1767J09.gz43_224620 


F 


M00042768A:H09 


UC2-PrimColon 


1538 


479868 


1767K09.gz43_224621 


F 


M00042769C:B06 


UC2-PrimColon 


1539 


485880 


1767.C10.gz43_224629 


F 


M00042755A:H04 


UC2-Primro1on 


1540 


456513 


1767Jll.gz43_224652 


F 


M00042768B:F02 


UC2-PrimColon 


1541 


473433 


1767.011.gz43_224657 


F 


M00042778A:C04 


UC2-PrimColon 


1542 


481594 


1767.C12.gz43_224661 


F 


M00042755B:D01 


UC2-PrimColon 


1543 


477521 


1767.F12.gz43 224664 


F 


M00042761C:G11 


UC2-PrimColon 


1544 


485073 


1767.L12.gz43_224670 


F 


M00042771B:G05 


UC2-PrimColon 


1545 


480960 


1767.B13.gz43_224676 


F 


M00042753B:C08 


UC2-PrimColon 


1546 


449215 


1767.A14.gz43_224691 


F 


M00042751D:D03 


UC2-PrimColon 


1547 


450551 


1767.114.gz43_224699 


F 


M00042766D:C05 


UC2-PrimColon 


1548 


481057 


1767.M14.gz43_224703 


F 


M00042774B:C01 


UC2-PrimColon 


1549 


449076 


1767.D15.gz43 224710 


F 


M00042757A:H07 


UC2-PrimColon 


1550 


477098 


1767.E15.gz43_224711 


F 


M00042760A:G12 


UC2-PrimColon 


1551 


482486 


1767J15.gz43_224716 


F 


M00042768C:E06 


UC2-PrimColon 


1552 


449438 


1767.K15.gz43_224717 


F 


M00042770A:D02 


UC2-PrimColon 


1553 


448260 


1767.015.gz43_224721 


F 


M00042778B:B07 


UC2-PrimColon 


1554 


453385 


1767.P15.gz43 224722 


F 


M00042779D:D04 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1555 


484355 


1767.A16.gz43_224723 


F 


M00042751D:F09 


UC2-PrimColon 


1556 


485882 


1767.C16.gz43_224725 


F 


M00042755B:H04 


UC2-PrimColon 


1557 


449592 


1767.L16.gz43_224734 


F 


M00042771D:F02 


UC2-PrimColon 


1558 


449261 


1767.M16.gz43_224735 


F 


M00042774C:B09 


UC2-PrimColon 


1559 


485907 


1767.C17.gz43_224741 


F 


M00042755B:H11 


UC2-PrimColon 


1560 


474580 


1767.F17.gz43_224744 


F 


M00042761D:D12 


UC2-PrimColon 


1561 


448152 


1767.N17.gz43_224752 


F 


M00042776D:D01 


UC2-PrimColon 


1562 


474823 


1767.P17.gz43_224754 


F 


M00042779D:E06 


UC2-PrimColon 


1563 


453024 


1767.E18.gz43_224759 


F 


M00042760B:E12 


UC2-PrimColon 


1564 


479880 


1767.11 8.gz43_224763 


F 


M00042767A:B10 


UC2-PrimColon 


1565. 


477399 


1767.N18.gz43_224768 


F 


M00042776D:G10 


UC2-PrimColon 


1566 


485441 


1767.A19.gz43_224771 


F 


M00042751D:G09 


UC2-PrimColon 


1567 


485183 


1767.B19.gz43_224772 


F 


M00042753C:G12 


UC2-PrimColon 


1568 


452104 


1767.C19.gz43_224773 


F 


M00042755C:A10 


UC2-PrimColon 


1569 


449275 


1767.A20.gz43_224787 


F 


M00042752A:B04 


UC2-PrimColon 


1570 


447218 


1767.L20.gz43_224798 


F 


M00042772C:C12 


UC2-PrimColon 


1571 


472119 


1767.N20.gz43_224800 


F 


M00042777A:B10 


UC2-PrimColon 


1572 


482868 


1 767.12 l.gz43_2248 11 


F 


M00042767B:E10 


UC2-PrimColon 


1573 


455143 


1767.F22.gz43_224824 


F 


M00042762A:G02 


UC2-PrimColon 


1574 


484964 


1767.122.gz43_224827 


F 


M00042767B:G04 


UC2-PrimColon 


1575 


448325 


1767.D23.gz43_224838 


F 


M00042758B:C06 


UC2-PrimColon 


1576 


484987 


1 767.123. gz43_224843 


F 


M00042767B:G09 


UC2-PrimColon 


1577 


475797 


1767.023.gz43_224849 


F 


M00042778D:F11 


UC2-PrimColon 


1578 


477757 


1767.P24.gz43_224866 


F 


M00042780A:H05 


UC2-PrimColon 


1579 


448233 


1778.D01.gz43_225254 


F 


M00042842A:B12 


UC2-PrimColon 


1580 


475111 


1778.B02.gz43_225268 


F 


M00042838A:E02 


UC2-PrimColon 


1581 


449403 


1778.N02.gz43_225280 


F 


M00042859B:D06 


UC2-PrimColon 


1582 


449604 


1778.O02.gz43_225281 


F 


M00042860D:F02 


UC2-PrimColon 


1583 


447983 


1778.F03.gz43_225288 


F 


M00042845C:A09 


UC2-PrimColon 


1584 


484091 


1 778.003 .gz43_225297 


F 


M00042860D:F05 


UC2-PrimColon 


1585 


480900 


1778.J04.gz43_225308 


F 


M00042852B:C06 


UC2-PrimColon 


1586 


551811 


1778.K04.gz43_225309 


F 


M00042854A:B11 


UC2-PrimColon 


1587 


485841 


1778.L05.gz43_225326 


F 


M00042855B:H06 


UC2-PrimColon 


1588 


476256 


1778.C07.gz43_225349 


F 


M00042839D:F02 


UC2-PrimColon 


1589 


447983 


1778.E07.gz43_225351 


F 


M00042844A:D02 


UC2-PrimColon 


1590 


725641 


1778.K07.gz43_225357 


F 


M00042854A:D05 


UC2-PrimColon 


1591 


484789 


1778.L08.gz43_225374 


F 


M00042855C:G11 


UC2-PrimColon 


1592 


449110 


1778.L09.gz43_225390 


F 


M00042855D:A12 


UC2-PrimColon 


1593 


451368 


1778O09.gz43_225393 


F 


M00042861A:F04 


UC2-PrimColon 


1594 


449521 


1778.P09.gz43_225394 


F 


M00042862C:E02 


UC2-PrimColon 


1595 


727875 


1778.G10.gz43_225401 


F 


M00042847A:F04 


UC2-PrimColon 


1596 


449273 


1778.11 l.gz43_2254 19 


F 


M00042851A:B08 


UC2-PrimColon 
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# • 



Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1597 


449652 


1778.Mll.gz43_225423 


F 


M00042857C:G04 


UC2-PrimColon 


1598 


481664 


1778.Nll.gz43 225424 


F 


M00042859D:D12 


UC2-PrimColon 


1599 


448230 


1778.E12.gz43_225431 


F 


M00042844C:B03 


UC2-PrimColon 


1600 


485029 


1778H12.gz43_225434 


F 


M00042849B:G06 


UC2-PrimColon 


1601 


483274 


1778.112.gz43_225435 


F 


M00042851A:E11 


UC2-PrimColon 


1602 


644190 


1778J12.gz43_225436 


F 


M00042853A:G03 


UC2-PrimColon 


1603 


725266 


1778.K12.gz43_225437 


F 


M00042854B:F05 


UC2-PrimCoIon 


1604 


480005 


1778.L12.gz43_225438 


F 


M00042855D:B12 


UC2-PrimColon 


1605 


484841 


1778.M12.gz43_225439 


F 


M00042857C:G05 


UC2-PrimColon 


1606 


480641 


1778.N12.gz43_225440 


F 


M00042860A:C01 


UC2-PrimColon 


1607. 


481319 


1778.013.gz43_225457 


F 


M00042861C:C03 


UC2-PrimColon 


1608 


387077 


1778.C14.gz43_225461 


F 


M00042840C:B12 


UC2-PrimColon 


1609 


456737 


1778.L14.gz43_225470 


F 


M00042855D:D06 


UC2-PrimColon 


1610 


479084 


1778.M14.gz43_225471 


F 


M00042858A:A08 


UC2-PrimColon 


1611 


484468 


1778.015.gz43_225489 


F 


M00042861C:F07 


UC2-PrimColon 


1612 


450673 


1778.C18.gz43_225525 


F 


M00042841B:H05 


UC2-PrimColon 


1613 


474597 


1778D18.gz43_225526 


F 


M00042843A:E07 


UC2-PrimColon 


1614 


477593 


1778.E18.gz43_225527 


F 


M00042844D:H02 


UC2-PrimColon 


1615 


452990 


1778.11 8gz43_225531 


F 


M00042851D:B08 


UC2-PrimColon 


1616 


449407 


1778.018.gz43_225537 


F 


M00042861D:D09 


UC2-PrimColon 


1617 


448104 


1778.E19.gz43_225543 


F 


M00042845A:B05 


UC2-PrimColon 


1618 


453856 


1778.D22.gz43_225590 


F 


M00042843B:F05 


UC2-PrimColon 


1619 


475942 


1778B23.gz43_225604 


F 


M00042839B:F05 


UC2-PrimColon 


1620 


639991 


1778.F23.gz43_225608 


F 


M00042846C:D09 


UC2-PrimColon 


1621 


480640 


1778.023.gz43_225617 


F 


M00042862A:C01 


UC2-PrimColon 


1622 


472307 


1778.C24.gz43_225621 


F 


M00042842A:B04 


UC2-PrimCoIon 


1623 


733806 


1780.10 l.gz43_226027 


F 


M00056361A:C01 


UC2-PrimColon 


1624 


732150 


1 780.E02.gz43_226039 


F 


M00056356C:H07 


UC2-PrimColon 


1625 


724907 


1780.K02.gz43_226045 


F 


M00056363A:C06 


UC2-PrimColon 


1626 


724792 


1780L02.gz43_226046 


F 


M00056364A:B04 


UC2-PrimColon 


1627 


736210 


1780.B04.gz43_226068 


F 


M00056353C:E12 


UC2-PrimColon 


1628 


623978 


1780.G05.gz43_226089 


F 


M00056359A:C08 


UC2-PrimColon 


1629 


600856 


1 780.105 .gz43_226091 


F 


M00056361A:H06 


UC2-PrimColon 


1630 


364334 


1780.K05.gz43_226093 


F 


M00056363A:E10 


UC2-PrimColon 


1631 


661194 


1780.P05.gz43_226098 


F 


M00056368B:H09 


UC2-PrimColon 


1632 


416326 


1780.G06.gz43_226105 


F 


M00056359A:F06 


UC2-PrimColon 


1633 


727129 


1780.D07.gz43_226118 


F 


M00056355D:A05 


UC2-PrimColon 


1634 


560859 


1780.O07.gz43_226129 


F 


M00056367C:D06 


UC2-PrimColon 


1635 


482831 


1780.P07.gz43_226130 


F 


M00056368C:E07 


UC2-PrimColon 


1636 


503862 


1780.L08.gz43_226142 


F 


M00056364C:B01 


UC2-PrimColon 


1637 


468467 


1780.D09.gz43_226150 


F 


M00056355D:B08 


UC2-PrimColon 


1638 


594994 


1780.A10.gz43_226163 


F 


M00056352D:H02 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1639 


463966 


1780.C10.gz43 226165 


F 


M00056355A.C10 


UC2-PrimColon 


1640 


471646 


1780.I10.gz43 226171 


F 


M00056361B:E02 


UC2-PrimColon 


1641 


731925 


1780.Bll.gz43 226180 


F 


M00056353D.-D10 


UC2-PrimColon 


1642 


480924 


1780.Mll.gz43 226191 


F 


M00056365B:G10 


UC2-PrimCo1nn 


1643 


517298 


1780.Pll.gz43 226194 


F 


M00056368D.D03 


UC2-PrimColon 


1644 


734150 


1780.F12.gz43 226200 


F 


M00056358A:G05 


UC2-PrimColon 


1645 


528981 


1780.H12.gz43 226202 


F 


M00056360B:D09 


UC2-PrimColon 


1646 


135364 


1 780.1 12.gz43 226203 


F 


M00056361C:C06 


UC2-PrimColon 


1647 


511606 


1780.J12.gz43 226204 


F 


M00056362C:C02 


UC2-PrimColon 


1648 


552891 


1780.K14.gz43 226237 


F 


M00056363C:C12 


UC2-PrimColon 


1649. 


730301 


1780P14.gz43_226242 


F 


M00056368D:G05 


UC2-PrimColon 


1650 


480005 


1780.H15.gz43 226250 


F 


M00056360C:B01 


UC2-PrimColon 


1651 


588959 


1780.L15.gz43 226254 


F 


M00056364C:H08 


UC2-PrimColon 


1652 


557029 


1780C16.gz43_226261 


F 


M00056355B.D02 


UC2-PrimCoIon 


1653 


553244 


1780.L17.gz43 226286 


F 


M00056364D:C05 


UC2-PrimColon 


1654 


456535 


1780.O20.gz43_226337 


F 


M00056368A:C11 


UC2-PrimColon 


1655 


728774 


1780.K22.gz43 226365 


F 


M00056363D:H03 


UC2-PrimColon 


1656 


724039 


1780.E23.gz43 226375 


F 


M00056357C:C06 


UC2-PrimCoIon 


1657 


732144 


1780.H23.gz43 226378 


F 


M00056360D:H10 


UC2-PrimColon 


1658 


724257 


1789.G01.gz43 226409 


F 


M00056375D.D12 


UC2-PrimColon 


1659 


474346 


1789.M01.gz43 226415 


F 


M00056384C:H04 


UC2-PrimColon 


1660 


376342 


1 789.N0 l.gz43 226416 


F 


M00056386A:F08 


UC2-PrimColon 


1661 


475682 


1789.P01.gz43 226418 


F 


M00056388B:A05 


UC2-PrimColon 


1662 


734646 


1789.B02.gz43 226420 


F 


M00056370B:C10 


UC2-PrimColon 


1663 


186372 


1789.M02.gz43 226431 


F 


M00056384D:A04 


UC2-PrimColon 


1664 


736001 


1789.A03.gz43 226435 


F 


M00056369B:E03 


UC2-PrimColon 


1665 


487079 


1789.F03.gz43 226440 


F 


M00056374C:G08 


UC2-PrimColon 


1666 


460284 


1789.H03.gz43 226442 


F 


M00056377A:C01 


UC2-PrimColon 


1667 


725274 


1 789.103 .gz43_226443 


F 


M00056377D:F08 


UC2-PrimColon 


1668 


553035 


1789.N03.gz43 226448 


F 


M00056386A:G01 


UC2-PrimColon 


1669 


724009 


1789.P03.gz43 226450 


F 


M00056388B:B03 


UC2-PrimColon 


1670 


568484 


1789.B04.gz43 226452 


F 


M00056370B:E07 


UC2-PrimColon 


1671 


725321 


1789.L04.gz43 226462 


F 


M00056383B:F08 


UC2-PrimColon 


1672 


731542 


1789.K05.gz43_226477 


F 


M00056382C:F11 


UC2-PrimColon 


1673 


725448 


1 789L06.gz43_226494 


F 


M00056383B:G08 


UC2-PrimColon 


1674 


452806 


1789.N06.gz43 226496 


F 


M00056386B.A07 


UC2-PrimColon 


1675 


549790 


1789.L07.gz43 226510 


F 


M00056383C:E07 


UC2-PrimColon 


1676 


730660 


1789.N07.gz43 226512 


F 


M00056386B:A11 


UC2-PrimColon 


1677 


503625 


1789.B08.gz43_226516 


F 


M00056370C.F01 


UC2-PrimColon 


1678 


725381 


1789.E09.gz43_226535 


F 


M00056373D:G01 


UC2-PrimColon 


1679 


649411 


1789.G09.gz43 226537 


F 


M00056376A:C10 


UC2-PrimColon 


1680 


729993 


1789I09.gz43_226539 


F 


M00056378C:E10 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 



CLUSTE R 
727523 



SEQ NAME 



ORIEN 
T 



CLONE ID 



LIBRARY 



1681 



1789.M10.gz43 226559 



M00056385A:A07 | UC2-PrimColon 



1682 



459918 
735649 



1789.N10.gz43 226560 



M00056386B:D07 I UC2-PrimColon 



1683 



1789.Bll.gz43 226564 



M00056370D:C06 I UC2-PrimColon 



1684 



734348 | 1789.Fll.gz43 226568 



M00056375A:D11 I UC2-PrimColnn 



1685 



558412 | 1789.01 l.gz43 226577 



M00056387C:D01 | UC2-PrimCo1nn 



1686 



724694 | 1789.H12.gz43 226586 



M00056377C:A01 UC2-PrimColon 



1687 



723972 | 1789.J12.gz43 226588 



M00056382A:A12 UC2-PrimColon 



1688 



1689 



1690 



623408 1789.N12.gz43_226592 



640158 | 1790.F01.gz43 226792 



M00056386B.E09 UC2-PrimColon 



725095 | 1790.J02.gz43 226812 



M00056394C:D07 UC2-PrimColon 



M00056398C:D10 UC2-PrimColon 



1691 



562769 1790.O02.gz43 226817 



M00056404D:G03 UC2-PrimColon 



1692 



479851 1 1790.L03.gz43 226830 



M00056400C.G04 UC2-PrimColon 



1693 



481077 I 1790.N03.gz43 226832 



M00056403C:F02 UC2-PrimColon 



1694 



1695 



656268 I 1790.K04.gz43 226845 



724411 I 1790.N04.gz43 226848 



M00056399D:C11 UC2-PrimColon 



M00056403C:F03 UC2-PrimColon 



1696 



640261 I 1790.N05.gz43 226864 



M00056403C:F12 UC2-PrimColon 



1697 



426297 | 1790.D07.gz43 226886 



M00056392D:C05 UC2-PrimColon 



1698 



649591 | 1790.M07.gz43 226895 



M00056402C:B09 UC2-PrimColon 



1699 



729111 | 1790.D08.gz43 226902 



M00056392D:H02 UC2-PrimColon 



1700 



1701 



733563 | 1790.G08.gz43 226905 



729637 | 1790.A10.gz43 22693 1 



M00056396A:D03 UC2-PrimColon 



M00056389B:H03 UC2-PrimColon 



1702 



726494 | 1790.K10.gz43 22694 1 



M00056400A:D08 UC2-PrimColon 



1703 



724714 | 1790.M10.gz43 226943 



M00056402D:A01 UC2-PrimColon 



1704 



543323 | 1790.P10.gz43 226946 



M00056406C:D09 UC2-PrimColon 



1705 



556769 | 1790.Bll.gz43 226948 



M00056390C:B08 UC2-PrimColon 



1706 



209 



1790.Mll.gz43 226959 



M00056402D:B07 UC2-PrimColon 



1707 



724914 I 1790.B12.gz43 226964 



M00056390C:C04 UC2-PrimColon 



1708 



614463 I 1790.D12.gz43 226966 



M00056393A:G06 UC2-PrimColon 



1709 



474981 | 1790.G12.gz43 226969 



M00056396A:G10 UC2-PrimColon 



1710 



729950 | 1790.L12.gz43 226974 



M00056401A:F09 UC2-PrimColon 



1711 



640756 | 1790.N12.gz43 226976 



M00056404A:G03 UC2-PrimColon 



1712 



449228 1 1790.O13.gz43 226993 



M00056405CH04 UC2-PrimColon 



1713 



423420 | 1790.P13.gz43 226994 



M00056406C:F12 UC2-PrimColon 



1714 



727150 | 1790.C14.gz43 226997 



M00056391C:H12 UC2-PrimColon 



1715 



503173 | 1790.E14.gz43 226999 



M00056394B:C11 UC2-PrimColon 



1716 



188309 | 1790.N14.gz43 227008 



M00056404B:B06 | UC2-PrimColon 



[717 



448758 | 1790.J15.gz43 227020 



M00056399B:G09 UC2-PrimColon 



1718 



735234 | 1790.L15.gz43 227022 



M00056401B:B01 UC2-PrimColon 



1719 



483147 I 1790.P15.gz43 227026 



M00056407A.D10 UC2-PrimColon 



1720 



642985 | 1790.G16.gz43 227033 



M00056396B:E10 UC2-PrimColon 



1721 



725451 I 1790.K16.gz43 227037 



M00056400A:G10 UC2-PrimColon 



1722 



729809 I 1790.P16.gz43 227042 



M00056407A:G03 UC2-PrimColon 
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Table 2 



SEQ 
lIDNOl 



CLUSTER 



SEQ NAME 



lORIEN 
T 



CLONE ID 



LIBRARY 



1723 
1724 



495712 
730948 



1790.A17.gz43 227043 



M00056389D:F01 UC2-PrimColon 



1790.O17.gz43 227057 



M00056405D:E05 I UC2-PrimColon 



1725 
1726 



457508 



1790.C18.gz43 22706 1 



M00056391D.E06 UC2-PrimColon 



417549 



1790.M18.gz43_227071 



M00056403A:E09 UC2-PrimColon 



1727 
1728 



725454 



1790.E19.gz43 227079 



M00056394B.G05 UC2-PrimColon 



1729 
1730 



448817 



726411 



1790.C20.gz43 227093 



1790.O20.gz43 227105 



M00056392A:F06 UC2-PrimColon 



M00056406A:C02 UC2-PrimColon 



558794 



1790D21gz43_227110 



M00056393D:A09 UC2-PrimColon 



1731 641687 



1790.P21.gz43 227122 



1732 733643 



1790.N22.gz43 227136 



M00056409A:C03 I UC2-PrimColon 



M00056404C:C12 UC2-PrimColon 



1733. 523364 



1790.F23.gz43 227144 



M00056395C:H03 UC2-PrimColon 



1734 468467 



1790.M23.gz43 22715 1 



1735 402799 



1790.O23.gz43 227153 



M00056403B:D09 UC2-PrimColon 



M00056406A:E01 I UC2-PrimColon 



1736 470462 



1790.P23.gz43 227154 



M00056409A:D03 UC2-PrimColon 



1737 118514 



1790.D24.gz43 227158 



M00056393D:C12 UC2-PrimColon 



1738 732821 



1790.F24.gz43 227 1 60 



M00056395D:B01 UC2-PrimColon 



1739 447222 



1790.K24.gz43 227165 



M00056400C:B11 UC2-PrimColon 



1740 729779 



1790.M24.gz43 227167 



M00056403B.G12 UC2-PrimColon 



1741 727633 



1790.P24.gz43 227170 



1742 475797 



1791.F01.gz43 227176 



M00056409A:D06 UC2-PrimColon 



M00056415A:D03 UC2-PrimColon 



1743 648170 



1791.G01.gz43 227177 



M00056416B:B11 UC2-PrimColon 



1744 472684 



1791.I01.gz43 227179 



M00056418A:G09 UC2-PrimColon 



1745 477718 



1791.O02.gz43 227201 



M00056423D:H07 UC2-PrimColon 



1746 729784 



1791.A03.gz43 227203 



M00056409B:C12 UC2-PrimColon 



1747 402471 



1791.F03.gz43 227208 



M00056415B:B10 UC2-PrimColon 



1748 453762 



1791.K03.gz43 227213 



M00056419D:G04 UC2-PrimColon 



1749 511149 



1791.M03.gz43 227215 



M00056422B:A08 UC2-PrimColon 



1750 727090 



1791.G04.gz43 227225 



M00056416B.H01 UC2-PrimColon 



1751 560957 



1791.K04.gz43 227229 



M00056420A:C01 UC2-PrimColon 



1752 727845 



1791.M04.gz43 22723 1 



1753 642170 



1791.H05.gz43 227242 



M00056422B:C12 UC2-PrimColon 



M00056417B:D05 UC2-PrimColon 



1754 128749 



1791.K05.gz43 227245 



M00056420A:C12 UC2-PrimColon 



1755 736632 



1791.P05.gz43 227250 



M00056424D:C08 UC2-PrimColon 



1756 542957 



1791.J07.gz43 227276 



M00056419A:F08 UC2-PrimColon 



1757 733490 



1791.O08.gz43 227297 



M00056424A:F05 UC2-PrimColon 



1758 649722 



1791.O09.gz43 227313 



M00056424A:F12 UC2-PrimColon 



1759 562137 



1791.Lll.gz43 227342 



M00056421C:E06 UC2-PrimColon 



1760 729199 



1791.G12.gz43 227353 



M00056416D:C08 UC2-PrimColon 



1761 730178 



1791.L12.gz43 227358 



M00056421C:E12 UC2-PrimColon 



1762 435349 



1791.G13.gz43 227369 



M00056416D:D12 UC2-PrimColon 



1763 723963 



1791.P13.gz43 227378 



M00056425B:B02 UC2-PrimColon 



1764 725408 



1791.H14.gz43_227386 



M00056417C:G10 UC2-PrimColon 
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Table 2 





SEQ 






ORIEN 








ID NO 


CLUSTER 


SEQ NAME 


T 


CLONE ID 


LIBRARY 




1765 


500253 


1791.L14.gz43_227390 


F 


M00056421C:H01 


UC2-PrimColon 




1766 


644723 


1791.N14.gz43_227392 


F 


M00056423C:G05 


UC2-PrimColon 




1767 


474965 


1791.M15.gz43_227407 


F 


M00056422C:B10 


UC2-PrimColon 




1768 


724024 


1791.N17.gz43_227440 


F 


M00056423D:B06 


UC2-PrimColon 




1769 


27083 


1791.D18.gz43_227446 


F 


M00056413B:D07 


UC2-PrimColon 




1770 


729899 


1791E18.gz43_227447 


F 


M00056414C:H08 


UC2-PrimColon 




1771 


728986 


1791G18.gz43_227449 


F 


M00056416D:H10 


UC2-PrimColon 




1772 


640744 


1791.A19.gz43_227459 


F 


M00056410A:A04 


UC2-PrimColon 




1773 


703978 


1791.B19.gz43_227460 


F 


M00056411B:D01 


UC2-PrimColon 




1774 


725408 


1791.019.gz43_227473 


F 


M00056424C:B10 


UC2-PrimColon 




1775. 


725407 


1791H20.gz43_227482 


F 


M00056417D:G09 


UC2-PrimColon 




1776 


736507 


1791.K21.gz43_227501 


F 


M00056420C:E08 


UC2-PrimColon 




1777 


555359 


1791.O21.gz43_227505 


F 


M00056424C:F02 


UC2-PrimColon 




1778 


724773 


1791C22.gz43_227509 


F 


M00056412B:B08 


UC2-PrimColon 


; : s 


1779 


724220 


1791.P22.gz43_227522 


F 


M00056425D:D08 


UC2-PrimColon 




1780 


641383 


1791.A23.gz43_227523 


F 


M00056410B:A09 


UC2-PrimColon 


H 


1781 


562581 


1791K23.gz43_227533 


F 


M00056420D:A09 


UC2-PrimColon 




1782 


726768 


1791P23.gz43_227538 


F 


M00056425D:H01 


UC2-PrimColon 




1783 


729579 


1 79 1 .H24.gz43_227546 


F 


M00056418A:G01 


UC2-PrimColon 




1784 


498194 


1791.L24.gz43_227550 


F 


M00056422A:F07 


UC2-PrimColon 




1785 


288134 


1791.N24.gz43_227552 


F 


M00056423D:F10 


UC2-PrimColon 




1786 


553983 


1792.P01.gz43_227582 


F 


M00056486B:G02 


UC2-PrimColon 


U 1 


1787 


724400 


1792 E03.gz43_227603 


F 


M00056433B:A09 


UC2-PrimColon 




1788 


645476 


1792J05.gz43_227640 


F 


M00056479C:E04 


UC2-PrimColon 




1789 


556115 


1792D06.gz43_227650 


F 


M00056432A:A01 


UC2-PrimColon 




1790 


734287 


1792.F06.gz43_227652 


F 


M00056434C:B01 


UC2-PrimColon 




1791 


725584 


1792.107.gz43_227671 


F 


M00056478C:H01 


UC2-PrimColon 




1792 


725006 


1792.P07.gz43_227678 


F 


M00056486D:A12 


UC2-PrimColon 




1793 


731158 


1792.A09.gz43_227695 


F 


M00056426B:G12 


UC2-PrimColon 




1794 


592186 


1792.B09.gz43_227696 


F 


M00056428A:F10 


UC2-PrimColon 




1795 


452618 


1792.M09.gz43_227707 


F 


M00056483A:F10 


UC2-PrimColon 




1796 


725238 


1792G10.gz43_227717 


F 


M00056476C:E05 


UC2-PrimColon 




1797 


727948 


1792.P10.gz43_227726 


F 


M00056486D:F04 


UC2-PrimColon 




1798 


452981 


1792.All.gz43_227727 


F 


M00056426C:G05 


UC2-PrimColon 




1799 


710362 


1792.Bll.gz43_227728 


F 


M00056428B:B01 


UC2-PrimColon 




1800 


480005 


1792.Cll.gz43_227729 


F 


M00056429D:E02 


UC2-PrimColon 




1801 


471931 


1792.Pll.gz43_227742 


F 


M00056487A:C10 


UC2-PrimColon 




1802 


725454 


1792.H12.gz43_227750 


F 


M00056477C:G09 


UC2-PrimColon 




1803 


729584 


1792.112.gz43_227751 


F 


M00056478D:G02 


UC2-PrimColon 




1804 


725210 


1792N12.gz43_227756 


F 


M00056484B:E11 


UC2-PrimColon 




1805 


462687 


1792.P12.gz43_227758 


F 


M00056487A:D01 


UC2-PrimColon 




1806 


724183 


1792.M13.gz43_227771 


F 


M00056483B:E04 


UC2-PrimColon 
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Table 2 



IQ 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1807 


724390 


1792M14.gz43_227787 


F 


M00056483B:F04 


UC2-PrimColon 


1808 


647375 


1792.G16.gz43_227813 


F 


M00056476D:H1 1 


UC2-PrimColon 


1809 


736751 


1792.G17.gz43_227829 


F 


M00056477A:B11 


UC2-PrimColon 


1810 


725348 


1792.P17.gz43 227838 


F 


M00056487A:F05 


UC2-PrimColon 


1811 


481594 


1792.L18.gz43_227850 


F 


M00056482B:B11 


UC2-PrimColon 


1812 


675299 


1792.N18.gz43_227852 


F 


M00056484D:D01 


UC2-PrimColon 


1813 


734646 


1792.K19.gz43_227865 


F 


M00056481A:F02 


UC2-PrimColon 


1814 


453713 


1792L20.gz43_227882 


F 


M00056482B:C09 


UC2-PrimColon 


1815 


724183 


1792.M20.gz43_227883 


F 


M00056483C:D06 


UC2-PrimColon 


1816 


730282 


1792.M24.gz43_227947 


F 


M00056483D:F06 


UC2-PrimColon 


1817 


733006 


1801.D01.gz43_227954 


F 


M00056491B:B09 


UC2-PrimColon 


1818 


733151 


1801.101.gz43_227959 


F 


M00056496D:B12 


UC2-PrimColon 


1819 


734261 


1801J01.gz43_227960 


F 


M00056497C:D05 


UC2-PrimColon 


1820 


514142 


1801.K01.gz43_227961 


F 


M00056498D:C01 


UC2-PrimColon 


1821 


471232 


1801.M01.gz43_227963 


F 


M00056501B:B09 


UC2-PrimColon 


1822 


447785 


1801.D02.gz43_227970 


F 


M00056491B:E02 


UC2-PrimColon 


1823 


523753 


1801J02.gz43_227976 


F 


M00056497C:E01 


UC2-PrimColon 


1824 


451383 


1801.M02.gz43_227979 


F 


M00056501B:C07 


UC2-PrimColon 


1825 


635951 


1801K03.gz43_227993 


F 


M00056499A:A04 


UC2-PrimColon 


1826 


731820 


1801.A04.gz43_227999 


F 


M00056487C:E03 


UC2-PrimColon 


1827 


446928 


1801E04.gz43_228003 


F 


M00056492B:D06 


UC2-PrimColon 


1828 


732598 


1801.P04.gz43_228014 


F 


M00056504B:B01 


UC2-PrimColon 


1829 


725434 


1801.B05.gz43_228016 


F 


M00056488B:G10 


UC2-PrimColon 


1830 


460690 


1801G05.gz43_228021 


F 


M00056495A:C02 


UC2-PrimColon 


1831 


611604 


1801J05.gz43_228024 


F 


M00056497D:C11 


UC2-PrimColon 


1832 


635951 


1801.L05.gz43_228026 


F 


M00056500A:G12 


UC2-PrimColon 


1833 


429191 


1801.B06.gz43_228032 


F 


M00056488B:H04 


UC2-PrimCoIon 


1834 


640116 


1801.O06.gz43_228045 


F 


M00056503B:G1 1 


UC2-PrimColon 


1835 


734043 


1813.H05.gz43_229558 


F 


M00056576A:A04 


UC2-PrimColon 


1836 


508210 


1813.106.gz43_229575 


F 


M00056577A:F10 


UC2-PrimColon 


1837 


11632 


1813O07.gz43_229597 


F 


M00056585C:C08 


UC2-PrimColon 


1838 


475757 


1813H08.gz43_229606 


F 


M00056576A:E01 


UC2-PrimColon 


1839 


729281 


1813.J10.gz43_229640 


F 


M00056578D:A02 


UC2-PrimColon 


1840 


733970 


1813.N10.gz43_229644 


F 


M00056584C:A06 


UC2-PrimColon 


1841 


727410 


1813.A13.gz43_229679 


F 


M00056567A:H06 


UC2-PrimColon 


1842 


465446 


1813.L13.gz43_229690 


F 


M00056581D:A08 


UC2-PrimColon 


1843 


511351 


1813.M13.gz43_229691 


F 


M00056583C:A05 


UC2-PrimColon 


1844 


733664 


1813.120.gz43_229799 


F 


M00056577D:F08 


UC2-PrimColon 


1845 


548893 


1813C22.gz43_229825 


F 


M00056570A:D09 


UC2-PrimColon 


1846 


513540 


1813.H22.gz43_229830 


F 


M00056576C:G01 


UC2-PrimColon 


1847 


725759 


1813.122.gz43_229831 


F 


M00056578A:B05 


UC2-PrimColon 


1848 


726104 


1816.C19.gz43_230929 


F 


M00056633D:E05 


UC2-PrimColon 
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Table 2 



SEQ 
IIDNOI 



CLUSTER 



1849 
1850 



SEQ NAME 



482868 



ORIENl 
T 



CLONE ID 



1816.E19.gz43 23093 1 



LIBRARY 



M00056636C:H05 UC2-PrimColon 



727332 
727078 



1816.I19.gz43 230935 



M00056641C:G09 I UC2-PriniCn1on 



1851 
1852 



1816.K19.gz43 230937 



M00056644C:B03 I UC2-PrimCnInn 



727108 
736325 



1816.L19.gz43 230938 



M00056645D:E06 I UC2-PrimCo1nn 



1853 
1854 



1816.B20.gz43 230944 



M00056632C:E03 UC2-PrimColon 



726818 



1816.J20.gz43 230952 



M00056642D:C05 I UC2-PrimColon 



1855 724083 



1816.M20.gz43 230955 



M00056646D:E03 I UC2-PrimColon 



1856 724565 



1816.N20.gz43 230956 



M00056648B:C08 | UC2-PrimColon 



1857 207530 



1816.O20.gz43 230957 



1858 736778 



1816.C21.gz43 23096 1 



M00056650A:E01 UC2-PrimColon 



M00056634A:C06 UC2-PrimColon 



1859. 559684 



1816.E21.gz43 230963 



M00056636D:F03 UC2-PrimColon 



1860 726494 



1816.B22.gz43 230976 



M00056632C;H08 UC2-PrimColon 



1861 419711 



1816.G22.gz43 23098 1 



M00056639A:F11 UC2-PrimColon 



1862 377935 



1816.M22.gz43 230987 



M00056646D:F07 UC2-PrimColon 



1863 726448 



1816.C23.gz43 230993 



M00056634A:G11 UC2-PrimColon 



1864 215249 



1816.G23.gz43 230997 



1865 735322 



1816.N23.gz43 231004 



M00056639A:H12 UC2-PrimColon 



M00056648B:E09 UC2-PrimColon 



1866 727175 



1816.Q23.gz43 23 1005 



M00056650B:C11 UC2-PrimColon 



1867 731966 



1816.P23.gz43 23 1006 



1868 727944 



1816.B24.gz43 231008 



M00056651B.E12 UC2-PrimColon 



M00056632D:E02 UC2-PrimColon 



1869 730596 



1816.M24.gz43 23 1019 



M00056647A.F06 UC2-PrimColon 



1870 725607 



1825.N13.gz43 231228 



M00056667A.A12 UC2-PrimColon 



1871 



45934 



1825.P13.gz43 231230 



M00056670A:G02 UC2-PrimColon 



1872 484987 



1825.F14.gz43 231236 



M00056658A:E11 UC2-PrimColon 



1873 481592 



1 825 .M14.gz43_23 1243 



M00056666A;C04 UC2-PrimColon 



1874 735605 



1825.Q14.gz43 23 1245 



M00056668D:E03 UC2-PrimColon 



1875 734184 



1825.P14.gz43 231246 



M00056670B:A12 UC2-PrimColon 



1876 477718 



1825.E16.gz43 231267 



M00056657A:F06 UC2-PrimColon 



1877 552927 



1825.F16.gz43 231268 



M00056658B:C03 UC2-PrimColon 



1878 719620 



1825.J16.gz43 231272 



M00056663A:B02 UC2-PrimColon 



1879 



27083 



1825.M16.gz43 23 1275 



M00056666A:D09 UC2-PrimColon 



1880 478458 



1825.P17.gz43 231294 



M00056670B:G05 UC2-PrimColon 



1881 448104 



1 825 .L19.gz43_23 1322 



1882 729675 



1825.J20.gz43 231336 



M00056664D:H06 UC2-PrimColon 



M00056663A:H09 UC2-PrimColon 



1883 730570 



1825.B21.gz43 23 1 344 



M00056653DF07 UC2-PrimColon 



1884 727108 



1825.D21.gz43 231346 



M00056656A:E01 UC2-PrimColon 



1885 447074 



1 825.12 l gz43 231351 



M00056661D:D06 UC2-PrimColon 



1886 463741 



1825.F22.gz43 231364 



M00056658C.B12 UC2-PrimColon 



1887 732351 



1825.N22.gz43 231372 



M00056667D:E05 UC2-PrimColon 



1888 727343 



1825.L23.gz43 231386 



M00056665A:G08 UC2-PrimColon 



1889 736415 



1825.B24.gz43 231392 



M00056654A:A07 UC2-PrimColon 



1890 554597 



1825.Q24.gz43 23 1405 



M00056669B:E10 UC2-PrimColon 
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Table 2 



SEQ 
I ID NO 



CLUSTER 



1891 



1892 
1893 



1894 
1895 



1896 



1897 



1898 



1899 



1900 
1901 



1902 
1903 



1904 
1905 



SEQ NAME 



ORIEN 
T 



723914 | 1826.A13.gz43 231599 



734622 | 1826.C14.gz43 231617 



645925 I 1826.I14.gz43 23 1 623 



641777 | 1826.D16.gz43 231650 



734828 | 1826.E17.gz43 231667 



381623 | 1826.I17.gz43 231671 



1906 



1907 



1908 



1909 
1910 



1911 
1912 



1913 



1914 



1915 



1916 



1917 
1918 



1919 
1920 



1921 
1922 



1923 



1924 



1925 



1926 



1927 



1928 



1929 



1930 



1931 
1932 



517274 | 1826.J17.gz43 231672 



446575 | 1826.C18.gz43 23 1 68 



725899 | 1826.D18.gz43 231682 



492094 | 1826.M19.gz43 23 1 707 



736246 | 1826.E20.gz43 231715 



473238 I 1826.J20.gz43 231720 



730829 | 1826.K20.gz43 231721 



726722 1826.A21.gz43 231727 



492629 | 1 826.02 l.gz43 231741 



730664 | 1826.E22.gz43 231747 



729387 | 1826.G22.gz43 231749 



546121 | 1826.A23.gz43 231759 



513156 1 1826.C23.gz43 231761 



728791 | 1826.G24.gz43 231781 



559610 | 1826.I24.gz43 231783 



729205 | 1826.J24.gz43 23 1 784 



561301 I 1827.C01.gz43 23 1 793 



728556 | 1827.E01.gz43 231795 



640458 I 1827.101 .gz43 23 1799 



650136 I 1827.K01.gz43 231801 



288801 | 1827.L01.gz43 23 1 802 



648777 | 1827.M01.gz43 231803 



728196 | 1827.G02.gz43 231813 



644354 I 1827.I02.gz43 231815 



419465 | 1827.M02.gz43 231819 



558317 I 1827.P02.gz43 231822 



693869 | 1827B03.gz43_231824 



553805 | 1827.L03.gz43 231834 



626791 | 1827.N03.gz43 231836 



580601 I 1827.A04.gz43 231839 



493830 I 1827.B04.gz43 231840 



724714 I 1827.E04.gz43 231843 



737109 | 1827.F04.gz43 231844 



395604 | 1827.G04.gz43 23 1 845 



735834 | 1827.J04.gz43 231848 



726852 | 1827.L04.gz43 231850 



CLONE ID 



LIBRARY 



M00056672A:E05 UC2-PrimColon 



M00056674D.E09 UC2-PrimColon 



M00056683B.H04 UC2-PrimColon 



M00056676B:G12 UC2-PrimColon 



M00056678B:E11 UC2-PrimColon 



M00056683C:H10 | UC2-PrimColon 



M00056685A:H11 UC2-PrimColon 



M00056675A:C02 UC2-PrimColon 



M00056676C:B04 UC2-PrimColon 



M00056689B:F03 UC2-PrimColon 



M00056678B:H02 UC2-PrimColon 



M00056685B:G04 I UC2-PrimColon 



M00056686D:E04 UC2-PrimColon 



M00056672D:B08 I UC2-PrimColon 



M00056692A:A05 UC2-PrimColon 



M00056678C:A12 UC2-PrimColon 



M00056681A:E06 UC2-PrimColon 



M00056672D:E04 UC2-PrimColon 



M00056675A:G02 UC2-PrimColon 



M00056681B:A11 UC2-PrimCoIon 



M00056684BB12 UC2-PrimColon 



M00056685C:G07 UC2-PrimColon 



M00056695C:C04 UC2-PrimColon 



M00056697C:H10 UC2-PrimColon 



M00056701C;G09 UC2-PrimColon 



M00056703C:F04 UC2-PrimColon 



M00056704D:H10 UC2-PrimColon 



M00056706A:B05 UC2-PrimColon 



M00056699C:F09 UC2-PrimColon 



M00056701D:C03 UC2-PrimColon 



M00056706A:E09 UC2-PrimColon 



M00056708C:F11 UC2-PrimColon 



M00056694A:G03 UC2-PrimColon 



M00056705A:B12 UC2-PrimColon 



M00056707A:G11 UC2-PrimColon 



M00056693B:D11 UC2-PrimColon 



M00056694A:G11 UC2-PrimColon 



M00056697D:B09 UC2-PrimColon 



M00056698D:E11 UC2-PrimColon 



M00056699C:G11 UC2-PrimColon 



M00056702D:G03 UC2-PrimColon 



M00056705A:D02 UC2-PrimColon 
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Table 2 



SEQ 
I ID NO 



CLUSTER 



SEQ NAME 



IorienI 

T 



CLONE ID 



LIBRARY 



1933 



728892 I 1827.P05.gz43 23 1 870 



M00056708D:B03 I UC2-PrimColon 



1934 



540618 1827.C06.gz43 23 1 873 



M00056695C:H08 UC2-PrimColon 



1935 
1936 



501030 1827.E06.gz43 231875 



M00056697D:C09 UC2-PrimColon 



477366 | 1827.F06.gz43 23 1 876 



1937 
1938 



727948 I 1827.I06.gz43 231879 



M00056698D:G09 [ UC2-PrimColon 
M00056701D;DQ4 | UC2-PrimColon 



1939 
1940 



649927 | 1827.K06.gz43 23 1881 



449752 1827.M06.gz43 231883 



M00056703D:F08 UC2-PrimColon 



M00056706B:C02 UC2-PrimColon 



649106 | 1827.N06.gz43 23 1 884 



M00056707B:E02 UC2-PrimColon 



1941 



730463 1827.P06.gz43 231886 



M00056708D;D10 UC2-PrimColon 



1942 
1943 



462687 1827.E07.gz43 231891 



M00056697D:C12 UC2-PrimColon 



1944 
1945 



553140 | 1827.I07.gz43 231895 



736100 1827.M07.gz43_231899 



M00056701D:GO3 I UC2-PrimColon 



M00056706B:C07 UC2-PrimColon 



730375 I 1827.B08.gz43 231904 



M00056694B:H10 | UC2-PrimColon 



1946 
1947 



403671 | 1827.K08.gz43 231913 



M00056704A:B05 I UC2-PrimColon 



595181 | 1827.M08.gz43 231915 



M00056706B:D04 UC2-PrimColon 



1948 
1949 



573733 I 1827.A09.gz43 231919 



M00056693B:H03 UC2-PrimColon 



1950 
1951 



550454 I 1827.I09.gz43 231927 



651050 1 1827.J09.gz43 23 1 928 



M00056701D:H12 UC2-PrimColon 



M00056703A:G01 UC2-PrimColon 



728768 | 1827.L09.gz43 23 1 930 



M00056705B:A12 UC2-PrimColon 



1952 
1953 



736276 | 1827.N09.gz43 231932 



M00056707B:F06 UC2-PrimColon 



472837 | 1827.C10.gz43 231937 



M00056695D:E05 UC2-PrimColon 



1954 



1955 



732770 | 1827.H10.gz43 231942 



83388 | 1827.L10.gz43 23 1 946 



M00056700D:H07 UC2-PrimColon 



M00056705B:D10 UC2-PrimColon 



1956 
1957 



584693 | 1827.M10.gz43 231947 



M00056706C:A07 UC2-PrimColon 



588059 | 1827.Dll.gz43 231954 



M00056697A.D02 UC2-PrimColon 



1958 
1959 



452662 I 1827.Fll.gz43 231956 



M00056699A:D08 UC2-PrimColon 



728898 | 1827.Mll.gz43 231963 



M00056706C:B12 UC2-PrimColon 



1960 
1961 



672032 I 1827.F12.gz43 231972 



M00056699A:E08 UC2-PrimColon 



496084 I 1827.H12.gz43 23 1 974 



M00056701A:A02 UC2-PrimColon 



1962 



1963 



736579 | 1827.Q12.gz43 231981 



727968 | 1827.A13.gz43 231983 



M00056708B:B09 UC2-PrimColon 



M00056693C:D12 UC2-PrimCoIon 



1964 
1965 



447624 1827J13.gz43_231992 



M00056703B:A04 UC2-PrimColon 



1966 



728552 1827.K13.gz43 231993 



733064 | 1827.M13.gz43 231995 



M00056704A:H08 UC2-PrimColon 



M00056706C:D05 UC2-PrimColon 



1967 



607202 | 1827.01 3 gz43 23 1997 



M00056708B:D03 UC2-PrimColon 



1968 



473433 I 1827.P13.gz43 231998 



M00056708D:H06 UC2-PrimColon 



1969 



617813 I 1827.A14.gz43 23 1999 



M00056693C;G06 UC2-PrimColon 



1970 



728300 | 1827.G14.gz43 232005 



M00056700A:F12 UC2-PrimColon 



1971 



727760 | 1827.K14.gz43 232009 



M00056704B:C02 UC2-PrimColon 



1972 



1973 
1974 



454087 | 1827.P14.gz43 232014 



425396 | 1827:A15.gz43 232015 



M00056709A:A05 UC2-PrimColon 



M00056693D:B02 UC2-PrimColon 



552430 | 1827.N15.gz43 232028 



M00056707C:E01 UC2-PrimColon 



119 




Table 2 





CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


1975 


650542 


1827.C16.gz43 232033 


F 


M00056696A:G12 


UC2-PrimColon 


1976 


725084 


1827.D16.gz43 232034 


F 


M00056697A:H10 


UC2-PrimColon 


1977 


446607 


1827.G16.gz43 232037 


F 


M00056700A:G05 


UC2-PrimColon 


1978 


4584 


1827.H16.gz43 232038 


F 


M00056701A:G09 


UC2-PrimColon 


1979 


560678 


1827.J16.gz43_232040 


F 


M00056703B:D07 


UC2-PrimColon 


1980 


730639 


1827.N16.gz43_232044 


F 


M00056707C:E03 


UC2-PrimColon 


1981 


727506 


1827.C17.gz43_232049 


F 


M00056696B:B03 


UC2-PrimColon 


1982 


540618 


1827.E17.gz43 232051 


F 


M00056698B:E10 


UC2-PrimColon 


1983 


735676 


1 827.1 17.gz43_232055 


F 


M00056702C:A10 


UC2-PrimCoIon 


1984 


728768 


1827.N17.gz43_232060 


F 


M00056707C:F10 


UC2-PrimColon 


1985 


475942 


1827.A18.gz43_232063 


F 


M00056693D:E12 


UC2-PrimColon 


1986 


631056 


1827.H18.gz43_232070 


F 


M00056701B:D02 


UC2-PrimColon 


1987 


259218 


1827.P18.gz43_232078 


F 


M00056709A:H11 


UC2-PrimColon 


1988 


724489 


1827.H19.gz43_232086 


F 


M00056701B.D06 


UC2-PrimColon 


1989 


727845 


1827.D21.gz43_232114 


F 


M00056697C:D11 


UC2-PrimColon 


1990 


641072 


1827.F21.gz43_232116 


F 


M00056699B:G04 


UC2-PrimColon 


1991 


553483 


1827.A22.gz43_232127 


F 


M00056694A.C05 


UC2-PrimColon 


1992 


551096 


1827.C24.gz43_232161 


F 


M00056696B:H01 


UC2-PrimColon 


1993 


727649 


1827.F24.gz43_232164 


F 


M00056699C:C02 


UC2-PrimColon 


1994 


728196 


1827H24.gz43_232166 


F 


M00056701C:F08 


UC2-PrimColon 


1995 


729428 


1828.A01.gz43_232175 


F 


M00056709C:F06 


UC2-PrimColon 


1996 


724050 


1828.F01.gz43_232180 


F 


M00056714B:C12 


UC2-PrimColon 


1997 


732351 


1828.L01.gz43_232186 


F 


M00056721C:E05 


UC2-PrimColon 


1998 


730627 


1828.M01.gz43_232187 


F 


M00056722C:C09 


UC2-PrimColon 


1999 


728182 


1828.H02.gz43 232198 


F 


M00056716B:F12 


UC2-PrimColon 


2000 


727878 


1828.M02.gz43_232203 


F 


M00056722C:D11 


UC2-PrimColon 


2001 


462687 


1 828.N02.gz43_232204 


F 


M00056723C:C09 


UC2-PrimColon 


2002 


555763 


1828.L03.gz43_232218 


F 


M00056721C:H01 


UC2-PrimColon 


2003 


728627 


1828.P03.gz43 232222 


F 


M00056725C:H06 


UC2-PrimColon 


2004 


630516 


1828.H04.gz43_232230 


F 


M00056716C:B06 


UC2-PrimColon 


2005 


584179 


1828.A05.gz43_232239 


F 


M00056709D:D05 


UC2-PrimColon 


2006 


558544 


1828.G05.gz43_232245 


F 


M00056715C:B07 


UC2-PrimColon 


2007 


553294 


1828.N05.gz43_232252 


F 


M00056723C.E01 


UC2-PrimColon 


2008 


726576 


1 828.005 .gz43_232253 


F 


M00056724C:H11 


UC2-PrimColon 


2009 


734622 


1828.L07.gz43_232282 


F 


M00056721D:D01 


UC2-PrimColon 


2010 


735362 


1828.P07.gz43 232286 


F 


M00056726A:C12 


UC2-PrimColon 


2011 


455297 


1828.A08.gz43_232287 


F 


M00056709D:E12 


UC2-PrimCoIon 


2012 


734990 


1828.N08.gz43_232300 


F 


M00056723C:G03 


UC2-PrimColon 


2013 


707609 


1828.B10.gz43_232320 


F 


M00056710D:C05 


UC2-PrimColon 


2014 


728178 


1828.L10.gz43_232330 


F 


M00056721D:F12 


UC2-PrimCoIon 


2015 


447634 


1828.P10.gz43_232334 


F 


M00056726B:H06 


UC2-PrimColon 


2016 


729813 


1828.Fll.gz43_232340 


F 


M00056714D:A11 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


. 

LIBRARY 


2017 


728528 


1828.11 l.gz43_232343 


F 


M00056718A:H05 


UC2-PrimColon 


2018 


732770 


1828.Jll.gz43_232344 


F 


M00056719B:G04 


UC2-PrimColon 


2019 


505858 


1828.F12.gz43_232356 


F 


M00056714D:E08 


UC2-PrimColon 


2020 


735047 


1828E13.gz43_232371 


F 


M00056714A:B02 


UC2-PrimColon 


2021 


727749 


1828.K13.gz43_232377 


F 


M00056721A:C07 


UC2-PrimColon 


2022 


598746 


1828L13.gz43_232378 


F 


M00056722A:E10 


UC2-PrimColon 


2023 


728061 


1828.013.gz43_232381 


F 


M00056725A:E02 


UC2-PrimColon 


2024 


448217 


1828.F14.gz43_232388 


F 


M00056714D:H07 


UC2-PrimColon 


2025 


485899 


1828.L14.gz43_232394 


F 


M00056722A:F08 


UC2-PrimColon 


2026 


641287 


1828B15.gz43_232400 


F 


M00056711A:A09 


UC2-PrimCoIon 


2027. 


48619 


1828.D15.gz43 232402 


F 


M00056713A:F05 


UC2-PrimColon 


2028 


728929 


1828.E15.gz43_232403 


F 


M00056714A:B09 


UC2-PrimColon 


2029 


451972 


1828.115.gz43_232407 


F 


M00056718B:C02 


UC2-PrimCoIon 


2030 


486363 


1828J18.gz43_232456 


F 


M00056719C:B10 


UC2-PrimColon 


2031 


728897 


1828C19.gz43_232465 


F 


M00056712B:B11 


UC2-PrimColon 


2032 


448217 


1828.E19.gz43_232467 


F 


M00056714A:E12 


UC2-PrimColon 


2033 


646309 


1828.119.gz43_232471 


F 


M00056718C:B01 


UC2-PrimColon 


2034 


728303 


1828.J19.gz43_232472 


F 


M00056719C:F06 


UC2-PrimColon 


2035 


728002 


1828.K19.gz43_232473 


F 


M00056721B:D03 


UC2-PrimColon 


2036 


473640 


1828.N20.gz43_232492 


F 


M00056724B:E11 


UC2-PrimColon 


2037 


727480 


1828.O21.gz43_232509 


F 


M00056725C:A03 


UC2-PrimColon 


2038 


509678 


1828.P21.gz43_232510 


F 


M00056726D:G08 


UC2-PrimColon 


2039 


728445 


1828.E22.gz43_232515 


F 


M00056714A:H06 


UC2-PrimColon 


2040 


447150 


1828.N22.gz43J232524 


F 


M00056724B:G03 


UC2-PrimColon 


2041 


728273 


1828.A23.gz43_232527 


F 


M00056710B:F05 


UC2-PrimColon 


2042 


734453 


1828.C23.gz43 232529 


F 


M00056712B:F02 


UC2-PrimColon 


2043 


732006 


1828.D23.gz43_232530 


F 


M00056713C:A10 


UC2-PrimColon 


2044 


449585 


1828.123.gz43_232535 


F 


M00056718D:D12 


UC2-PrimColon 


2045 


728966 


1828.E24.gz43_232547 


F 


M00056714B:B06 


UC2-PrimColon 


2046 


529356 


1838.C01.gz43_232945 


F 


M00056747D:A03 


UC2-PrimColon 


2047 


551693 


1838.P01.gz43_232958 


F 


M00056764C:A02 


UC2-PrimColon 


2048 


438663 


1838B03.gz43_232976 


F 


M00056746D:D06 


UC2-PrimColon 


2049 


735464 


1838.E03.gz43_232979 


F 


M00056750B:H03 


UC2-PrimColon 


2050 


736035 


1838.J03.gz43_232984 


F 


M00056756C:D06 


UC2-PrimColon 


2051 


736210 


1838.L03.gz43_232986 


F 


M00056759C:C04 


UC2-PrimColon 


2052 


728464 


1838.105.gz43_233015 


F 


M00056755B:E07 


UC2-PrimColon 


2053 


730178 


1838.K05.gz43_233017 


F 


M00056758C:B08 


UC2-PrimColon 


2054 


481614 


1838.N05.gz43_233020 


F 


M00056762B:D06 


UC2-PrimColon 


2055 


402799 


1839A01.gz43_233327 


F 


M00056766D:G09 


UC2-PrirnColon 


2056 


675768 


1839.N01.gz43_233340 


F 


M00056784D:C06 


UC2-PrimColon 


2057 


577305 


1839.P01.gz43_233342 


F 


M00056787B:C07 


UC2-PrimColon 


2058 


730505 


1839.K02.gz43_233353 


F 


M00056781A:E06 


UC2-PrimColon 
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# • 



Table 2 



1 ID JNL 


> CLUSTER 


SEQ NAME 


ORIEIs 
T 


CLONE ID 


LIBRARY 


2059 


349977 


1839.B03.gz43 233360 


F 


M00056768D:A09 


UC2-PrimColon 


2060 


726344 


1839.N03.gz43 233372 


F 


M00056784D:G10 


UC2-PrimColon 


2061 


724183 


1839.J04.gz43 233384 


F 


M00056780B:E06 


UC2-PrimColon 


2062 


730352 


1839.M04.gz43 233387 


F 


M00056783C:E03 


UC2-PrimColon 


2063 


510596 


1839.E05.gz43 233395 


F 


M00056773C.C09 


UC2-PrimColon 


2064 


128773 


1839.F05.gz43 233396 


F 


M00056775A:A05 


UC2-PrimColon 


2065 


728464 


1839.G06.gz43 233413 


F 


M00056776A:G04 


UC2-PrimColon 


2066 


449010 


1839.L06.gz43 233418 


F 


M00056782B:G05 


UC2-PrimColon 


2067 


460023 


1839.M06.gz43 233419 


F 


M00056783D:A02 


UC2-PrimColon 


2068 


733744 


1839.K07.gz43 233433 


F 


M00056781B:C03 


UC2-PrimColon 


2069 


2180 


1839.L07.gz43 233434 


F 


M00056782B:H11 


UC2-PrimColon 


2070 


556458 


1839.M07.gz43 233435 


F 


M00056783D:B07 


UC2-PrimColon 


2071 


711325 


1839.A08.gz43 233439 


F 


M00056767A:F12 


UC2-PrimColon 


2072 


135593 


1839.E08.gz43 233443 


F 


M00056773C:F12 


UC2-PrimColon 


2073 


639178 


1839.K08.gz43 233449 


F 


M00056781B:C05 


UC2-PrimColon 


2074 


504513 


1839.B09.gz43 233456 


F 


M00056769C:C03 


UC2-PrimColon 


2075 


493193 


1839.J09.gz43 233464 


F 


M00056780B:H04 


UC2-PrimColon 


2076 


728133 


1839.K09.gz43 233465 


F 


M00056781B:E01 


UC2-PrimColon 


2077 


727737 


1839.All.gz43 233487 


F 


M00056767B:C01 


UC2-PrimColon 


2078 


477053 


1839.F12.gz43 233508 


F 


M00056775B:H07 


UC2-PrimColon 


2079 


726384 


1839.P12.gz43 233518 


F 


M00056787C:G01 


UC2-PrimColon 


2080 


736686 


1839.A13.gz43 233519 


F 


M00056767B:G08 


UC2-PrimColon 


2081 


483549 


1839.L13.gz43 233530 


F 


M00056782D:F02 


UC2-PrimColon 


2082 


549801 


1839.A14.gz43 233535 


F 


M00056767C:A09 


UC2-PrimColon 


2083 


730484 


1839.H14.gz43 233542 


F 


M00056777D:D04 


UC2-PrimColon 


2084 


647991 


1839.J14.gz43 233544 


F 


M00056780C:H12 


UC2-PrimColon 


2085 


490393 


1839.L14.gz43 233546 


F 


M00056782D:F08 


UC2-PrimColon 


2086 


643968 


1839.N14.gz43 233548 


F 


M00056785C:B09 


UC2-PrimColon 


2087 


641968 


1839.B15.gz43 233552 


F 


M00056770C:A07 


UC2-PrimColon 


2088 


726261 


1839.L15.gz43 233562 


F 


M00056782D:F10 


UC2-PrimColon 


2089 


733891 


1839.M15.gz43 233563 


F 


M00056784B.A01 


UC2-PrimColon 


2090 


730022 


1839.015.gz43 233565 


F 


M00056786D:A03 


UC2-PrimColon 


2091 


730296 


1839.M16.gz43 233579 


F 


M00056784B.C02 


UC2-PrimColon 


2092 


733149 


1839.M17.gz43 233595 


F 


M00056784B:D06 


UC2-PrimColon 


2093 


728487 


1839.A18.gz43 233599 


F 


M00056768A:C10 


UC2-PrimCoIon 


2094 


726229 


1839.C18.gz43 233601 


F 


M00056771D:D10 


UC2-PrimColon 


1 ono^ 
| Z\)yj 


523868 


1839.G18.gz43 233605 


F 


M00056776D:B02 


UC2-PrimColon 


2096 


577305 


1839.L18.gz43 233610 


F 


M00056783A.C08 


UC2-PrimColon 


2097 


727216 


1839.J19.gz43 233624 


F 


M00056780D:G05 


UC2-PrimColon 


2098 


47461 


1839.L19.gz43 233626 


F 


M00056783B:B01 


UC2-PrimColon 


2099 


550780 


1839.N19.gz43 233628 


F 


M00056785D:C09 


UC2-PrimColon 


2100 


606076 


1839P21.gz43_233662 


F 


M00056788A:D06 


UC2-PrimColon 
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Table 2 



id Nn 




SEQ NAME 


ORIEN 
T 


[ 

CLONE ID 


LIBRARY 


2101 


691229 


1839.C22.gz43 233665 


F 


M00056772A:C08 


UC2-PrimColon 


2102 


727366 


1839.D22.gz43 233666 


F 


M00056773A:G10 


UC2-PrimCoIon 


2103 


724722 


1839.J22.gz43 233672 


F 


M00056780D:H10 


UC2-PrimCoIon 


2104 


725784 


1839.M22.gz43 233675 


F 


M00056784C:C06 


UC2-PrimColon 


2105 


732345 


1839.B23.gz43 233680 


F 


M00056771A:F03 


UC2-PrimCoIon 


2106 


726699 


1839.C23.gz43 233681 


F 


M00056772A:D03 


UC2-PrimColon 


2107 


513156 


1839.J23.gz43 233688 


F 


M00056781A:C05 


UC2-PrimColon 


2108 


631038 


1839.E24.gz43 233699 


F 


M00056774D:B02 


UC2-PrimColon 


2109 


609459 


1839.H24.gz43 233702 


F 


M00056778C:G08 


UC2-PrimCoIon 


2110 


482788 


1839.J24.gz43 233704 


F 


M00056781A:D02 


UC2-PrimColon 


2111 


730592 


1839.N24.gz43 233708 


F 


M00056785D:G07 


UC2-PrimColon 


2112 


735801 


1852.A13.gz43 235742 


F 


M00056871C:D05 


UC2-PrimColon 


2113 


472704 


1852.H13.gz43 235749 


F 


M00056879D:D01 


UC2-PrimColon 


2114 


649299 


1852.I13.gz43 235750 


F 


M00056881A.A10 


UC2-PrimColon 


2115 


650944 


1852.N13.gz43 235755 


F 


M00056886B:B10 


UC2-PrimColon 


2116 


481641 


1852.E14.gz43 235762 


F 


M00056876C:B02 


UC2-PrimColon 


2117 


488349 


1852.F14.gz43 235763 


F 


M00056877B:H09 


UC2-PrimColon 


2118 


735096 


1852.I14.gz43 235766 


F 


M00056881A:C02 


UC2-PrimColon 


2119 


594434 


1852.J14.gz43 235767 


F 


M00056882B:E12 


UC2-PrimCoIon 


2120 


550673 


1852.L14.gz43 235769 


F 


M00056884C:H08 


UC2-PrimColon 


2121 


735672 


1852.N14.gz43 235771 


F 


M00056886B:C05 


UC2-PrimColon 


2122 


732712 


1852.K15.gz43 235784 


F 


M00056883D:A07 


UC2-PrimColon 


2123 


472307 


1852.L15.gz43 235785 


F 


M00056884C:H11 


UC2-PrimColon 


2124 


729981 


1852.D16.gz43 235793 


F 


M00056875D:C04 


UC2-PrimColon 


2125 


639950 


1 852.1 16.gz43 235798 


F 


M00056881A:H02 


UC2-PrimColon 


2126 


724517 


1852.L16.gz43 235801 


F 


M00056884D:C07 


UC2-PrimColon 


2127 


735412 


1852.B17.gz43_235807 


F 


M00056873A:H06 


UC2-PrimColon 


2128 


724223 


1852.N17.gz43_235819 


F 


M00056886B:H02 


UC2-PrimColon 


2129 


727602 


1852.D18.gz43_235825 


F 


M00056875D:E09 


UC2-PrimColon 


2130 


561396 


1852.L18.gz43 235833 


F 


M00056884D:D06 


UC2-PrimColon 


2131 


726408 


1852.B19.gz43_235839 


F 


M00056873B:C09 


UC2-PrimColon 


2132 


730059 


1852.C19.gz43_235840 


F 


M00056874C:D05 


UC2-PrimColon 


2133 


655327 


1852.F19.gz43_235843 


F 


M00056877C:G12 


UC2-PrimColon 


2134 


732872 


1852.M19.gz43_235850 


F 


M00056886A:A09 


UC2-PrimColon 


2135 


473512 


1852.N19.gz43 235851 


F 


M00056886C:D02 


UC2-PrimColon 


2136 


730441 


1852.F20.gz43_235859 


F 


M00056877C:H03 


UC2-PrimColon 






l6JZ.rNZU.gZ4j_Z.3jSO/ 


F 


M00056886C:D11 


UC2-PrimColon 


2138 


733040 


1852.D21.gz43 235873 


F 


M00056875D:H12 


UC2-PrimColon 


2139 


732872 


1852.K21.gz43_235880 


F 


M00056883D:F07 


UC2-PrimColon 


2140 


152 


1 852.M2 1 .gz43_235882 


F 


M00056886A:C11 


UC2-PrimColon 


2141 


730301 


1852.122.gz43_235894 


F 


M00056881B:G04 


UC2-PrimColon 


2142 


471931 


1852.J22.gz43_235895 


F 


M00056882D:A06 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


OPJEN 
T 


CLONE ID 


LIBRARY 


2143 


730106 


1852.L23.gz43_235913 


F 


M00056885A.D12 


UC2-PrimColon 


2144 


649030 


1852.024.gz43 235932 


F 


M00056888A:H04 


UC2-PrimColon 


2145 


562345 


1861.104.gz43_235990 


F 


M00056900C:J312 


UC2-PrimColon 


2146 


472226 


1 86 l.P04.gz43_ 235997 


F 


M00056910B:F01 


UC2-PrimColon 


2147 


494393 


1861.E07.gz43 236034 


F 


M00056895C:E11 


UC2-PrimColon 


2148 


730393 


1861.B08.gz43_236047 


F 


M00056891C:H08 


UC2-PrimColon 


2149 


676221 


1861.G10.gz43_236084 


F 


M00056898C:B06 


UC2-PrimColon 


2150 


648774 


1861.011.gz43 236108 


F 


M00056909B:E11 


UC2-PrimColon 


2151 


448368 


1861.H12.gz43_236117 


F 


M00056899D:B06 


UC2-PrimColon 


2152 


489001 


1861.K12.gz43_236120 


F 


M00056902D.H09 


UC2-PrimColon 


2153 


557525 


1861.J14.gz43_236151 


F 


M00056902A:G12 


UC2-PrimColon 


2154 


724773 


1861.C16.gz43_236176 


F 


M00056893B:G12 


UC2-PrimColon 


2155 


730608 


1861.116.gz43_236182 


F 


M00056901B:C03 


UC2-PrimColon 


2156 


642897 


1861.K16.gz43 236184 


F 


M00056903A:C08 


UC2-PrimColon 


2157 


548275 


1861.N16.gz43 236187 


F 


M00056908B:D02 


UC2-PrimColon 


2158 


728640 


1861.C17.gz43_236192 


F 


M00056893C:A02 


UC2-PrimColon 


2159 


479572 


1861.B18.gz43_236207 


F 


M00056892B:C09 


UC2-PrimColon 


2160 


735834 


1861.E18.gz43_236210 


F 


M00056896A:C01 


UC2-PrimColon 


2161 


733425 


1861.K18.gz43 236216 


F 


M00056903A:F03 


UC2-PrimColon 


2162 


732653 


1861.P19.gz43_236237 


F 


M00056911B:A01 


UC2-PrimColon 


2163 


729060 


1861.G20.gz43_236244 


F 


M00056898D:H09 


UC2-PrimColon 


2164 


656667 


1861.H21.gz43_236261 


F 


M00056900B:D01 


UC2-PrimColon 


2165 


732676 


1861.G22.gz43 236276 


F 


M00056899A:A1 1 


UC2-PrimColon 


2166 


730296 


1861.H24.gz43 236309 


F 


M00056900B:F07 


UC2-PrimColon 


2167 


453001 


1861.K24.gz43_236312 


F 


M00056903B:D09 


UC2-PrimColon 


2168 


725451 


1861.N24.gz43_236315 


F 


M00056908D:A1 1 


UC2-PrimColon 


2169 


730851 


1863.C01.gz43 236704 


F 


M00056936C:E04 


UC2-PrimColon 


2170 


727596 


1863.H01.gz43 236709 


F 


M00056941D:D07 


UC2-PrimColon 


2171 


736293 


1863.L01.gz43_236713 


F 


M00O56946A:F07 


UC2-PrimColon 


2172 


733910 


1863.N01.gz43 236715 


F 


M00056948C:F03 


UC2-PrimColon 


2173 


77737 


1863.C02.gz43 236720 


F 


M00056936C:F11 


UC2-PrimColon 


2174 


638983 


1863.L02.gz43 236729 


F 


M00056946A:G06 


UC2-PrimColon 


2175 


736001 


1863.M02.gz43 236730 


F 


M00056947C:B04 


UC2-PrimColon 


2176 


731196 


1863.P02.gz43 236733 


F 


M00056950D:F12 


UC2-PrimColon 


2177 


446820 


1863.A03.gz43 236734 


F 


M00056934A:E07 


UC2-PrimColon 


2178 


734151 


1863.E03.gz43 236738 


F 


M00056939A:C01 


UC2-PrimColon 


2179 


456183 


1863.H03.gz43 236741 


F 


M00056941D:G05 


UC2-PrimColon 


2180 


734629 


1863.I03.gz43 236742 


F 


M00056943A:B04 


UC2-PrimColon 


2181 


733623 


1863.J03.gz43 236743 


F 


M00056944A:D09 


UC2-PrimColon 


2182 


733856 


1863.N03.gz43 236747 


F 


M00056948D:A11 


UC2-PrimColon 


2183 


727480 


1863.P03.gz43 236749 


F 


M00056950D:H09 


UC2-PrimColon 


2184 


448233 


1863.J04.gz43 236759 


F 


M00056944B:C02 


UC2-PrimColon 
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Table 2 



11-1 IN LI 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2185 


729691 


1863.105.gz43_236774 


F 


M00056943A:F01 


UC2-PrimColon 


2186 


500239 


1863.N05.gz43_236779 


F 


M00056949A:A04 


UC2-PrimColon 


2187 


646314 


1863.K06.gz43 236792 


F 


M00056945A:F02 


UC2-PrimColon 


2188 


492893 


1863.A08.gz43_236814 


F 


M00056934C:C04 


UC2-PrimColon 


2189 


451624 


1863.D08.gz43_236817 


F 


M00056938A:H09 


UC2-PrimColon 


2190 


552416 


1863.L08.gz43_236825 


F 


M00056946C:B08 


UC2-PrimColon 


2191 


555478 


1863.A10.gz43_236846 


F 


M00056934C:G09 


UC2-PrimColon 


2192 


616541 


1863 .G10.gz43 236852 


F 


M00056941B:B02 


UC2-PrimColon 


2193 


410487 


1863.O10.gz43_236860 


F 


M00056950A:H03 


UC2-PrimColon 


2194 


726825 


1863.Bll.gz43 236863 


F 


M00056935C:G08 


UC2-PrimColon 


2195 


733144 


1863.D12.gz43_236881 


F 


M00056938B.C09 


UC2-PrimColon 


2196 


642528 


1863 E12.gz43_236882 


F 


M00056939B:E05 


UC2-PrimColon 


2197 


735994 


1863.J12.gz43_236887 


F 


M00056944C:B03 


UC2-PrimColon 


2198 


724401 


1863K12.gz43_236888 


F 


M00056945C:A11 


UC2-PrimColon 


2199 


736129 


1863.M12.gz43_236890 


F 


M00056948A:D09 


UC2-PrimColon 


2200 


448046 


1863.P13.gz43_236909 


F 


M00056951B:F09 


UC2-PrimColon 


2201 


731216 


1863D15.gz43_236929 


F 


M00056938B:H08 


UC2-PrimColon 


2202 


449061 


1863.M15.gz43_236938 


F 


M00056948B:B03 


UC2-PrimColon 


2203 


730815 


1863.F17.gz43_236963 


F 


M00056940C:E05 


UC2-PrimColon 


2204 


732756 


1863.H17.gz43_236965 


F 


M00056942C:F11 


UC2-PrimColon 


2205 


468015 


1863.117.gz43_236966 


F 


M00056943C:A01 


UC2-PrimColon 


2206 


733081 


1863.J17.gz43_236967 


F 


M00056944D:A06 


UC2-PrimColon 


2207 


555484 


1863 L17.gz43_236969 


F 


M00056946D:G09 


UC2-PrimColon 


2208 


731355 


1863.A18.gz43_236974 


F 


M00056935A:C02 


UC2-PrimColon 


2209 


730375 


1863.G18.gz43_236980 


F 


M00056941B:G08 


UC2-PrimColon 


2210 


467803 


1863.M18.gz43_236986 


F 


M00056948B:G05 


UC2-PrimColon 


2211 


735306 


1863.D20.gz43_237009 


F 


M00056938C:F10 


UC2-PrimColon 


2212 


729316 


1863.120.gz43_237014 


F 


M00056943D:B02 


UC2-PrimColon 


2213 


551441 


1863.L20.gz43_237017 


F 


M00056947A:C05 


UC2-PrimColon 


2214 


735484 


1863.A21.gz43_237022 


F 


M00056935A.F04 


UC2-PrimColon 


2215 


171511 


1863 E21.gz43_237026 


F 


M00056939D:B02 


UC2-PrimColon 


2216 


728076 


1863.H21.gz43_237029 


F 


M00056942D:D07 


UC2-PrimColon 


2217 


511351 


1863.H22.gz43_237045 


F 


M00056942D:D11 


UC2-PrimColon 


2218 


731125 


1863.122.gz43_237046 


F 


M00056943D:H08 


UC2-PrimColon 


2219 


524706 ' 


1863.B24.gz43_237071 


F 


M00056936C:C06 


UC2-PrimColon 


2220 


485880 


1863 C24.gz43_237072 


F 


M00056937C:H08 


UC2-PrimColon 


2221 


736197 


1 864.103 .gz43_237 1 26 


F 


M00056961C:C07 


UC2-PrimColon 


2222 


731089 


1864.M04.gz43_237146 


F 


M00056968C:C06 


UC2-PrimColon 


2223 


642781 


1864.G12.gz43_237268 


F 


M00056959C:B10 


UC2-PrimColon 


2224 


731947 


1864.M14.gz43_237306 


F 


M00056969A:B07 


UC2-PrimColon 


2225 


448712 


1864.M16.gz43_237338 


F 


M00056969A:C07 


UC2-PrimColon 


2226 


736289 


1 864.1 17.gz43_237350 


F 


M00056962D:F09 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 




iN/YJVLc, 


T 


CLONE ID 


LIBRARY 


1121 


735123 


1873.E02.gz43 237490 


F 


M00056978D:B07 


UC2-PrimColon 


2228 


724411 


1873.H02.gz43 237493 


F 


M00056982D:B12 


UC2-PrimColon 


inn 

2229 


662617 


1873.P02.gz43 237501 


F 


M00056993D:C05 


UC2-PrimColon 


2230 


639934 


1873.M03.gz43 237514 


F 


M00056989C:H09 


UC2-PrimColon 


2231 


n i c /i 

733563 


1 873 .C04.gz43_237520 


F 


M00056977A:C02 


UC2-PrimColon 


2232 


203793 


1873.K04.gz43 237528 


F 


M00056987C:D08 


UC2-PrimColon 


2233 


703978 


1873.L04.gz43 237529 


F 


M00056988C:D05 


UC2-PrimColon 


2234 


732114 


1873.P05.gz43 237549 


F 


M00056993D:F05 


UC2-PrimColon 


2235 


n ii i i 

732213 


1873J06.gz43__237559 


F 


M00056986A:E09 


UC2-PrimColon 


2236 


n f oif 

725825 


1873.N06.gz43 237563 


F 


M00056990D:C11 


UC2-PrimCoIon 


2237 


n /i oi / 

734936 


1873. D08.gz43 237585 


F 


M00056978A:A03 


UC2-PrimColon 


2238 


731607 


1873.E08.gz43 237586 


F 


M00056979B:D03 


UC2-PrimColon 


2239 


*7i 1 1 ni 

731302 


1 873. K10.gz43 237624 


F 


M00056987D:A09 


UC2-PrimColon 


2240 


n 1 1 1 1 

732113 


1 873 . L 1 0.gz43_237625 


F 


M00056988D:F05 


UC2-PrimColon 


11 vl 1 

2241 


728013 


1873.M10.gz43 237626 


F 


M00056989D:F07 


UC2-PrimColon 


1 1 A 1 

2242 


730089 


1 873 .D 1 1 .gz43_237633 


F 


M00056978A:H04 


UC2-PrimColon 


11^1 
2243 


648034 


1873.E12.gz43 237650 


F 


M00056979C:D11 


UC2-PrimColon 


11 j* y| 

2244 


729623 


1873.112.gz43_237654 


F 


M00056985A:D06 


UC2-PrimColon 


11/t£ 

2245 


731317 


1873.M12.gz43_237658 


F 


M00056989D.H1 1 


UC2-PrimColon 


2246 


HOI ^ 

728249 


1 873 .A 1 3 .gz43_23 7662 


F 


M00056975A:H1 1 


UC2-PrimColon 


n /it 
2247 


613029 


1 873 .C 1 3 ,gz43_237664 


F 


M00056977B:B06 


UC2-PrimColon 


11 yl O 

2248 


731542 


1 873 .0 1 3 .gz43_237676 


F 


M00056993A:B08 


UC2-PrimColon 


2249 


732535 


1873.G14.gz43_237684 


F 


M00056981D:H02 


UC2-PrimColon 


11 C f\ 

2250 


735440 


1873.A16.gz43_237710 


F 


M00056975B:E04 


UC2-PrimColon 


2251 


641658 


1 873 . D 1 6 .gz43_23 77 1 3 


F 


M00056978B:G02 


UC2-PrimColon 


2252 


40693 1 


1873.N18.gz43 237755 


F 


M00056991C:H11 


UC2-PrimColon 


2253 


736401 


1873.P18.gz43_237757 


F 


M00056994B:H05 


UC2-PrimColon 


2254 


734561 


1873.B20.gz43 237775 


F 


M00056976C:F05 


UC2-PrimCoIon 


2255 


727151 


1873.F20.gz43_237779 


F 


M00056980D:E07 


UC2-PrimColon 


2256 


731453 


1873.I20.gz43 237782 


F 


M00056985C:C06 


UC2-PrimCoIon 


iifi 
2257 


642649 


1873.K20.gz43_237784 


F 


M00056988A:F06 


UC2-PrimColon 


11 ^ o 

2258 


/ 1 1 a o 

642288 


1873.M20.gz43 237786 


F 


M00056990B:H07 


UC2-PrimColon 


2259 


572426 


1873.P20.gz43 237789 


F 


M00056994C:B04 


UC2-PrimColon 


2260 


731358 


1873.K22.gz43_237816 


F 


M00056988B:A06 


UC2-PrimColon 


ii/i 
2261 


A 1 1 O Ol 

471883 


1 873 .N22 .gz43_23 78 1 9 


F 


M00056992A:E01 


UC2-PrimColon 


n/1 

2262 


n i no 

733778 


1 873 .D24.gz43_23784 1 


F 


M00056978D.A01 


UC2-PrimColon 




71 1 SO? 


16 / «3.JZ*t.gZ'f J Zj/oH/ 


r 


M00056987A:C02 


UC2-PnmColon 


2264 


456236 


1874.D01.gz43 237857 


F 


M00056998C:B10 


UC2-PrimColon 


2265 


732792 


1 874.N0 l.gz43_237867 


F 


M00057009C:B02 


UC2-PrimCoIon 


2266 


733932 


1874.E03.gz43_237890 


F 


M00057000A:A05 


UC2-PrimColon 


2267 


724810 


1874.P07.gz43 237965 


F 


M00057011C:H03 


UC2-PrimCoIon 


2268 


731317 


1874.B08.gz43_237967 


F 


M00056996D:A02 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2269 


726408 


1874.D08.gz43 237969 


F 


M00056998D:H08 


UC2-PrimColon 


2270 


734708 


1874.D12.gz43 238033 


F 


M00056999A:G12 


UC2-PrimColon 


2271 


631251 


1874.A15.gz43j238078 


F 


M00056995C:H06 


UC2-PrimColon 


2272 


726696 


1874.D16.gz43 238097 


F 


M00056999B:D07 


UC2-PrimColon 


2273 


448202 


1874.F16.gz43_238099 


F 


M00057001D.F02 


UC2-PrimColon 


2274 


734629 


1874H16.gz43_238101 


F 


M00057003D:F02 


UC2-PrimColon 


2275 


561993 


1874.K16.gz43_238104 


F 


M00057007B:G02 


UC2-PrimColon 


2276 


407723 


1874.J17.gz43_238119 


F 


M00057006A:G10 


UC2-PrimColon 


2277 


736413 


1874.K17.gz43_238120 


F 


M00057007C:A06 


UC2-PrimColon 


2278 


556637 


1874.P17.gz43_238125 


F 


M00057012A:D12 


UC2-PrimColon 


2279 


731686 


1874.A19.gz43 238142 


F 


M00056995D:C11 


UC2-PrimColon 


2280 


559053 


1874.B20.gz43_238159 


F 


M00056997B:C11 


UC2-PrimColon 


2281 


733625 


1875. JO l.gz43_23 8247 


F 


M00057024B:F07 


UC2-PrimColon 


2282 


461486 


1875.O01.gz43_238252 


F 


M00057030C:B03 


UC2-PrimColon 


2283 


556019 


1 875. B02.gz43_23 8255 


F 


M00057014B:A02 


UC2-PrimColon 


2284 


732119 


1875.F02.gz43_238259 


F 


M00057018C:F02 


UC2-PrimColon 


2285 


411128 


1875.K02.gz43_238264 


F 


M00057025C:A08 


UC2-PrimColon 


2286 


732441 


1875.A03.gz43_238270 


F 


M00057012D:G03 


UC2-PrimColon 


2287 


478448 


1 875 .B03.gz43_23 8271 


F 


M00057014B:B01 


UC2-PrimColon 


2288 


733723 


1 875 .D03.gz43_23 8273 


F 


M000570I6B:A09 


UC2-PrimColon 


2289 


726173 


1875.103.gz43_238278 


F 


M00057023A:H09 


UC2-PrimColon 


2290 


451351 


1 875. B04.gz43_23 8287 


F 


M00057014B:B06 


UC2-PrimColon 


2291 


570573 


1875.E04.gz43_238290 


F 


M00057017A:F11 


UC2-PrimColon 


2292 


655312 


1 875 .K04.gz43_23 8296 


F 


M00057025C:D11 


UC2-PrimColon 


2293 


732047 


1875. COS .gz43_23 8304 


F 


M00057015A:E02 


UC2-PrimCoIon 


2294 


734091 


1875.F05.gz43_238307 


F 


M00057018D:B10 


UC2-PrimColon 


2295 


538582 


1875.E06.gz43_238322 


F 


M00057017A:G04 


UC2-PrimColon 


2296 


487183 


1875.C07.gz43_238336 


F 


M00057015A:G06 


UC2-PrimColon 


2297 


483919 


1875.G07.gz43_238340 


F 


M00057020D:A05 


UC2-PrimColon 


2298 


651049 


1 875. L07.gz43_23 8345 


F 


M00057027B.B11 


UC2-PrimColon 


2299 


734392 


1875.D08.gz43_238353 


F 


M00057016B:E06 


UC2-PrimColon 


2300 


557896 


1875.F08.gz43_238355 


F 


M00057018D:E05 


UC2-PrimColon 


2301 


89082 


1875.L08.gz43_238361 


F 


M00057027B:E04 


UC2-PrimColon 


2302 


512863 


1875.P08.gz43_238365 


F 


M00057032A:C01 


UC2-PrimColon 


2303 


474009 


1875.D09.gz43_238369 


F 


M00057016B:H08 


UC2-PrimColon 


2304 


729731 


1875.H09.gz43_238373 


F 


M00057022B:A04 


UC2-PrimColon 


2305 


481581 


1875.A10.gz43_238382 


F 


M00057013B:H07 


UC2-PrimColon 


2306 


732550 


1875.C10.gz43_238384 


F 


M00057015A:H12 


UC2-PrimColon 


2307 


735396 


1875.J10.gz43_238391 


F 


M00057024D:D12 


UC2-PrimColon 


2308 


642693 


1875.P10.gz43_238397 


F 


M00057032A:F12 


UC2-PrimColon 


2309 


733970 


1875Dll.gz43_238401 


F 


M00057016C:B05 


UC2-PrimColon 


2310 


639629 


1875 Hll.gz43_238405 


F 


M00057022B:F03 


UC2-PrimColon 
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Table 2 



ID JND 


> CLUSTER 


SEQ NAME 


ORIE1S 
T 


f 

CLONE ID 


LIBRARY 


2311 


731577 


1875.Lll.gz43 238409 


F 


M00057027B:F06 


UC2-PrimColon 


2312 


733354 


1875.B12.gz43 238415 


F 


M00057014C:E01 


UC2-PrimColon 


2313 


626791 


1875.F12.gz43 238419 


F 


M00057019A:G04 


UC2-PrimColon 


2314 


559699 


1875.H12.gz43 238421 


F 


M00057022B:H04 


UC2-PrimCoIon 


2315 


726208 


1875.A13.gz43 238430 


F 


M00057013C:E09 


UC2-PrimColon 


2316 


730828 


1875.B13.gz43 238431 


F 


M00057014C:E03 


UC2-PrimColon 


2317 


390968 


1875.I13.gz43 238438 


F 


M00057023D:D05 


UC2-PrimColon 


2318 


471522 


1875.013.gz43 238444 


F 


M00057031A:H02 


UC2-PrimColon 


2319 


731838 


1875.D14.gz43 238449 


F 


M00057016C:E04 


UC2-PrimColon 


2320 


726476 


1875.H14.gz43 238453 


F 


M00057022C:D12 


UC2-PrimColon 


2321 


466920 


1875.I14.gz43 238454 


F 


M00057023D:D08 


UC2-PrimColon 


2322 


732712 


1875.N14.gz43 238459 


F 


M00057029D:A06 


UC2-PrimColon 


2323 


561338 


1875.014.gz43 238460 


F 


M00057031B:A01 


UC2-PrimColon 


2324 


733756 


1875.A15.gz43 238462 


F 


M00057013C:H01 


UC2-PrimColon 


2325 


484023 


1875.C15.gz43 238464 


F 


M00057015B:E01 


UC2-PrimColon 


2326 


557262 


1875.D15.gz43 238465 


F 


M00057016C:E11 


UC2-PrimColon 


2327 


559096 


1875.E15.gz43 238466 


F 


M00057017C:E03 


UC2-PrimColon 


2328 


455148 


1875.F15.gz43 238467 


F 


M00057019C:C08 


UC2-PrimColon 


2329 


616985 


1875.I15.gz43 238470 


F 


M00057023D:E11 


UC2-PrimColon 


2330 


733479 


1875.N15.gz43 238475 


F 


M00057029D:F01 


UC2-PrimColon 


2331 


558134 


1875.F16.gz43 238483 


F 


M00057019C.E03 


UC2-PrimColon 


2332 


703298 


1875.D17.gz43 238497 


F 


M00057016D:B07 


UC2-PrimColon 


2333 


477797 


1875.G18.gz43 238516 


F 


M00057021B:B07 


UC2-PrimColon 


2334 


432159 


1875.L18.gz43 238521 


F 


M00057027D:A12 


UC2-PrimColon 


2335 


736385 


1875.P18.gz43 238525 


F 


M00057032D:A04 


UC2-PrimColon 


2336 


619265 


1875.B19.gz43 238527 


F 


M00057014D:E05 


UC2-PrimColon 


2337 


4255 


1875.F19.gz43 238531 


F 


M00057019D:C02 


UC2-PrimColon 


2338 


554080 


1875.M19.gz43 238538 


F 


M00057028D:D09 


UC2-PrimColon 


2339 


734074 


1875.C21.gz43 238560 


F 


M00057016A:B04 


UC2-PrimColon 


2340 


734177 


1875.E21.gz43 238562 


F 


M00057018A:C05 


UC2-PrimColon 


2341 


727761 


1875.I21.gz43 238566 


F 


M00057024A:D08 


UC2-PrimColon 


2342 


733209 


1875.L21.gz43 238569 


F 


M00057027D.D07 


UC2-PrimColon 


2343 


512377 


1875.A22.gz43 238574 


F 


M00057014A:B11 


UC2-PrimColon 


2344 


733664 


1875.B22.gz43 238575 


F 


M00057014D:G10 


UC2-PrimColon 


2345 


640904 


1875.D22.gz43 238577 


F 


M00057016D:H03 


UC2-PrimColon 


2346 


425203 


1875.N22.gz43 238587 


F 


M00057030B:B03 


UC2-PrimColon 


2347 


557164 


1875.B23.gz43 238591 


F 


M00057014D:H01 


UC2-PrimColon 


2348 


494450 


1875.C23.gz43 238592 


F 


M00057016A:G01 


UC2-PrimColon 


2349 


639427 


1875.F23.gz43 238595 


F 


M00057020A:F09 


UC2-PrimColon 


2350 


554693 


1875.H23.gz43 238597 


F 


M00057022D:E10 


UC2-PrimColon 


2351 


732598 


1875.K23.gz43 238600 


F 


M00057026D:A05 


UC2-PrimColon 


2352 


495241 


1875.M23.gz43 238602 


F 


M00057029A:C08 


UC2-PrimColon 



128 



Table 2 



SEQ 
ID NO 


> CLUSTER 


SEQ NAME 


ORIEN 
T 


F 

CLONE ID 


LIBRARY 


2353 


735989 


1875.N23.gz43 238603 


F 


M00057030B:F01 


UC2-PrimColon 


2354 


726044 


1875.I24.gz43 238614 


F 


M00057024B.A11 


UC2-PrimColon 


2355 


728791 


1875.L24.gz43 238617 


F 


M00057027D:G03 


UC2-PrimColon 


2356 


562769 


1875.N24.gz43 238619 


F 


M00057030C.A05 


UC2-PrimColon 


2357 


479208 


1875.P24.gz43 238621 


F 


M00057033A:D08 


UC2-PrimColon 


2358 


4244 


1876.C03.gz43 238656 


F 


M00057035C:E10 


UC2-PrimColon 


2359 


732159 


1876.K03.gz43 238664 


F 


M00057046A:F02 


UC2-PrimCoIon 


2360 


473026 


1876.E04.gz43 238674 


F 


M00057038A:H07 


UC2-PrimColon 


2361 


736385 


1876.B05.gz43 238687 


F 


M00057034C:G12 


UC2-PrimCoIon 


2362 


734466 


1876.J05.gz43 238695 


F 


M00057045A:C04 


UC2-PrimColon 


2363 


668731 


1876.K05.gz43 238696 


F 


M00057046A:G05 


UC2-PrimColon 


2364 


736881 


1876.H06.gz43 238709 


F 


M00057042D:E06 


UC2-PrimColon 


2365 


466920 


1876.O06.gz43 238716 


F 


M00057051B:E09 


UC2-PrimColon 


2366 


728353 


1876.E07.gz43 238722 


F 


M00057038C:B06 


UC2-PrimColon 


2367 


732246 


1876.N07.gz43 238731 


F 


M00057049D.F12 


UC2-PrimColon 


2368 


468296 


1876.A08.gz43 238734 


F 


M00057033C:B03 


UC2-PrimColon 


2369 


454819 


1876.E09 : gz43 238754 


F 


M00057038C:G08 


UC2-PrimCoIon 


2370 


481293 


1876.M10.gz43 238778 


F 


M00057048C:E04 


UC2-PrimColon 


2371 


735815 


1876.Ell.gz43 238786 


F 


M00057038D:A12 


UC2-PrimColon 


2372 


732315 


1876.H12.gz43 238805 


F 


M00057043A:G07 


UC2-PrimColon 


2373 


558573 


1876.M12.gz43 238810 


F 


M00057048C:H11 


UC2-PrimCoIon 


2374 


492627 


1876.G13.gz43 238820 


F 


M00057041D:C08 


UC2-PrimColon 


2375 


732042 


1876.K14.gz43 238840 


F 


M00057046C:E05 


UC2-PrimColon 


2376 


737087 


1876.N14.gz43 238843 


F 


M00057050B:F06 


UC2-PrimColon 


2377 


456469 


1876.A16.gz43 238862 


F 


M00057033D:F10 


UC2-PrimColon 


2378 


735871 


1876.I16.gz43 238870 


F 


M00057044C:B05 


UC2-PrimCoIon 


2379 


736855 


1876.F18.gz43_238899 


F 


M00057040D.H04 


UC2-PrimColon 


2380 


732562 


1876.M20.gz43_238938 


F 


M00057048D:H10 


UC2-PrimColon 


2381 


727321 


1876.G22.gz43 238964 


F 


M00057042B:A10 


UC2-PrimColon 


2382 


638857 


1876.H22.gz43 238965 


F 


M00057043C:H11 


UC2-PrimColon 


2383 


735972 


1876.A23.gz43 238974 


F 


M00057034B:B01 


UC2-PrimColon 


2384 


735054 


1876.L23.gz43_238985 


F 


M00057047D:H04 


UC2-PrimColon 


2385 


554854 


1 885 .MO 1 .gz43_2390 1 8 


F 


M00057068D:F04 


UC2-PrimColon 


2386 


731262 


1885.O01.gz43 239020 


F 


M00057071A:A10 


UC2-PrimColon 


2387 


729981 


1885.C02.gz43_239024 


F 


M00057055C:F01 


UC2-PrimColon 


2388 


735292 


1885.F02.gz43_239027 


F 


M00057059D:H09 


UC2-PrimColon 


2389 


726699 


1885.L02.gz43_239033 


F 


M00057067D.H06 


UC2-PrimCoIon 


2390 


624133 


1885.O02.gz43 239036 


F 


M00057071A:B04 


UC2-PrimColon 


2391 


645262 


1885.A03.gz43_239038 


F 


M00057053A:A02 


UC2-PrimColon 


2392 


558549 


1885.K03.gz43 239048 


F 


M00057066D:B03 


UC2-PrimCoIon 


2393 


736093 


1885.L03.gz43_239049 


F 


M00057068A:C10 


UC2-PrimColon 


2394 


733573 


1885.G04.gz43_239060 


F 


M00057061B:H02 


UC2-PrimColon 
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SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2395 


481366 


1885.I04.gz43 239062 


F 


M00057064B:H10 


UC2-PrimColon 


2396 


556385 


1885.N04.gz43 239067 


F 


M00057070A:B07 


UC2-PrimColon 


2397 


559656 


1885.L05.gz43_239081 


F 


M00057068A:F05 


UC2-PrimColon 


2398 


446616 


1885.A06.gz43 239086 


F 


M00057053A:D11 


UC2-PrimColon 


2399 


728445 


1885.C06.gz43_239088 


F 


M00057055C:H07 


UC2-PrimColon 


2400 


732223 


1885.D06.gz43_239089 


F 


M00057056D:F11 


UC2-PrimColon 


2401 


595506 


1885 G06.gz43_239092 


F 


M00057061C:D04 


UC2-PrimColon 


2402 


627515 


1885.K06.gz43_239096 


F 


M00057066D.E01 


UC2-PrimColon 


2403 


730528 


1885.L06.gz43_239097 


F 


M00057068A:F07 


UC2-PrimColon 


2404 


451233 


1885 E07.gz43_239106 


F 


M00057058D:G08 


UC2-PrimColon 


2405 


728884 


1885.H07.gz43_239109 


F 


M00057063A:C08 


UC2-PrimColon 


2406 


541793 


1885.107.gz43_239110 


F 


M00057064C:F11 


UC2-PrimColon 


2407 


736415 


1885.L08.gz43_239129 


F 


M00057068A:G05 


UC2-PrimColon 


2408 


552641 


1885.F09.gz43_239139 


F 


M00057060B:D07 


UC2-PrimColon 


2409 


736156 


1885.G09.gz43_239140 


F 


M00057061D:D03 


UC2-PrimColon 


2410 


560581 


1885.M09.gz43_239146 


F 


M00057069A:F09 


UC2-PrimColon 


2411 


654723 


1885.D10.gz43_239153 


F 


M00057057A:G01 


UC2-PrimColon 


2412 


731449 


1885.E10.gz43_239154 


F 


M00057059A:B04 


UC2-PrimColon 


2413 


452623 


1885.G10.gz43_239156 


F 


M00057061D:F05 


UC2-PrimColon 


2414 


732072 


1885.M10.gz43_239162 


F 


M00057069A:H08 


UC2-PrimColon 


2415 


452224 


1885.P10.gz43_239165 


F 


M00057072C:A03 


UC2-PrimColon 


2416 


550637 


1885.Kll.gz43_239176 


F 


M00057067B:C11 


UC2-PrimColon 


2417 


453508 


1885.Mll.gz43_239178 


F 


M00057069B:A08 


UC2-PrimColon 


2418 


732872 


1885.A12.gz43_239182 


F 


M00057053C:B10 


UC2-PrimColon 


2419 


735283 


1885.P12.gz43_239197 


F 


M00057072C:A09 


UC2-PrimColon 


2420 


422590 


1885.D13.gz43_239201 


F 


M00057057B:E06 


UC2-PrimColon 


2421 


422242 


1885.J13.gz43_239207 


F 


M00057066A:A09 


UC2-PrimColon 


2422 


471982 


1885.D14.gz43_239217 


F " 


M00057057B:E07 


UC2-PrimColon 


2423 


736285 


1885.H14.gz43_239221 


F 


M00057063B:F06 


UC2-PrimColon 


2424 


734787 


1885.P14.gz43 239229 


F 


M00057072C.C02 


UC2-PrimColon 


2425 


733623 


1885.A15.gz43_239230 


F 


M00057053C:G04 


UC2-PrimColon 


2426 


736318 


1885.D15.gz43_239233 


F 


M00057057B:G03 


UC2-PrimColon 


2427 


677769 


1885.A16.gz43_239246 


F 


M00057053D:D01 


UC2-PrimColon 


2428 


726380 


1885.B16.gz43_239247 


F 


M00057055B:D07 


UC2-PrimColon 


2429 


134501 


1885.C16.gz43_239248 


F 


M00057056B:C06 


UC2-PrimColon 


2430 


731844 


1885.F16.gz43_239251 


F 


M00057060C:D05 


UC2-PrimColon 


2431 


730472 


1885.B17.gz43 239263 


F 


M00057055B:E10 


UC2-PrimColon 


2432 


455884 


1885.C17.gz43_239264 


F 


M00057056B:D05 


UC2-PrimColon 


2433 


669159 


1885.H17.gz43_239269 


F 


M00057063C:C04 


UC2-PrimColon 


2434 


736146 


1885.K17.gz43_239272 


F 


M00057067C:D04 


UC2-PrimColon 


2435 


729502 


1885A18.gz43_239278 


F 


M00057054A:B12 


UC2-PrimColon 


2436 


731830 


1885.C18.gz43_239280 


F 


M00057056B:D11 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2437 


706245 


1885N18.gz43_239291 


F 


M00057070C:H10 


UC2-PrimColon 


2438 


631526 


1885 P18.gz43_239293 


F 


M00057072C:H01 


UC2-PrimColon 


2439 


736113 


1885.A19.gz43_239294 


F 


M00057054A:D01 


UC2-PrimColon 


2440 


567005 


1885.B19.gz43_239295 


F 


M00057055B:F05 


UC2-PrimColon 


2441 


730128 


1885.K19.gz43 239304 


F 


M00057067C:H09 


UC2-PrimColon 


2442 


620462 


1885.E20.gz43_239314 


F 


M00057059C:H02 


UC2-PrimColon 


2443 


450551 


1885.F20.gz43_239315 


F 


M00057060D:C09 


UC2-PrimColon 


2444 


724773 


1885.L20.gz43 239321 


F 


M00057068D:B03 


UC2-PrimColon 


2445 


735028 


1885.N20.gz43_239323 


F 


M00057070D:B08 


UC2-PrimColon 


2446 


476947 


1885.E21.gz43_239330 


F 


M00057059D:A09 


UC2-PrimColon 


2447 


420958 


1885.121.gz43_239334 


F 


M00057065C:B07 


UC2-PrimColon 


2448 


447758 


1885 O21.gz43_239340 


F 


M00057071D:E01 


UC2-PrimColon 


2449 


447075 


1885.G22.gz43_239348 


F 


M00057062B:H04 


UC2-PrimColon 


2450 


455248 


1885.L22.gz43_239353 


F 


M00057068D:C09 


UC2-PrimColon 


2451 


732351 


1885.022.gz43_239356 


F 


M00057071D:G01 


UC2-PrimColon 


2452 


620159 


1885.P22.gz43_239357 


F 


M00057073A:B12 


UC2-PrimColon 


2453 


472704 


1885.K23.gz43_239368 


F 


M00057067D:F03 


UC2-PrimColon 


2454 


649349 


1885.N23.gz43_239371 


F 


M00057070D:G03 


UC2-PrimColon 


2455 


735216 


1885.F24.gz43 239379 


F 


M00057061A:F09 


UC2-PrimColon 


2456 


572273 


1885.J24.gz43_239383 


F 


M00057066C:B02 


UC2-PrimColon 


2457 


736210 


1885.L24.gz43_239385 


F 


M00057068D:E05 


UC2-PrimColon 


2458 


731358 


1886.M02.gz43_239418 


F 


M00057088B:E02 


UC2-PrimColon 


2459 


639629 


1886.D03.gz43_239425 


F 


M00057077B:B06 


UC2-PrimColon 


2460 


733910 


1886.L03.gz43_239433 


F 


M00057087B:A07 


UC2-PrimColon 


2461 


473588 


1886.N04.gz43_239451 


F 


M00057090A:B02 


UC2-PrimColon 


2462 


727407 


1886.N05.gz43_239467 


F 


M00057090A:C03 


UC2-PrimColon 


2463 


368965 


1886H07.gz43_239493 


F 


M00057082C:G03 


UC2-PrimColon 


2464 


734059 


1886.L13.gz43_239593 


F 


M00057087D:B04 


UC2-PrimColon 


2465 


419465 


1886P15.gz43_239629 


F 


M00057092C:A05 


UC2-PrimColon 


2466 


732487 


1886.A18.gz43_239662 


F 


M00057073D:H05 


UC2-PrimColon 


2467 


538582 


1886.M20.gz43_239706 


F 


M00057089B:D01 


UC2-PrimColon 


2468 


732131 


1886.O22.gz43_239740 


F 


M00057091D:F11 


UC2-PrimColon 


2469 


556212 


1886.E23.gz43_239746 


F 


M00057079A:F05 


UC2-PrimColon 


2470 


731966 


1886M24.gz43_239770 


F 


M00057089D:E03 


UC2-PrimColon 


2471 


732873 


1887.C02.gz43_239809 


F 


M00057095B:G03 


UC2-PrimColon 


2472 


732550 


1887.G02.gz43_239813 


F 


M00057099D:D11 


UC2-PrimColon 


2473 


731477 


1887.D04.gz43_239842 


F 


M00057096C:B08 


UC2-PrimColon 


2474 


485441 


1887.104.gz43_239847 


F 


M00057102A:F12 


UC2-PrimColon 


2475 


533689 


1887.B05.gz43_239856 


F 


M00057094B:D07 


UC2-PrimColon 


2476 


735515 


1887.H05.gz43_239862 


F 


M00057100D:E09 


UC2-PrimColon 


2477 


678846 


1887.M06.gz43_239883 


F 


M00057106A:H04 


UC2-PrimColon 


2478 


638908 


1887.P06.gz43_239886 


F 


M00057110C:A04 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2479 


516799 


1887.L07.gz43_239898 


F 


M00057105C:C06 


UC2-PrimColon 


2480 


553898 


1887.A08.gz43 239903 


F 


M00057093C:A02 


UC2-PrimColon 


2481 


644354 


1887.F09.gz43_239924 


F 


M00057099A:C04 


UC2-PrimColon 


2482 


731697 


1 887.K09.gz43_239929 


F 


M00057104D:A02 


UC2-PrimColon 


2483 


645139 


1887.F10.gz43_239940 


F 


M00057099A:H12 


UC2-PrimColon 


2484 


732885 


1887.O10.gz43_239949 


F 


M00057109A:F11 


UC2-PrimColon 


2485 


736634 


1887.All.gz43_239951 


F 


M00057093C:E01 


UC2-PrimColon 


2486 


457842 


1887.Dll.gz43_239954 


F 


M00057096D:H05 


UC2-PrimColon 


2487 


654475 


1887.Fll.gz43_239956 


F 


M00057099B:A06 


UC2-PrimColon 


2488 


543772 


1 887.1 12.gz43_239975 


F 


M00057102B:E12 


UC2-PrimColon 


2489 


734344 


1887.L12.gz43_239978 


F 


M00057105C:G02 


UC2-PrimColon 


2490 


' 693869 


1887.N12.gz43_239980 


F 


M00057108B:A12 


UC2-PrimColon 


2491 


733348 


1 887.0 12.gz43_239981 


F 


M00057109A:H09 


UC2-PrimColon 


2492 


733209 


1887.L14.gz43_240010 


F 


M00057105D:C10 


UC2-PrimColon 


2493 


401368 


1887.F15.gz43_240020 


F 


M00057099B:D02 


UC2-PrimColon 


2494 


452276 


1887.117.gz43_240055 


F 


M00057102D:C11 


UC2-PrimColon 


2495 


736220 


1887.P17.gz43_240062 


F 


M00057110D:E12 


UC2-PrimColon 


2496 


735584 


1887.G18.gz43_240069 


F 


M00057100C:F02 


UC2-PrimColon 


2497 


448325 


1887J19.gz43_240088 


F 


M00057104A:H06 


UC2-PrimColon 


2498 


732312 


1 887.0 1 9.gz43_240093 


F 


M00057109C:D10 


UC2-PrimColon 


2499 


477387 


1887.C20.gz43_240097 


F 


M00057096B:C05 


UC2-PrimColon 


2500 


729222 


1887.D20.gz43_240098 


F 


M00057097B:E02 


UC2-PrimColon 


2501 


733972 


1887.F20.gz43_240100 


F 


M00057099C:A06 


UC2-PrimColon 


2502 


735113 


1887.B21.gz43_240112 


F 


M00057095A:F11 


UC2-PrimColon 


2503 


650920 


1887.121.gz43_240119 


F 


M00057103A:F11 


UC2-PrimColon 


2504 


729560 


1887O24.gz43_240173 


F 


M00057110A:A03 


UC2-PrimColon 


2505 


415825 


1888.A01.gz43_240175 


F 


M00057112A:C12 


UC2-PrimColon 


2506 


733225 


1888D01.gz43_240178 


F 


M00057115D:C10 


UC2-PrimColon 


2507 


732113 


1888.D02.gz43_240194 


F 


M00057115D:D06 


UC2-PrimColon 


2508 


568632 


1888.L03.gz43_240218 


F 


M00057126D:A04 


UC2-PrimColon 


2509 


736539 


1888J04.gz43_240232 


F 


M00057124A:G08 


UC2-PrimColon 


2510 


602673 


1888.D05.gz43_240242 


F 


M00057115D:F06 


UC2-PrimColon 


2511 


553264 


1888.F06.gz43_240260 


F 


M00057118B:E10 


UC2-PrimColon 


2512 


555655 


1888.G06.gz43_240261 


F 


M00057120A:D01 


UC2-PrimColon 


2513 


736534 


1888.K06.gz43_240265 


F 


M00057125C:B11 


UC2-PrimColon 


2514 


451764 


1888.O06.gz43_240269 


F 


M00057129D:F03 


UC2-PrimColon 


2515 


732896 


1888.D07.gz43_240274 


F 


M00057116A:B06 


UC2-PrimColon 


2516 


733365 


1 888.107 gz43_240279 


F 


M00057122D:A01 


UC2-PrimColon 


2517 


736548 


1888.O08.gz43_240301 


F 


M00057130A:A02 


UC2-PrimColon 


2518 


553338 


1888.H09.gz43_240310 


F 


M00057121B:H10 


UC2-PrimColon 


2519 


105056 


1888.K10.gz43_240329 


F 


M00057125C:E02 


UC2-PrimColon 


2520 


288535 


1888.O10.gz43_240333 


F 


M00057130A:A11 


UC2-PrimColon 
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Table 2 





CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2521 


453733 


1888.G12.gz43_240357 


F 


M00057120C:B09 


UC2-PrimColon 


2522 


552783 


1888.H12.gz43 240358 


F 


M00057121C:E08 


UC2-PrimColon 


2523 


732937 


1888.112.gz43_240359 


F 


M00057123A:F09 


UC2-PrimColon 


2524 


542490 


1888.K12.gz43_240361 


F 


M00057125D:B02 


UC2-PrimColon 


2525 


553087 


1888.L12.gz43_240362 


F 


M00057127A:E10 


UC2-PrimColon 


2526 


609459 


1888.G14.gz43 240389 


F 


M00057120C:F08 


UC2-PrimColon 


2527 


736810 


1888.O14.gz43_240397 


F 


M00057130A:D10 


UC2-PrimColon 


2528 


735959 


1888.L15.gz43_240410 


F 


M00057127A:H10 


UC2-PrimColon 


2529 


560868 


1888.H17.gz43_240438 


F 


M00057122A:A07 


UC2-PrimColon 


2530 


726176 


1888P17.gz43_240446 


F 


M00057131D:D02 


UC2-PrimColon 


2531 


724952 


1888.A18.gz43_240447 


F 


M00057112D:G08 


UC2-PrimColon 


2532 


1030 


1888H18.gz43_240454 


F 


M00057122A:C04 


UC2-PrimColon 


2533 


454961 


1888.L18.gz43 240458 


F 


M00057127B:D10 


UC2-PrimColon 


2534 


643517 


1888.C19.gz43_240465 


F 


M00057115C:B11 


UC2-PrimColon 


2535 


733006 


1888.H19.gz43_240470 


F 


M00057122A:C11 


UC2-PrimColon 


2536 


733171 


1888.A20.gz43_240479 


F 


M00057113A:A08 


UC2-PrimColon 


2537 


735018 


1888.B20.gz43 240480 


F 


M00057114C:E07 


UC2-PrimColon 


2538 


728479 


1888.B21.gz43_240496 


F 


M00057114C:F08 


UC2-PrimColon 


2539 


726575 


1888J21.gz43_240504 


F 


M00057125A:A07 


UC2-PrimColon 


2540 


734606 


1888.122.gz43_240519 


F 


M00057123D:F07 


UC2-PrimColon 


2541 


732476 


1888.D23.gz43_240530 


F 


M00057116C:H09 


UC2-PrimColon 


2542 


656263 


1888.H23.gz43_240534 


F 


M00057122B:F04 


UC2-PrimColon 


2543 


662617 


1888.K23.gz43_240537 


F 


M00057126C:C05 


UC2-PrimColon 


2544 


474108 


1 888. D24.gz43 240546 


F 


M00057116D:B10 


UC2-PrimColon 


2545 


455581 


1888.J24.gz43_240552 


F 


M00057125A:F07 


UC2-PrimColon 


2546 


447692 


1888 L24.gz43_240554 


F 


M00057127C:F03 


UC2-PrimColon 


2547 


735514 


1888.M24.gz43_240555 


F 


M00057128C:H02 


UC2-PrimColon 


2548 


440284 


1888.N24.gz43_240556 


F 


M00057129D:D07 


UC2-PrimColon 


2549 


639644 


1897.A02.gz43_240575 


F 


M00057132B:H05 


UC2-PrimColon 


2550 


568031 


1897.G02.gz43_240581 


F 


M00057139A:G08 


UC2-PrimColon 


2551 


555960 


1897.L02.gz43_240586 


F 


M00057144D.D03 


UC2-PrimColon 


2552 


462779 


1897.M02.gz43_240587 


F 


M00057145D:E01 


UC2-PrimColon 


2553 


472068 


1897.C03.gz43_240593 


F 


M00057134B:D06 


UC2-PrimColon 


2554 


728413 


1897.N03.gz43_240604 


F 


M00057147A:B07 


UC2-PrimColon 


2555 


519378 


1897.P03.gz43_240606 


F 


M00057149B:B07 


UC2-PrimColon 


2556 


733951 


1 897. A04.gz43_240607 


F 


M00057132C:C04 


UC2-PrimColon 


Zjj 1 


A A Ol 1 1 

448212 


1897 .P04 .gz43_240622 


F 


M00057149B:B08 


UC2-PrimColon 


2558 


536225 


1897.B05.gz43_240624 


F 


M00057133C:B02 


UC2-PrimColon 


2559 


736288 


1897.F05.gz43_240628 


F 


M00057138A:F02 


UC2-PrimColon 


2560 


225960 


1897.E06.gz43_240643 


F 


M00057137A:H12 


UC2-PrimColon 


2561 


655327 


1897.H08.gz43_240678 


F 


M00057140B:H01 


UC2-PrimColon 


2562 


735729 


1 897N08.gz43_240684 


F 


M00057147A:H07 


UC2-PrimColon 
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Table 2 



SEQ 
ID NU 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2563 


642273 


1897.F09.gz43 240692 


F 


M00057138B:B02 


UC2-PrimColon 


2564 


549285 


1897.O09.gz43 240701 


F 


M00057148C:B02 


UC2-PrimColon 


2565 


735756 


1897.Pll.gz43_240734 


F 


M00057149C:H01 


UC2-PrimColon 


2566 


640662 


1897.E12.gz43_240739 


F 


M00057137B:C07 


UC2-PrimColon 


2567 


543772 


1897.F12.gz43_240740 


F 


M00057138B:H02 


UC2-PrimColon 


2568 


735801 


1897.M12.gz43 240747 


F 


M00057146A:H05 


UC2-PrimColon 


2569 


457846 


1897.N12.gz43 240748 


F 


M00057147C:B01 


UC2-PrimColon 


2570 


736318 


1897.O12.gz43_240749 


F 


M00057148C:C09 


UC2-PrimColon 


2571 


658271 


1897.K13.gz43_240761 


F 


M00057144B.D04 


UC2-PrimColon 


2572 


735283 


1 8 97.M 1 3 .gz43_240763 


F 


M00057146B:C06 


UC2-PrimColon 


2573 


447550 


1897.A15.gz43 240783 


F 


M00057133A.B07 


UC2-PrimColon 


2574 


500758 


1 897.H1 5.gz43_240790 


F 


M00057140D:B03 


UC2-PrimColon 


2575 


567005 


1897.H17.gz43_240822 


F 


M00057140D:F02 


UC2-PrimColon 


2576 


523171 


1897.K17.gz43_240825 


F 


M00057144B:H10 


UC2-PrimColon 


2577 


397515 


1 897.M 1 7.gz43_240827 


F 


M00057146B:H12 


UC2-PrimColon 


2578 


736318 


1897.F18.gz43_240836 


F 


M00057138D:F03 


UC2-PrimColon 


2579 


449035 


1 897.1 18.gz43_240839 


F 


M00057142A:H07 


UC2-PrimColon 


2580 


486076 


1897.K18.gz43_240841 


F 


M00057144C:A02 


UC2-PrimColon 


2581 


729843 


1897.O18.gz43_240845 


F 


M00057149A:A04 


UC2-PrimColon 


2582 


726440 


1 897.F 1 9.gz43_240852 


F 


M00057138D:F10 


UC2-PrimColon 


2583 


731317 


1897.G19.gz43_240853 


F 


M00057139D:G07 


UC2-PrimColon 


2584 


559004 


1897.I22.gz43 240903 


F 


M00057142B:F06 


UC2-PrimColon 


2585 


448202 


1897.L22.gz43_240906 


F 


M00057145C:H03 


UC2-PrimColon 


2586 


718314 


1897.N22.gz43_240908 


F 


M00057147D:H09 


UC2-PrimColon 


2587 


735989 


1897.C23.gz43_240913 


F 


M00057135A:B02 


UC2-PrimColon 


2588 


489001 


1 897.F23 .gz43_2409 16 


F 


M00057139A:B10 


UC2-PrimColon 


2589 


462779 


1897.H23.gz43_240918 


F 


M00057141A:G06 


UC2-PrimColon 


2590 


596809 


1898.C02.gz43_242213 


F 


M00057153B:G07 


UC2-PrimColon 


2591 


447003 


1898.A03.gz43_242227 


F 


M00057150D:C04 


UC2-PrimColon 


2592 


730661 


1898.D04.gz43_242246 


F 


M00057155A:E11 


UC2-PrimColon 


2593 


461316 


1 898H04.gz43_242250 


F 


M00057160D:A11 


UC2-PrimColon 


2594 


450724 


1898.A05.gz43 242259 


F 


M00057150D:F08 


UC2-PrimColon 


2595 


480307 


1898.B05.gz43_242260 


F 


M00057152B:H02 


UC2-PrimColon 


2596 


727608 


1898.F05.gz43_242264 


F 


M00057157D:H08 


UC2-PrimColon 


2597 


649852 


1898.P05.gz43_242274 


F 


M00057169A:F05 


UC2-PrimColon 


2598 


650297 


1898A06.gz43_242275 


F 


M00057151A:B04 


UC2-PrimColon 


2599 


491933 


1 898.E06.gz43_242279 


F 


M00057156B:D10 


UC2-PrimColon 


2600 


733552 


1898.P06.gz43_242290 


F 


M00057169A.F06 


UC2-PrimColon 


2601 


730341 


1898.D07.gz43_242294 


F 


M00057155A:G11 


UC2-PrimColon 


2602 


640563 


1898.D08.gz43_242310 


F 


M00057I55A:H07 


UC2-PrimColon 


2603 


640603 


1898.B10.gz43_242340 


F 


M00057152C:C10 


UC2-PrimColon 


2604 


552972 


1898.E10.gz43 242343 


F 


M00057156C:E08 


UC2-PrimColon 
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Table 2 



crrv 




SEQ NAME 


ORIEN 

T 


CLONE ID 


LIBRARY 


2605 


736030 


1898.F10.gz43 242344 


F 


M00057158B:C02 


UC2-PrimColon 


2606 


736228 


1898.Gll.gz43 242361 


F 


M00057159C:E11 


UC2-PrimColon 


2607 


648820 


1898.Mll.gz43 242367 


F 


M00057165D:H05 


UC2-PrimColon 


2608 


735993 


1898.D12.gz43_242374 


F 


M00057155C:B07 


UC2-PrimColon 


2609 


402516 


1898.H12.gz43 242378 


F 


M00057161B:D02 


UC2-PrimColon 


2610 


583625 


1 898.0 1 2.gz43_242385 


F 


M00057168B:D02 


UC2-PrimColon 


2611 


726448 


1 898 .N 1 3 .gz43_242400 


F 


M00057167A:D08 


UC2-PrimColon 


2612 


735756 


1898.H14.gz43_242410 


F 


M00057161B:E07 


UC2-PrimColon 


2613 


551654 


1898.C15.gz43 242421 


F 


M00057153D:H01 


UC2-PrimColon 


2614 


558477 


1898.D15.gz43_242422 


F 


M00057155C:G04 


UC2-PrimColon 


2615 


728353 


1898.115.gz43_242427 


F 


M00057162B:E10 


UC2-PrimColon 


2616 


736634 


1898M15.gz43_242431 


F 


M00057166B:B07 


UC2-PrimColon 


2617 


737006 


1898.P15.gz43_242434 


F 


M00057169C:H10 


UC2-PrimColon 


2618 


736288 


1 898. J 1 6gz43_242444 


F 


M00057163C:D06 


UC2-PrimColon 


2619 


734794 


1898.P16.gz43_242450 


F 


M00057169C:H12 


UC2-PrimColon 


2620 


734928 


1898.E17.gz43_242455 


F 


M00057156D:F02 


UC2-PrimColon 


2621 


727976 


1898.F17.gz43 242456 


F 


M00057158C:C10 


UC2-PrimColon 


2622 


736790 


1898.F19.gz43_242488 


F 


M00057158C:G05 


UC2-PrimColon 


2623 


735633 


1 898.12 l.gz43_242523 


F 


M00057162C:C01 


UC2-PrimColon 


2624 


734484 


1898.B22.gz43_242532 


F 


M00057153B:A04 


UC2-PrimColon 


2625 


480142 


1 898.D22.gz43_242534 


F 


M00057155D:E12 


UC2-PrimColon 


2626 


454073 


1 898.G22.gz43_242537 


F 


M00057160B:G08 


UC2-PrimColon 


2627 


599714 


1 898.L22.gz43_242542 


F 


M00057165C:E08 


UC2-PrimColon 


2628 


737042 


1898.M22.gz43_242543 


F 


M00057166B:G06 


UC2-PrimColon 


2629 


405016 


1 898.N22.gz43_242544 


F 


M00057167C:A05 


UC2-PrimColon 


2630 


723915 


1898.L23.gz43_242558 


F 


M00057165C:F05 


UC2-PrimColon 


2631 


737006 


1898.023.gz43 242561 


F 


M00057168C:D03 


UC2-PrimCoIon 


2632 


734808 


1 898.B24.gz43_242564 


F 


M00057153B:D02 


UC2-PrimColon 


2633 


471887 


1898.P24.gz43 242578 


F 


M00057170A:D11 


UC2-PrimColon 


2634 


733972 


1 899.A0 1 .gz43_242579 


F 


M00057170A:D06 


UC2-PrimColon 


2635 


554221 


1899.J01.gz43 242588 


F 


M00057181C:D10 


UC2-PrimColon 


2636 


735426 


1 899.N0 1 gz43_242592 


F 


M00057192A:D12 


UC2-PrimColon 


2637 


454812 


1 899.P0 1 .gz43_242594 


F 


M00057194C:A03 


UC2-PrimColon 


2638 


562453 


1899.D02.gz43 242598 


F 


M00057173D:B12 


UC2-PrimColon 


2639 


554683 


1 899.M02.gz43_242607 


F 


M00057189D:G08 


UC2-PrimColon 


2640 


474346 


1 899.N02.gz43_242608 


F 


M00057192A:F01 


UC2-PrimColon 


Z041 




1899.D03.gz43 242614 


F 


M00057173D:C07 


UC2-PrimColon 


2642 


560183 


1899.J03.gz43_242620 


F 


M00057181C:G07 


UC2-PrimColon 


2643 


422590 


1899.D04.gz43_242630 


F 


M00057173D:E04 


UC2-PrimColon 


2644 


642936 


1899.K04.gz43_242637 


F 


M00057182C:C03 


UC2-PrimColon 


2645 


730106 


1 899.N04.gz43_242640 


F 


M00057192A:H05 


UC2-PrimColon 


2646 


449548 


1899.H05.gz43_242650 


F 


M00057179C:B05 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2647 


736861 


1899.P05.gz43 242658 


F 


M00057194C:F02 


UC2-PrimColon 


2648 


730220 


1899.A06.gz43_242659 


F 


M00057170A:H04 


UC2-PrimColon 


2649 


736309 


1899.106.gz43_242667 


F 


M00057180C:F09 


UC2-PrimColon 


2650 


373615 


1 899.K06.gz43_242669 


F 


M00057182C:C11 


UC2-PrimColon 


2651 


640635 


1 899.O06.gz43_242673 


F 


M00057193C:F11 


UC2-PrimColon 


2652 


735764 


1899.F07.gz43_242680 


F 


M00057176A:H10 


UC2-PrimColon 


2653 


735801 


1899.J07.gz43_242684 


F 


M00057181D:C09 


UC2-PrimColon 


2654 


477098 


1899.K07.gz43_242685 


F 


M00057182D:A08 


UC2-PrimColon 


2655 


734031 


1899.M07.gz43_242687 


F 


M00057191A:H09 


UC2-PrimColon 


2656 


419711 


1899.N07.gz43_242688 


F 


M00057192B:D10 


UC2-PrimColon 


2657 


390968 


1899.P07.gz43_242690 


F 


M00057194C:F10 


UC2-PrimColon 


2658 


732625 


1899.E08.gz43_242695 


F 


M00057175B:G12 


UC2-PrimColon 


2659 


462986 


1899.F08.gz43_242696 


F 


M00057176A:H12 


UC2-PrimColon 


2660 


482512 


1899.H08.gz43_242698 


F 


M00057179C:G10 


UC2-PrimColon 


2661 


473238 


1899.C09.gz43_242709 


F 


M00057172D:F08 


UC2-PrimColon 


2662 


735789 


1899I09.gz43_242715 


F 


M00057180D:C10 


UC2-PrimColon 


2663 


736595 


1899.K09.gz43_242717 


F 


M00057182D:B11 


UC2-PrimColon 


2664 


735514 


1899.M09.gz43_242719 


F 


M00057191B:C06 


UC2-PrimColon 


2665 


630516 


1899.O09.gz43_242721 


F 


M00057193D:F06 


UC2-PrimColon 


2666 


730899 


1 899.N 10gz43_242736 


F 


M00057192C:B11 


UC2-PrimColon 


2667 


552055 


1899.P10.gz43_242738 


F 


M00057194D:C12 


UC2-PrimColon 


2668 


528404 


1899.Cll.gz43_242741 


F 


M00057173A:C07 


UC2-PrimColon 


2669 


734209 


1899.Dll.gz43_242742 


F 


M00057174B:C06 


UC2-PrimColon 


2670 


452243 


1899.Jll.gz43_242748 


F 


M00057181D:H07 


UC2-PrimColon 


2671 


734061 


1899.01 l.gz43_242753 


F 


M00057194A:B07 


UC2-PrimColon 


2672 


625988 


1899.G12.gz43_242761 


F 


M00057177C:D07 


UC2-PrimColon 


2673 


561632 


1899.D13.gz43_242774 


F 


M00057174B:C11 


UC2-PrimColon 


2674 


731370 


1899.E13.gz43_242775 


F 


M00057175C:D02 


UC2-PrimColon 


2675 


480142 


1899.G13.gz43_242777 


F 


M00057177C:G11 


UC2-PrimColon 


2676 


736049 


1899.013.gz43_242785 


F 


M00057194A:C06 


UC2-PrimColon 


2677 


725691 


1899.A14.gz43_242787 


F 


M00057170C:C01 


UC2-PrimColon 


2678 


727523 


1899.M14.gz43_242799 


F 


M00057191C:D02 


UC2-PrimColon 


2679 


421794 


1899D15.gz43_242806 


F 


M00057174B:F11 


UC2-PrimColon 


2680 


732429 


1899:G15.gz43_242809 


F 


M00057177C:H04 


UC2-PrimColon 


2681 


530883 


1899.H15.gz43_242810 


F 


M00057180A:H11 


UC2-PrimColon 


2682 


729173 


1899.P15.gz43_242818 


F 


M00057196A:E03 


UC2-PrimColon 


2683 


734744 


1899.D16.gz43_242822 


F 


M00057174B:G03 


UC2-PrimColon 


2684 


735605 


1899.F16.gz43_242824 


F 


M00057176B:F10 


UC2-PrimColon 


2685 


726786 


1899.N16.gz43_242832 


F 


M00057192D:G02 


UC2-PrimColon 


2686 


725089 


1899.016.gz43_242833 


F 


M00057194A:F01 


UC2-PrimColon 


2687 


448770 


1899.D17.gz43_242838 


F 


M00057174B:G12 


UC2-PrimColon 


2688 


532904 


1899.E17.gz43_242839 


F 


M00057175D:A12 


UC2-PrimColon 
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Table 2 



TO NO 


/^T TTCTT7D 


bbQ NAME 


f\n IT7XT 

1 


CLONE ID 


LIBRARY 


2689 


560568 


1899.K17.gz43 242845 


F 


M00057186A:E12 


UC2-PrimColon 


2690 


736402 


1899.N17.gz43 242848 


F 


M00057192D:G04 


UC2-PrimColon 


2691 


516729 


1899.P17.gz43 242850 


F 


M00057196B:H10 


UC2-PrimColon 


2692 


718314 


1899.H18.gz43 242858 


F 


M00057180B:C06 


UC2-PrimColon 


2693 


737116 


1899J18.gz43_242860 


F 


M00057182A:H07 


UC2-PrimColon 


2694 


723985 


1899.B19.gz43_242868 


F 


M00057172A:B02 


UC2-PrimColon 


2695 


573733 


1899.F19.gz43_242872 


F 


M00057176C:H08 


UC2-PrimColon 


2696 


475872 


1899.H19.gz43J242874 


F 


M00057180B:D01 


UC2-PrimColon 


2697 


625988 


1899.F20.gz43 242888 


F 


M00057176D:A03 


UC2-PrimCoIon 


2698 


736354 


1 899.L20.gz43_242894 


F 


M00057189C:G11 


UC2-PrimColon 


2699 


535866 


1899.C21.gz43_242901 


F 


M00057173C:C07 


UC2-PrimColon 


2700 


648221 


1 899.D2 1 .gz43 J242902 


F 


M00057174C:G07 


UC2-PrimColon 


2701 


736280 


1 899.H2 1 .gz43_242906 


F 


M00057180B:F05 


UC2-PrimCoIon 


2702 


156329 


1899.N21.gz43 242912 


F 


M00057193B:C11 


UC2-PrimColon 


2703 


*m t ******* 

721768 


1 899 . 02 1 .gz43_2429 1 3 


F 


M00057194B:E05 


UC2-PrimColon 


2704 


639507 


1899.B22.gz43_242916 


F 


M00057172A:H06 


UC2-PrimColon 


2705 


451383 


1899.E22.gz43_242919 


F 


M00057175D:E12 


UC2-PrimColon 


2706 


349977 


1 899. J22.gz43_242924 


F 


M00057182B:D09 


UC2-PrimColon 


2707 


734928 


1899.M22.gz43_242927 


F 


M00057191D:G10 


UC2-PrimColon 


2708 


480623 


1 899. A23 .gz43_24293 1 


F 


M00057170D:B08 


UC2-PrimColon 


2709 


674526 


1 899.H23 .gz43_24293 8 


F 


M00057180B:G06 


UC2-PrimColon 


2710 


730805 


1899.M23.gz43_242943 


F 


M00057192A:B04 


UC2-PrimColon 


2711 


640350 


1900.G01.gz43_242969 


F 


M00057206A:C06 


UC2-PrimColon 


2712 


726449 


1900.K01.gz43 242973 


F 


M00057210A:C12 


UC2-PrimColon 


2713 


552457 


1900.E02.gz43_242983 


F 


M00057203B.F08 


UC2-PrimColon 


2714 


730308 


1900.C04.gz43_243013 


F 


M00057200D:E03 


UC2-PrimColon 


2715 


734993 


1900.E04.gz43_243015 


F 


M00057203C:A09 


UC2-PrimCoIon 


2716 


562320 


1900.H04.gz43_243018 


F 


M00057207B:F06 


UC2-PrimColon 


2717 


556656 


1900.O04.gz43_243025 


F 


M00057216D:D05 


UC2-PrimColon 


2718 


446752 


1900.D06.gz43_243046 


F 


M00057202A:D05 


UC2-PrimColon 


2719 


736738 


1900.L06.gz43 243054 


F 


M00057211C:C06 


UC2-PrimColon 


2720 


639395 


1900.O07.gz43_243073 


F 


M00057216D:F10 


UC2-PrimCoIon 


2721 


516729 


1900.B08.gz43_243076 


F 


M00057199B:B06 


UC2-PrimColon 


2722 


557262 


1900.E08.gz43_243079 


F 


M00057203D:A09 


UC2-PrimColon 


2723 


731531 


1900.108.gz43_243083 


F 


M00057208B:H08 


UC2-PrimCoIon 


2724 


736778 


1900.K08.gz43_243085 


F 


M00057210C:D09 


UC2-PrimColon 


Z / Zj 




i onn nin i o a ~j i i i 
iyUU.L»lU.gz4j_z4J Izl 


r 


M000572 17A:A12 


UC2-PnmColon 


2726 


483061 


1900.B12.gz43_243140 


F 


M00057199C:H08 


UC2-PrimColon 


2727 


725905 


1900.C12.gz43_243141 


F 


M00057201A:H03 


UC2-PrimColon 


2728 


648757 


1900.M12.gz43_243151 


F 


M00057213C:D06 


UC2-PrimColon 


2729 


735797 


1900.A14.gz43_243171 


F 


M00057197D:H10 


UC2-PrimColon 


2730 


732771 


1900.114.gz43_243179 


F 


M00057208C:E12 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2731 


736276 


1900.E15.gz43 243191 


F 


M00057204A:F11 


UC2-PrimColon 


2732 


449078 


1900.O19.gz43_243265 


F 


M00057217B:F12 


UC2-PrimColon 


2733 


161489 


1900.A20.gz43 243267 


F 


M00057198B:C02 


UC2-PrimColon 


2734 


734226 


1900F20.gz43_243272 


F 


M00057205D:G06 


UC2-PrimColon 


2735 


736500 


1900.G20.gz43 243273 


F 


M00057207A:A07 


UC2-PrimColon 


2736 


733272 


1900.K20.gz43_243277 


F 


M00057211A:D01 


UC2-PrimColon 


2737 


532904 


1900.B22.gz43_243300 


F 


M00057200B:D04 


UC2-PrimColon 


2738 


648379 


1900K22.gz43_243309 


F 


M00057211B:A08 


UC2-PrimColon 


2739 


649490 


1900.M22.gz43 243311 


F 


M00057214B:B12 


UC2-PrimColon 


2740 


186522 


1900.N22.gz43 243312 


F 


M00057216C:D12 


UC2-PrimColon 


2741 


561626 


1900.E24.gz43_243335 


F 


M00057204C:G06 


UC2-PrimColon 


2742 


653616 


1900.K24.gz43 243341 


F 


M00057211B:C09 


UC2-PrimColon 


2743 


734828 


1909G01.gz43_243353 


F 


M00057225C:H07 


UC2-PrimColon 


2744 


427113 


1909.H01.gz43_243354 


F 


M00057226C:E05 


UC2-PrimColon 


2745 


561877 


1909.K01.gz43 243357 


F 


M00057231B:A01 


UC2-PrimColon 


2746 


454563 


1909.M01.gz43_243359 


F 


M00057233B:G04 


UC2-PrimColon 


2747 


625810 


1909P01.gz43_243362 


F 


M00057236D:H09 


UC2-PrimColon 


2748 


475562 


1909F02.gz43_243368 


F 


M00057224C:B02 


UC2-PrimColon 


2749 


735749 


1909K02.gz43_243373 


F 


M00057231C:B04 


UC2-PrimColon 


2750 


289328 


1909,B03.gz43_243380 


F 


M00057219D:G11 


UC2-PrimColon 


2751 


465697 


1909.C03.gz43_243381 


F 


M00057220D:E06 


UC2-PrimColon 


2752 


726892 


1909E03.gz43_243383 


F 


M00057223B:G01 


UC2-PrimColon 


2753 


723959 


1909J03.gz43_243388 


F 


M00057230C:C05 


UC2-PrimColon 


2754 


733874 


1909N03.gz43_243392 


F 


M00057234D:A12 


UC2-PrimColon 


2755 


454129 


1909.A04.gz43 243395 


F 


M00057218C:F10 


UC2-PrimColon 


2756 


733868 


1909.B04.gz43_243396 


F 


M00057219D:H04 


UC2-PrimColon 


2757 


642940 


1909H04.gz43_243402 


F 


M00057226C:F12 


UC2-PrimColon 


2758 


398061 


1909.D05.gz43_243414 


F 


M00057222B.A06 


UC2-PrimColon 


2759 


734690 


1 909.G05 .gz43_2434 1 7 


F 


M00057225D:E01 


UC2-PrimColon 


2760 


601051 


1909.N05.gz43 243424 


F 


M00057234D:E04 


UC2-PrimColon 


2761 


727255 


1909.E06.gz43_243431 


F 


M00057223C:B01 


UC2-PrimColon 


2762 


632499 


1909I06.gz43_243435 


F 


M00057229A.B03 


UC2-PrimColon 


2763 


737010 


1909.K06.gz43_243437 


F 


M00057231C:E06 


UC2-PrimColon 


2764 


728121 


1909.L06.gz43_243438 


F 


M00057232B:G02 


UC2-PrimColon 


2765 


472811 


1909.B08.gz43_243460 


F 


M00057220A:C06 


UC2-PrimColon 


2766 


648140 


1909.H08.gz43_243466 


F 


M00057226C:H10 


UC2-PrimColon 


2767 


726251 


1909.A09.gz43_243475 


F 


M00057219A:D05 


UC2-PrimColon 


2768 


642631 


1909B09.gz43_243476 


F 


M00057220A:D09 


UC2-PrimColon 


2769 


598087 


1909.J09.gz43 243484 


F 


M00057230C:G12 


UC2-PrimColon 


2770 


398061 


1909.K09.gz43_243485 


F 


M00057231C:F12 


UC2-PrimColon 


2771 


463513 


1909.M09.gz43_243487 


F 


M00057233D:G12 


UC2-PrimColon 


2772 


81 


1909F10.gz43_243496 


F 


M00057225A:C08 


UC2-PrimColon 
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Table 2 





CLUSTER 


SEQ NAME 


ORIEN 
T 


r 

CLONE ID 


LIBRARY 


2773 


735050 


1909.H10.gz43 243498 


F 


M00057226D:B03 


UC2-PrimColon 


2774 


735140 


1909.K10.gz43 243501 


F 


M00057231C:G04 


UC2-PrimColon 


2775 


220107 


1909.O10.gz43 243505 


F 


M00057236A.F08 


UC2-PrimColon 


2776 


735817 


1909.P10.gz43 243506 


F 


M00057237B.D10 


UC2-PrimColon 


2777 


735514 


1909.Dll.gz43 243510 


F 


M00057222D:C10 


UC2-PrimColon 


2778 


602673 


1909.Fll.gz43 243512 


F 


M00057225A:E03 


UC2-PrimColon 


2779 


647815 


1909.11 l.gz43 243515 


F 


M00057229B.F11 


UC2-PrimColon 


2780 


402024 


1909.A12.gz43 243523 


F 


M00057219A:E11 


UC2-PrimColon 


2781 


631111 


1909.C12.gz43 243525 


F 


M00057221B:B01 


UC2-PrimCoIon 


2782 


451888 


1909.I12.gz43 243531 


F 


M00057229B:G11 


UC2-PrimColon 


2783 


456224 


1909.J12.gz43_243532 


F 


M00057230D:C05 


UC2-PrimColon 


2784 


555336 


1909.L12.gz43 243534 


F 


M00057232D:B03 


UC2-PrimColon 


2785 


736988 


1909.A13.gz43 243539 


F 


M00057219A:H11 


UC2-PrimColon 


2786 


613067 


1909.B13.gz43 243540 


F 


M00057220B:A06 


UC2-PrimColon 


2787 


736738 


1909.E13.gz43 243543 


F 


M00057223D:H03 


UC2-PrimColon 


2788 


646552 


1909.G13.gz43 243545 


F 


M00057226A:B04 


UC2-PrimColon 


2789 


725120 


1909.O13.gz43 243553 


F 


M00057236B:D11 


UC2-PrimColon 


2790 


591979 


1909.A14.gz43 243555 


F 


M00057219B:B10 


UC2-PrimColon 


2791 


731467 


1909.N14.gz43 243568 


F 


M00057235B:A07 


UC2-PrimColon 


2792 


474869 


1909.A15.gz43 243571 


F 


M00057219B:C06 


UC2-PrimColon 


2793 


735306 


1909.C15.gz43_243573 


F 


M00057221B:E11 


UC2-PrimColon 


2794 


732736 


1909.K15.gz43 243581 


F 


M00057231D:A05 


UC2-PrimColon 


2795 


724417 


1909.L15.gz43 243582 


F 


M00057233A:C04 


UC2-PrimColon 


2796 


454355 


1909.E16.gz43 243591 


F 


M00057224A:D07 


UC2-PrimColon 


2797 


732899 


1909.D17.gz43 243606 


F 


M00057223A:F06 


UC2-PrimColon 


2798 


733149 


1909.J17.gz43 243612 


F 


M00057231A:D04 


UC2-PrimColon 


2799 


735622 


1909.B18.gz43 243620 


F 


M00057220B:G10 


UC2-PrimColon 


2800 


453132 


1909.O19.gz43_243649 


F 


M00057236C:C07 


UC2-PrimColon 


2801 


724781 


1909.P19.gz43 243650 


F 


M00057237D.C11 


UC2-PrimColon 


2802 


556852 


1909.B20.gz43 243652 


F 


M00057220C:A08 


UC2-PrimColon 


2803 


732159 


1909.H20.gz43 243658 


F 


M00057227B:A05 


UC2-PrimColon 


2804 


734609 


1909.D21.gz43 243670 


F 


M00057223B:A07 


UC2-PrimColon 


2805 


616985 


1909.12 l.gz43 243675 


F 


M00057230B:B07 


UC2-PrimColon 


2806 


734151 


1909.N21.gz43 243680 


F 


M00057235C:C08 


UC2-PrimColon 


2807 


553850 


1909.C22.gz43 243685 


F 


M00057221C:E07 


UC2-PrimColon 


2808 


736894 


1909.D22.gz43_243686 


F 


M00057223B.B04 


UC2-PrimColon 


2809 


486051 


1909.K22.gz43 243693 


F 


M00057231D:F10 


UC2-PrimColon 


2810 


734553 


1909.N22.gz43 243696 


F 


M00057235C:F03 


UC2-PrimColon 


2811 


473578 


1909.P22.gz43 243698 


F 


M00057237D:D09 


UC2-PrimColon 


2812 


737010 


1909A23.gz43_243699 


F 


M00057219D:C02 


UC2-PrimColon 


2813 


735514 


1909.B23.gz43_243700 


F 


M00057220C:F08 


UC2-PrimColon 


2814 


418682 


1909.C23.gz43 243701 


F 


M00057221C:F02 


UC2-PrimColon 
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Table 2 



SEQ 
ID NO 




CCO XT A IV vTC 

oxivj JNAJVLb 


rvn TT7X1 

UKlxilx 

1 


CLONE ID 


LIBRARY 


2815 


734562 


1909.F23.gz43 243704 


F 


M00057225C:F09 


UC2-PrimColon 


2816 


664711 


1909.M23.gz43 243711 


F 


M00057234C:D11 


UC2-PrimColon 


2817 


727558 


1909.E24.gz43 243719 


F 


M00057224B:H02 


UC2-PrimColon 


ioio 

2818 


730028 


1910.C01.gz43 243733 


F 


M00057240A:F03 


UC2-PrimColon 


2819 


611604 


1910.E01.gz43 243735 


F 


M00057242C:G12 


UC2-PrimColon 


2820 


558617 


1910.H01.gz43 243738 


F 


M00057246A:G1 1 


UC2-PrimColon 


2821 


733144 


1910.J01.gz43 243740 


F 


M00057248B.D11 


UC2-PrimCoIon 


2822 


647242 


1910.M01.gz43 243743 


F 


M00057252B:E04 


UC2-PrimColon 




733552 


1910.F02.gz43 243752 


F 


M00057243C:D01 


UC2-PrimColon 


2824 


729428 


1910.N02.gz43 243760 


F 


M00057253C:A06 


UC2-PrimColon 


2823 


£. A 1 /TO A 

641680 


1910.A03.gz43 243763 


F 


M00057238A:D07 


UC2-PrimColon 


2826 


514838 


1910X03.gz43 243774 


F 


M00057251A:F02 


UC2-PrimColon 


2827 


735403 


1910.N03.gz43 243776 


F 


M00057253C:D11 


UC2-PrimColon 


2828 


732859 


1910.I04.gz43 243787 


F 


M00057247C:C11 


UC2-PrimColon 


2829 


730046 


1910.C06.gz43 243813 


F 


M00057240B:C01 


UC2-PrimColon 


2830 


650067 


1910.E06.gz43 243815 


F 


M00057242D:C07 


UC2-PrimColon 


2831 


734884 


1 9 1 0. F06 .gz43_243 8 1 6 


F 


M00057243C:H1 1 


UC2-PrimColon 


2832 


550730 


1910.I06.gz43 243819 


F 


M00057247C:F10 


UC2-PrimColon 


^ o o o 

2833 


449042 


1910.K06.gz43 243821 


F 


M00057249DH09 


UC2-PrimColon 


2834 


647704 


1910.B07.gz43 243828 


F 


M00057239B:F05 


UC2-PrimColon 


2835 


555641 


1910.G07.gz43 243833 


F 


M00057245A:F03 


UC2-PrimColon 


2836 


734226 


1 9 1 0.D08 .gz43 243846 


F 


M00057241D:C04 


UC2-PrimColon 


2837 


551437 


1910.A09.gz43_243859 


F 


M00057238B:F05 


UC2-PrimColon 


2838 


639494 


1910.M09.gz43 243871 


F 


M00057252D:B10 


UC2-PrimColon 


2839 


661194 


1910.P09.gz43 243874 


F 


M00057256B:A05 


UC2-PrimColon 


2840 


734466 


1910.B10.gz43_243876 


F 


M00057239C:E05 


UC2-PrimColon 


2841 


734371 


1910.I10.gz43 243883 


F 


M00057247D:D10 


UC2-PrimColon 


2842 


731392 


1910.Kll.gz43 243901 


F 


M00057250B:A03 


UC2-PrimColon 


2843 


484964 


1910.G12.gz43 243913 


F 


M00057245B:E02 


UC2-PrimColon 


2844 


733991 


1910.H12.gz43 243914 


F 


M00057246C:B12 


UC2-PrimColon 


2845 


585976 


1910.M12.gz43_243919 


F 


M00057252D:D04 


UC2-PrimColon 


2846 


729125 


1910.B13.gz43_243924 


F 


M00057239C:GO7 


UC2-PrimColon 


2847 


733855 


1910.C13.gz43 243925 


F 


M00057240C:A06 


UC2-PrimColon 


2848 


733945 


1910.D14.gz43_243942 


F 


M00057241D:G01 


UC2-PrimColon 


2849 


555103 


1 9 1 0 J 1 4 .gz43_243 948 


F 


M00057249A:C06 


UC2-PrimColon 


2850 


726011 


1 9 1 0 . K 1 4 .gz43_243 949 


F 


M00057250B:D04 


UC2-PrimColon 


2851 




iy 1 u.in iH-.gzno zhovjz 


r 


M00057254A:G07 


UC2-PnmColon 


2852 


457846 


1910.E15.gz43_243959 


F 


M00057243A:H03 


UC2-PrimCoIon 


2853 


456753 


1910.B16.gz43 243972 


F 


M00057239D:D01 


UC2-PrimColon 


2854 


731238 


1910.M17.gz43 243999 


F 


M00057253A:C05 


UC2-PrimColon 


2855 


558332 


1910.O17.gz43 244001 


F 


M00057255C:A07 


UC2-PrimCoIon 


2856 


649390 


1910.P17.gz43_244002 


F 


M00057256D:A11 


UC2-PrimColon 



140 



Table 2 



Tr> Tsjn 

i-L/ i> yj 


CLUSTER 


SEQ NAME 


ORIEN 
T 


CLONE ID 


LIBRARY 


2857 


734808 


1910.D19.gz43 244022 


F 


M00057242A:H11 


UC2-PrimColon 


2858 


731125 


1910.L19.gz43 244030 


F 


M00057252A:D10 


UC2-PrimColon 


2859 


476455 


1910.G20.gz43 244041 


F 


M00057245D:G02 


UC2-PrimColon 


2860 


735754 


1910.M20.gz43 244047 


F 


M00057253A:H10 


UC2-PrimColon 


2861 


735477 


1910.D21.gz43 244054 


F 


M00057242B:F07 


UC2-PrimColon 


2862 


734894 


1910.I21.gz43 244059 


F 


M00057248A:H10 


UC2-PrimColon 


2863 


473588 


1910.J21.gz43 244060 


F 


M00057249C:C07 


UC2-PrimColon 


2864 


735665 


1910.K21.gz43_244061 


F 


M00057250C:G02 


UC2-PrimColon 


2865 


725095 


1910.M22.gz43_244079 


F 


M00057253B:C06 


UC2-PrimColon 


2866 


726081 


1910.P23.gz43 244098 


F 


M00057257A:H10 


UC2-PrimColon 


2867 


455821 


1910.F24.gz43 244104 


F 


M00057244C:E06 


UC2-PrimColon 


2868 


449537 


1910.O24.gz43 244113 


F 


M00057255D:E02 


UC2-PrimCoIon 


2869 


514697 


1910.P24.gz43 244114 


F 


M00057257B:C11 


UC2-PrimColon 


2870 


586794 


1911I01.gz43_244123 


F 


M00057269D:F02 


UC2-PrimColon 


2871 


648710 


1911.M01.gz43 244127 


F 


M00057275B:B02 


UC2-PrimColon 


2872 


630348 


1911.C02.gz43_244133 


F 


M00057260A:E05 


UC2-PrimColon 


2873 


733570 


1911.F02.gz43 244136 


F 


M00057266D:B12 


UC2-PrimColon 


2874 


480723 


1911J02.gz43_244140 


F 


M00057271A:E04 


UC2-PrimColon 


2875 


728756 


1911.C03.gz43_244149 


F 


M00057260A:E11 


UC2-PrimColon 


2876 


735071 


1911.E04.gz43 244167 


F 


M00057265D:B12 


UC2-PrimColon 


2877 


630655 


1911.K04.gz43_244173 


F 


M00057273A:C08 


UC2-PrimColon 


2878 


736014 


1911.N04.gz43_244176 


F 


M00057279A:G02 


UC2-PrimColon 


2879 


42 


1911.P04.gz43_244178 


F 


M00057283A:E06 


UC2-PrimColon 


2880 


482145 


1911 .D07.gz43_2442 14 


F 


M00057262B:C03 


UC2-PrimColon 


2881 


735834 


1911.K10.gz43_244269 


F 


M00057273B:A12 


UC2-PrimColon 


2882 


42 


1911.P10.gz43 244274 


F 


M00057283B:D09 


UC2-PrimColon 


2883 


736014 


1911.Mll.gz43_244287 


F 


M00057277B:C09 


UC2-PrimColon 


2884 


450692 


19H.OH.gz43_244289 


F 


M00057281A:D08 


UC2-PrimColon 


2885 


4470 


1911.M12.gz43_244303 


F 


M00057277B:E10 


UC2-PrimColon 


2886 


456065 


1911.O12.gz43_244305 


F 


M00057281A:H02 


UC2-PrimColon 


2887 


482985 


1911.K15.gz43_244349 


F 


M00057273C:F11 


UC2-PrimColon 


2888 


652782 


1911.G18.gz43_244393 


F 


M00057268B:B03 


UC2-PrimColon 


2889 


725784 


1911.H18.gz43_244394 


F 


M00057269C:E01 


UC2-PrimColon 


2890 


729321 


1911.K19.gz43_244413 


F 


M00057273D:A06 


UC2-PrimColon 


2891 


732213 


1911.H22.gz43 244458 


F 


M00057269C:H06 


UC2-PrimColon 


2892 


420504 


1911.B23.gz43_244468 


F 


M00057259D:D11 


UC2-PrimColon 


2893 


735687 


1911 .F23 .gz43_244472 


F 


M00057267C:G09 


UC2-PrimColon 


2894 


728790 


1911.J23.gz43_244476 


F 


M00057272C:D08 


UC2-PrimColon 


2895 


734131 


1911.P23.gz43_244482 


F 


M00057283D:B12 


UC2-PrimColon 


2896 


725951 


1911.D24.gz43_244486 


F 


M00057265C:F03 


UC2-PrimColon 


2897 


554703 


1911.E24.gz43_244487 


F 


M00057266C:G12 


UC2-PrimColon 


2898 


546642 


1911.L24.gz43_244494 


F 


M00057275B:A12 


UC2-PrimColon 
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Table 2 



a by 

IU IN LI 


CLUSTER 


SEQ NAME 


ORIEN 
1 


CLONE ID 


LIBRARY 


2899 


730189 


1912.E03.gz43_244535 


F 


M00057291B:H08 


UC2-PrimColon 


2900 


728768 


1912.D08.gz43_244614 


F 


M00057290B:A02 


UC2-PrimColon 


2901 


420402 


1912.H10.gz43_244650 


F 


M00057299C:A08 


UC2-PrimColon 


2902 


737109 


1912.F12.gz43_244680 


F 


M00057293A:H03 


UC2-PrimColon 


2903 


424672 


1912.H15.gz43_244730 


F 


M00057299D:E04 


UC2-PrimColon 


2904 


558045 


1912.D16.gz43_244742 


F 


M00057290D:G03 


UC2-PrimColon 


2905 


733856 


1912.F16.gz43_244744 


F 


M00057293B:H04 


UC2-PrimColon 


2906 


736449 


1912.H17.gz43_244762 


F 


M00057300A:A08 


UC2-PrimColon 


2907 


148201 


1912.H18.gz43_244778 


F 


M00057300A:B06 


UC2-PrimColon 


2908 


639427 


1912E19.gz43_244791 


F 


M00057292A:B08 


UC2-PrimColon 


2909 


730187 


1912.B21.gz43_244820 


F 


M00057287A:H06 


UC2-PrimColon 


2910 


732300 


1912.D24.gz43_244870 


F 


M00057291B:D08 


UC2-PrimColon 


2911 


465528 


1921.A01.gz43_244883 


F 


M00057314A:A10 


UC2-PrimColon 


2912 


549552 


1921.J01.gz43_244892 


F 


M00057331D:D08 


UC2-PrimColon 


2913 


470199 


1921.E02.gz43_244903 


F 


M00057320A:G08 


UC2-PrimColon 


2914 


729125 


1921.F03.gz43_244920 


F 


M00057323B:G04 


UC2-PrimColon 


2915 


640341 


1921.C04.gz43_244933 


F 


M00057316D:D02 


UC2-PrimColon 


2916 


77737 


1921.F04.gz43_244936 


F 


M00057323B:H08 


UC2-PrimColon 


2917 


735469 


1921.G04.gz43_244937 


F 


M00057324B:E04 


UC2-PrimColon 


2918 


736733 


1921.B06.gz43_244964 


F 


M00057315D:C06 


UC2-PrimColon 


2919 


732969 


1921.106.gz43_244971 


F 


M00057328C:F11 


UC2-PrimColon 


2920 


635849 


1921J06.gz43_244972 


F 


M00057332A:C06 


UC2-PrimColon 


2921 


733146 


1921.O06.gz43_244977 


F 


M00057339D:H09 


UC2-PrimColon 


2922 


737114 


1921.A07.gz43_244979 


F 


M00057314B:H06 


UC2-PrimColon 


2923 


732114 


1921.L07.gz43_244990 


F 


M00057334D:E03 


UC2-PrimColon 


2924 


482788 


1921.P08.gz43_245010 


F 


M00057341D:B09 


UC2-PrimColon 


2925 


554647 


1921.M10.gz43_245039 


F 


M00057337C:G12 


UC2-PrimColon 


2926 


552623 


1921.Bll.gz43_245044 


F 


M00057316A:B04 


UC2-PrimColon 


2927 


447705 


1921.Ell.gz43_245047 


F 


M00057320D:C02 


UC2-PrimColon 


2928 


731748 


1921.Gll.gz43_245049 


F 


M00057324C:G05 


UC2-PrimColon 


2929 


727093 


1921.B12.gz43_245060 


F 


M00057316A:D09 


UC2-PrimColon 


2930 


551607 


1921.N12.gz43_245072 


F 


M00057339A:E08 


UC2-PrimColon 


2931 


727013 


1921.G13.gz43_245081 


F 


M00057324D:E10 


UC2-PrimColon 


2932 


727018 


1921.L13.gz43_245086 


F 


M00057336B:E01 


UC2-PrimColon 


2933 


548635 


1921.G14.gz43_245097 


F 


M00057324D:H03 


UC2-PrimColon 


2934 


726692 


1921.J14.gz43_245100 


F 


M00057332C:F12 


UC2-PrimColon 


2935 


556430 


192 1 .D 1 5.gz43_245 1 10 


F 


M00057319D:E06 


UC2-PrimColon 


2936 


726692 


1921.A16.gz43_245123 


F 


M00057314D:E09 


UC2-PrimColon 


2937 


473225 


1921.F16.gz43_245128 


F 


M00057323D:E10 


UC2-PrimColon 


2938 


892 


1921.K16.gz43_245133 


F 


M00057334A:C12 


UC2-PrimColon 


2939 


730997 


1921.C17.gz43_245141 


F 


M00057318B:H05 


UC2-PrimColon 


2940 


51616 


1921.E18.gz43_245159 


F 


M00057323A:F01 


UC2-PrimColon 
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Table 2 



1JJ 


CLUSTEF 


SEQNAME 


ORIE 

NT 


CLONE ID 


LIBRARY 


2941 


546642 


1921.F18.gz43_245160 


F 


M00057324A:A09 


UC2-PrimColon 


2942 


643005 


1921.G18.gz43_245161 


F 


M00057325B.D06 


UC2-PrimColon 


2943 


735054 


1921.H18.gz43 245162 


F 


M00057326D:B10 


UC2-PrimColon 


2944 


727231 


1921.F19.gz43_245176 


F 


M00057324A:C08 


UC2-PrimColon 


2945 


733464 


1921J19.gz43_245180 


F 


M00057333A:D08 


UC2-PrimColon 


2946 


457092 


1921. B20.gz43_245 188 


F 


M00057316C:A06 


UC2-PrimColon ' 


2947 


733723 


1921.H21.gz43_245210 


F 


M00057328A:F02 


UC2-PrimColon 


2948 


557177 


1921.121.gz43_245211 


F 


M00057331C:E10 


UC2-PrimColon 


2949 


734582 


1921.K21.gz43_245213 


F 


M00057334B:F01 


UC2-PrimColon 


2950 


448431 


1921.L21.gz43_245214 


F 


M00057336D:F07 


UC2-PrimColon 


2951 


697006 


1 92 1 .H22 .gz43_245 226 


F 


M00057328A:F10 


UC2-PrimColon 


2952 


452936 


1921.122.gz43_245227 


F 


M00057331C:F10 


UC2-PrimColon 


2953 


727132 


1921.L22.gz43_245230 


F 


M00057337A:A06 


UC2-PrimColon 


2954 


513306 


1921F23.gz43_245240 


F 


M00057324B:A06 


UC2-PrimColon 


2955 


642986 


1921.P24.gz43_245266 


F 


M00057344A:G07 


UC2-PrimColon 


2956 


496752 


1924.G03.gz43_245338 


F 


M00043308A:D09 


UC2-ColonMetLiver 


2957 


498509 


1924.H03.gz43_245339 


F 


M00043310A:F01 


UC2-ColonMetLiver 


2958 


450805 


1924.E04.gz43_245352 


F 


M00043304B:A10 


UC2-ColonMetLiver 


2959 


451456 


1924.G04.gz43_245354 


F 


M00043308A:F06 


UC2-ColonMetLiver 


2960 


494625 


1924.M04.gz43_245360 


F 


M00043317B:B12 


UC2-ColonMetLiver 


2961 


490401 


1924.P05.gz43 245379 


F 


M00043321B:E05 


UC2-ColonMetLiver 


2962 


495105 


1924.H08.gz43_245419 


F 


M00043310C:B04 


UC2-ColonMetLiver 


2963 


496957 


1 924.E09.gz43_245432 


F 


M00043304C:E01 


UC2-ColonMetLiver 


2964 


494306 


1924.110.gz43_245452 


F 


M00043312D:A02 


UC2-ColonMetLiver 


2965 


492544 


1924.Jll.gz43_245469 


F 


M00043313D:B04 


UC2-ColonMetLiver 


2966 


451081 


1924.Mll.gz43_245472 


F 


M00043317D:C02 


UC2-ColonMetLiver 


2967 


495951 


1924.K14.gz43_245518 


F 


M00043315C:D05 


UC2-ColonMetLiver 


2968 


450349 


1924.D15.gz43_245527 


F 


M00043303B:E1 1 


UC2-ColonMetLiver 


2969 


498951 


1924.L15.gz43_245535 


F 


M00043316C:F06 


UC2-ColonMetLiver 


2970 


450999 


1924.C17.gz43_245558 


F 


M00043301D:B12 


UC2-ColonMetLiver 


2971 


499693 


1924.G17.gz43_245562 


F 


M00043309A:H06 


UC2-ColonMetLiver 


2972 


498242 


1924.F18.gz43_245577 


F 


M00043307B:F11 


UC2-ColonMetLiver 


2973 


499700 


1924.H18.gz43_245579 


F 


M00043311B:H08 


UC2-ColonMetLiver 


2974 


479604 


1924.L19.gz43_245599 


F 


M00043316D:F04 


UC2-ColonMetLiver 


2975 


562516 


1924.N19.gz43_245601 


F 


M00043319A:D01 


UC2-ColonMetLiver 


2976 


553779 


1924.L21.gz43_245631 


F 


M00043316D:F09 


UC2-ColonMetLiver 


2977 


493746 


1924.E24.gz43_245672 


F 


M00043306B:A09 


UC2-ColonMetLiver 


2978 


494325 


1924.H24.gz43_245675 


F 


M00043312B:A10 


UC2-ColonMetLiver 


2979 


453078 


1933.H03.gz43_245723 


F 


M00043340B:B04 


UC2-ColonMetLiver 


2980 


492887 


1933.J04.gz43_245741 


F 


M00043343D:H03 


UC2-ColonMetLiver 


2981 


450283 


1933.F05.gz43_245753 


F 


M00043335D:E02 


UC2-ColonMetLiver 


2982 


494271 


1933.M05.gz43_245760 


F 


M00043351C:A07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEI 


* SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


2983 


448110 


1933.O05.gz43 245762 


F 


M00043354D:C01 


UC2-ColonMetLiver 


2984 


412621 


1933.D06.gz43 245767 


F 


M00043330D:G05 


UC2-ColonMetLiver 


2985 


492629 


1933.I06.gz43 245772 


F 


M00043342C:G02 


UC2-ColonMetLiver 


2986 


491212 


1933.E07.gz43 245784 


F 


M00043334A:F10 


UC2-ColonMetLiver 


2987 


447057 


1933.H09.gz43 245819 


F 


M00043340D:C03 


UC2-ColonMetLiver 


2988 


496909 


1933.C10.gz43 245830 


F 


M00043328A:E12 


UC2-ColonMetLiver 


2989 


491933 


1933.D10.gz43 245831 


F 


M00043331C:G01 


UC2-ColonMetLiver 


2990 


496870 


1933.N10.gz43 245841 


F 


M00043353D:E12 


UC2-ColonMetLiver 


2991 


500040 


1933.B12.gz43 245861 


F 


M00043326A:H06 


UC2-ColonMetLiver 


2992 


553594 


1933.012.gz43 245874 


F 


M00043354D:H08 


UC2-ColonMetLiver 


2993 


561712 


1933.N14.gz43 245905 


F 


M00043354A:E06 


UC2-ColonMetLiver 


2994 


450027 


1933.J15.gz43 245917 


F 


M00043344C:C11 


UC2-ColonMetLiver 


2995 


450129 


1933.K16.gz43 245934 


F 


M00043348A:D02 


UC2-ColonMetLiver 


2996 


492779 


1933.C17.gz43 245942 


F 


M00043329B:H07 


UC2-ColonMetLiver 


2997 


453068 


1933.C18.gz43 245958 


F 


M00043329C:D12 


UC2-ColonMetLiver 


2998 


450804 


1933.F18.gz43 245961 


F 


M00043336C:A04 


UC2-ColonMetLiver 


2999 


1015 


1933.N18.gz43_245969 


F 


M00043354B:C04 


UC2-ColonMetLiver 


3000 


561834 


1933.N20.gz43 246001 


F 


M00043354B:F12 


UC2-ColonMetLiver 


3001 


552687 


1933.O23.gz43_246050 


F 


M00043355B.B01 


UC2-ColonMetLiver 


3002 


517346 


1 933 . 024 .gz43_246066 


F 


M00043355B:D12 


UC2-ColonMetLiver 


3003 


491448 


1 935 .B05 gz43_246289 


F 


M00043392C:F02 


UC2-ColonMetLiver 


3004 


553890 


1935.K06.gz43 246314 


F 


M00043501A:D07 


UC2-ColonMetLiver 


3005 


635439 


1935.N06.gz43 246317 


F 


M00043504B:C02 


UC2-ColonMetLiver 


3006 


402070 


1935.I07.gz43 246328 


F 


M00043409C.C07 


UC2-ColonMetLiver 


3007 


448924 


1935.H08.gz43_246343 


F 


M00043406D:C04 


UC2-ColonMetLiver 


3008 


562886 


1935.O08.gz43_246350 


F 


M00043505B:G03 


UC2-ColonMetLiver 


3009 


558055 


1935.O09.gz43 246366 


F 


M00043505B:G07 


UC2-ColonMetLiver 


3010 


451456 


1935H15.gz43_246455 


F 


M00043407C:H08 


UC2-ColonMetLiver 


3011 


555399 


1935.N16.gz43 246477 


F 


M00043504C:E03 


UC2-ColonMetLiver 


3012 


498629 


1935.H17.gz43_246487 


F 


M00043407D:G06 


UC2-ColonMetLiver 


3013 


490805 


1935.E18.gz43_246500 


F 


M00043401A:E09 


UC2-ColonMetLiver 


3014 


450335 


1935.F18.gz43_246501 


F 


M00043403B.A12 


UC2-ColonMetLiver 


3015 


446964 


1935.N22.gz43 246573 


F 


M00043504D:G08 


UC2-ColonMetLiver 


3016 


522703 


1935.P24.gz43 246607 


F 


M00043506D:F06 


UC2-ColonMetLiver 


3017 


549853 


1936.K05.gz43_246682 


F 


M00054494A:H04 


UC2-ColonMetLiver 


3018 


560275 


1936.M05.gz43_246684 


F 


M00054497B:C10 


UC2-ColonMetLiver 


3019 


481057 


1936.P05.gz43 246687 


F 


M00054501C:E08 


UC2-ColonMetLiver 


3020 


414739 


1936.C08.gz43_246722 


F 


M00043508D:C01 


UC2-ColonMetLiver 


3021 


559720 


1936.O08.gz43_246734 


F 


M00054500A:F04 


UC2-ColonMetLiver 


3022 


554908 


1936.Mll.gz43 246780 


F 


M00054497D:A04 


UC2-ColonMetLiver 


3023 


449000 


1936.Nll.gz43_246781 


F 


M00054498D:F01 


UC2-ColonMetLiver 


3024 


556768 


1936.F13.gz43_246805 


F 


M00054488A:F01 


UC2-ColonMetLiver 
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Table 2 



ID 
NO 




W bbi^ JNAME 


OKIE 

IN I 


CLONE ID 


LIBRARY 


3025 


A A AAIA 

449978 


1936.I15.gz43 246840 


F 


M00054492A:D04 


UC2-ColonMetLiver 


3026 


489368 


1936.J16.gz43 246857 


F 


M00054493C:E04 


UC2-ColonMetLiver 


3027 


562823 


1936.B18.gz43 246881 


F 


M00043508B:G1 1 


UC2-ColonMetLiver 


302i» 


559574 


1936.E21.gz43 246932 


F 


M00054487C:A01 


UC2-ColonMetLiver 


3029 


552920 


1936.023.gz43 246974 


F 


M00054501A:E11 


UC2-ColonMetLiver 


3030 


552086 


1971.A01.gz43 246992 


F 


M00054717A:A03 


UC2-ColonMetLiver 


3031 


553123 


1971.D01.gz43 246995 


F 


M00054720C:A01 


UC2-ColonMetLiver 


3032 


561636 


1971.H01.gz43 246999 


F 


M00054724D:C05 


UC2-ColonMetLiver 


3033 


496772 


1971.K01.gz43 247002 


F 


M00054727D:C06 


UC2-ColonMetLiver 


3034 


549550 


1971.A03.gz43 247024 


F 


M00054717A:C07 


UC2-ColonMetLiver 


3035 


555837 


1971.B05.gz43 247057 


F 


M00054718B:D03 


UC2-ColonMetLiver 


3036 


557714 


1971.D05.gz43 247059 


F 


M00054720C:G10 


UC2-ColonMetLiver 


3037 


446999 


1971.E05.gz43 247060 


F 


M00054721C:D11 


UC2-ColonMetLiver 


3038 


450410 


1971.F05.gz43 247061 


F 


M00054722C:D01 


UC2-ColonMetLiver 


3039 


484043 


1971.L05.gz43 247067 


F 


M00054729B:A08 


UC2-ColonMetLiver 


3040 


452488 


1971.N05.gz43 247069 


F 


M00054731A:D07 


UC2-ColonMetLiver 


3041 


450278 


1971.A06.gz43 247072 


F 


M00054717A:G09 


UC2-ColonMetLiver 


3042 


561313 


1 97 1 .D06.gz43_247075 


F 


M00054720C:H01 


UC2-ColonMetLiver 


3043 


557426 


1971.L06.gz43_247083 


F 


M00054729B:B03 


UC2-ColonMetLiver 


3044 


553316 


1971.F09.gz43 247125 


F 


M00054722D:C08 


UC2-ColonMetLiver 


3045 


555818 


1971.I09.gz43 247128 


F 


M00054726A:D02 


UC2-ColonMetLiver 


3046 


551096 


1971.H10.gz43 247143 


F 


M00054725A:A05 


UC2-ColonMetLiver 


3047 


556654 


1971.P10.gz43 247151 


F 


M00054734B:C06 


UC2-ColonMetLiver 


3048 


555491 


1971.Hll.gz43 247159 


F 


M00054725A:F09 


UC2-ColonMetLiver 


3049 


559380 


1971.Mll.gz43 247164 


F 


M00054730B:F11 


UC2-ColonMetLiver 


3050 


86175 


1971.H12.gz43 247175 


F 


M00054725A:H06 


UC2-CoIonMetLiver 


3051 


561830 


197LL12 .gz43_247 1 79 


F 


M00054729C:C08 


UC2-ColonMetLiver 


3052 


421959 


1971.G14.gz43 247206 


F 


M00054724A:G02 


UC2-ColonMetLiver 


3053 


448453 


1971.N14.gz43 247213 


F 


M00054731B:D04 


UC2-ColonMetLiver 


one /i 

3054 


562263 


1971.L15.gz43 247227 


F 


M00054729C:G07 


UC2-ColonMetLiver 


3055 


553739 


1971.I16.gz43 247240 


F 


M00054726B:F08 


UC2-ColonMetLiver 


3056 


557747 


1971.I17.gz43 247256 


F 


M00054726B:F09 


UC2-ColonMetLiver 


3057 


14573 


1971.N17.gz43 247261 


F 


M00054731B:G02 


UC2-ColonMetLiver 


3058 


553108 


1971.P17.gz43 247263 


F 


M00054734C:A07 


UC2-ColonMetLiver 


3059 


554048 


1971.K18.gz43 247274 


F 


M00054728D:B07 


UC2-ColonMetLiver 


3060 


550315 


1971.L18.gz43 247275 


F 


M00054729D:D05 


UC2-ColonMetLiver 


3061 


44Q0^S 




r 


M00054726C:D 1 0 


UC2-ColonMetLiver 


3062 


524721 


1971. 122 .gz43_247336 


F 


M00054726C:E07 


UC2-ColonMetLiver 


3063 


555571 


1 97 1 . A24.gz43_247360 


F 


M000547l8A:Dll 


UC2-ColonMetLiver 


3064 


553869 


1971.N24.gz43 247373 


F 


M0005473lC:Hl2 


UC2-ColonMetLiver 


3065 


552686 


1971.P24.gz43 247375 


F 


M00054735A:G09 


UC2-ColonMetLiver 


3066 


424723 


1972.I03.gz43 247416 


F 


M00054743A:C07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTE1 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3067 


499517 


1972.P03.gz43 247423 


F 


M00054750D:H12 


UC2-Cr>1rmMetT iwr 


3068 


551441 


1972.L04.gz43 247435 


F 


M00054745D:G09 


UC2-Colr>nMptT iwr 


3069 


86145 


1972.P04.gz43 247439 


F 


M00054751A:A05 


UC2-ColnnMetT ivf»r 


3070 


556167 


1972.C05.gz43 247442 


F 


M00054737B:H1 1 


UC2-ColonMetT iver 


3071 


553834 


1972.J05.gz43 247449 


F 


M00054744A:G10 


UC2-ColonMetT ,i ver 


3072 


454186 


1972.C06.gz43 247458 


F 


M00054737C:B01 


UC 2 -Colon M etLi ver 


3073 


466235 


1972.E08.gz43 247492 


F 


M00054739C:B12 


UC2-ColonMetLiver 


3074 


562323 


1972.O09.gz43 247518 


F 


M00054750A:G10 


UC2 -ColonMetLi ver 


3075 


450142 


1972.E12.gz43 247556 


F 


M00054739C:E05 


UC2 -ColonMetLiver 


3076 


557568 


1972.012.gz43 247566 


F 


M00054750B:F02 


UC2-ColonMetLiver 


3077. 


522322 


1972.B13.gz43 247569 


F 


M00054736B:H03 


UC2-ColonMetLiver 


3078 


558679 


1972.D13.gz43 247571 


F 


M00054738D:F01 


UC2 -ColonMetLiver 


3079 


452257 


1972.I13.gz43 247576 


F 


M00054743C:E11 


UC2-ColonMetLiver 


3080 


523753 


1972.N13.gz43 247581 


F 


M00054748B:G10 


UC 2 -ColonMetLi ver 


3081 


59202 


1972.N14.gz43 247597 


F 


M00054748B:H09 


UC2-ColonlVIetLriver 


3082 


553702 


1972.P14.gz43 247599 


F 


M00054751B:F12 


UC2-CoIonMetLiver 


3083 


495832 


1972.D15.gz43 247603 


F 


M00054738D:G07 


UC2-ColonMetLiver 


3084 


493122 


1972.F15.gz43 247605 


F 


M00054740C:H08 


UC 2 -ColonMetLi ver 


3085 


493135 


1972.G15.gz43 247606 


F 


M00054741C:D08 


UC 2 -ColonMetLi ver 


3086 


562793 


1972K15.gz43_247610 


F 


M00054745C:B12 


UC 2 -ColonMetLi ver 


3087 


587854 


1972J16.gz43_247625 


F 


M00054744C:D02 


UC2-ColonMetLiver 


3088 


553457 


1972.018.gz43_247662 


F 


M00054750C:D01 


UC2-ColonMetLiver 


3089 


551778 


1972.G19.gz43 247670 


F 


M00054741D.C05 


UC2-ColonMetLiver 


3090 


555077 


1972.I19.gz43 247672 


F 


M00054743D:F04 


UC2-ColonMetLiver 


3091 


557871 


1972.K19.gz43 247674 


F 


M00054745C:H02 


UC2-ColonMetLiver 


3092 


560282 


1972.C21.gz43 247698 


F 


M00054738B:E12 


UC 2 -ColonMetLi ver 


3093 


503452 


1972.L22.gz43 247723 


F 


M00054746D:E05 


UC2-ColonMetLiver 


3094 


555773 


1972.D24.gz43 247747 


F 


M00054739A:F07 


UC 2 -ColonMetLi ver 


3095 


474309 


1972.M24.gz43 247756 


F 


M00054747D:B05 


UC 2 -ColonMetl >i ver 


3096 


556019 


1981.J01.gz43 247769 


F 


M00054764D:F01 


UC2-ColonMetLiver 


3097 


472196 


1981. 102 .gz43_247784 


F 


M00054763A:A10 


UC2-ColonMetLiver 


3098 


513632 


1981.D03.gz43 247795 


F 


M00054755D.E05 


UC2-ColonMetLiver 


3099 


553158 


1981.F03.gz43 247797 


F 


M00054759A:B08 


UC2-ColonMetLiver 


3100 


498662 


1981.L03.gz43 247803 


F 


M00054766C.B08 


UC2-ColonMetLiver 


3101 


551693 


1981.C04.gz43 247810 


F 


M00054754D:F11 


UC2-ColonMetLiver 


3102 


554021 


1981.I04.gz43 247816 


F 


M00054763C:D07 


UC2-ColonMetLiver 


3103 


550402 


1981.O04.gz43 247822 


F 


M00054769B:D12 


UC2-ColonMetLiver 


3104 


448510 


1981.P04.gz43 247823 


F 


M00054770B:D09 


UC2-ColonMetLiver 


3105 


558900 


1981.I05.gz43 247832 


F 


M00054763C:F10 


UC2-ColonMetLiver 


3106 


554294 


1981.K05.gz43 247834 


F 


M00054765D.D05 


UC2-ColonMetLiver 


3107 


450840 


1981.106.gz43_247848 


F 


M00054763C:H04 


UC2-ColonMetLiver 


3108 


561487 


1981.J07.gz43 247865 


F 


M00054765B.C03 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3109 


552541 


1981.L07.gz43_247867 


F 


M00054766C:E01 


UC2-ColonMetLiver 


3110 


528775 


1981.P07.gz43_247871 


F 


M00054770C:A04 


UC2-ColonMetLiver 


3111 


553918 


1981B08.gz43_247873 


F 


M00054753C:H02 


UC2-ColonMetLiver 


3112 


477046 


1981.B09.gz43_247889 


F 


M00054753D:A03 


UC2-ColonMetLiver 


3113 


551289 


1981.D09.gz43 247891 


F 


M00054756A:C12 


UC2-ColonMetLiver 


3114 


263800 


1981.G09.gz43 247894 


F 


M00054760D:B03 


UC2-ColonMetLiver 


3115 


495942 


1981.A10.gz43_247904 


F 


M00054752B:H06 


UC2-ColonMetLiver 


3116 


556183 


1981.H10.gz43 247911 


F 


M00054762A:D09 


UC2-ColonMetLiver 


3117 


448357 


1981.N10.gz43_247917 


F 


M00054768D:A01 


UC2-ColonMetLiver 


3118 


554161 


1981P10.gz43_247919 


F 


M00054770C:C04 


UC2-ColonMetLiver 


3119 


256179 


1981.Bll.gz43_247921 


F 


M00054753D:C12 


UC2-CoIonMetLiver 


3120 


555478 


1981.Pll.gz43_247935 


F 


M00054770C:D05 


UC2-ColonMetLiver 


3121 


554581 


1981P12.gz43_247951 


F 


M00054770C:F10 


UC2-ColonMetLiver 


3122 


553922 


1981B13.gz43_247953 


F 


M00054753D:H10 


UC2-ColonMetLiver 


3123 


554336 


1981.E13.gz43_247956 


F 


M00054758A:F03 


UC2-ColonMetLiver 


3124 


560529 


1981.F13.gz43_247957 


F 


M00054759C:G10 


UC2-ColonMetLiver 


3125 


558900 


1981H14.gz43_247975 


F 


M00054762B:F02 


UC2-ColonMetLiver 


3126 


447667 


1981.M14.gz43_247980 


F 


M00054767C:D03 


UC2-ColonMetLiver 


3127 


551288 


1981D15.gz43_247987 


F 


M00054756C:C08 


UC2-ColonMetLiver 


3128 


122169 


1981E15.gz43_247988 


F 


M00054758B:C10 


UC2-ColonMetLiver 


3129 


551617 


1981H15.gz43_247991 


F 


M00054762B:F07 


UC2-ColonMetLiver 


3130 


260558 


1981.L15.gz43_247995 


F 


M00054766D:H02 


UC2-ColonMetLiver 


3131 


558609 


1981.B16.gz43_248001 


F 


M00054754B:F04 


UC2-ColonMetLiver 


3132 


554352 


1981.E16.gz43_248004 


F 


M00054758B:D03 


UC2-ColonMetLiver 


3133 


556064 


1981.F17.gz43_248021 


F 


M00054759D:E01 


UC2-ColonMetLiver 


3134 


508126 


1981.K17.gz43_248026 


F 


M00054766A:H10 


UC2-ColonMetLiver 


3135 


549576 


1981.M17.gz43_248028 


F 


M00054767C:H06 


UC2-ColonMetLiver 


3136 


554000 


1981.A18.gz43_248032 


F 


M00054753A:A05 


UC2-ColonMetLiver 


3137 


554049 


1981.E18.gz43_248036 


F 


M00054758B:H03 


UC2-ColonMetLiver 


3138 


558503 


1981.H18.gz43 248039 


F 


M00054762C:A12 


UC2-ColonMetLiver 


3139 


451993 


1981.D19.gz43_248051 


F 


M00054756D:F05 


UC2-ColonMetLiver 


3140 


558949 


1981.O19.gz43_248062 


F 


M00054770A:C06 


UC2-ColonMetLiver 


3141 


553131 


1981.J20.gz43_248073 


F 


M00054765C:F10 


UC2-ColonMetLiver 


3142 


555343 


1981.D21.gz43_248083 


F 


M00054757A:H07 


UC2-ColonMetLiver 


3143 


512432 


1981.N21.gz43_248093 


F 


M00054769A:G10 


UC2-ColonMetLiver 


3144 


550580 


1981B22.gz43_248097 


F 


M00054754C:H09 


UC2-ColonMetLiver 


3145 


551117 


1981.F22.gz43_248101 


F 


M00054760A:A12 


UC2-ColonMetLiver 


3146 


556475 


1981.L22.gz43_248107 


F 


M00054767A.F08 


UC2-ColonMetLiver 


3147 


552549 


1981.M22.gz43_248108 


F 


M00054767D:G09 


UC2-ColonMetLiver 


3148 


470351 


1981.P23.gz43_248127 


F 


M00054771A:E01 


UC2-ColonMetLiver 


3149 


561535 


1981F24.gz43_248133 


F 


M00054760A:D10 


UC2-ColonMetLiver 


3150 


1093 


1981.G24.gz43_248134 


F 


M00054761C:E02 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3151 


2284 


1981.H24.gz43_248135 


F 


M00054762D:C06 


UC2-ColonMetLiver 


3152 


555958 


1982.F01.gz43_248149 


F 


M00054778A:D01 


UC2-ColonMetLiver 


3153 


492779 


1982.A02.gz43i248160 


F 


M00054771B:F12 


UC2-ColonMetLiver 


3154 


553848 


1982.P02.gz43_248175 


F 


M00054806A:G04 


UC2-ColonMetLiver 


3155 


555686 


1982.C03.gz43_248178 


F 


M00054774B:A07 


UC2-ColonMetLiver 


3156 


555524 


1982.H03.gz43_248183 


F 


M00054780B:B06 


UC2-ColonMetLiver 


3157 


551068 


1982.A04.gz43_248192 


F 


M00054771C:A11 


UC2-ColonMetLiver 


3158 


552183 


1982.O04.gz43_248206 


F 


M00054805A:H04 


UC2-ColonMetLiver 


3159 


551003 


1982.F05.gz43_248213 


F 


M00054778A:F08 


UC2-ColonMetLiver 


3160 


465207 


1982.G05.gz43_248214 


F 


M00054779B:A07 


UC2-ColonMetLiver 


3161 


533991 


1982.H05.gz43_248215 


F 


M00054780B:E03 


UC2-ColonMetLiver 


3162 


553868 


1982.K05.gz43_248218 


F 


M00054786D:F08 


UC2-ColonMetLiver 


3163 


553774 


1982B06.gz43_248225 


F 


M00054772D:G12 


UC2-ColonMetLiver 


3164 


554072 


1982C06.gz43_248226 


F 


M00054774B:G08 


UC2-ColonMetLiver 


3165 


554828 


1982.E06.gz43_248228 


F 


M00054776C:G06 


UC2-ColonMetLiver 


3166 


559872 


1982G06.gz43_248230 


F 


M00054779B:B07 


UC2-ColonMetLiver 


3167 


491570 


1982.H06.gz43_248231 


F 


M00054780B:G10 


UC2-ColonMetLiver 


3168 


275 


1982.M06.gz43_248236 


F 


M00054802C:A07 


UC2-ColonMetLiver 


3169 


552019 


1982.C07.gz43_248242 


F 


M00054774C:A03 


UC2-ColonMetLiver 


3170 


556183 


1982.G07.gz43_248246 


F 


M00054779B:B11 


UC2-ColonMetLiver 


3171 


552535 


1982.J07.gz43_248249 


F 


M00054784C:D11 


UC2-ColonMetLiver 


3172 


551718 


1982.A08.gz43_248256 


F 


M00054771D:G01 


UC2-ColonMetLiver 


3173 


551169 


1982.B08.gz43_248257 


F 


M00054773A:E09 


UC2-ColonMetLiver 


3174 


551080 


1982.F08.gz43_248261 


F 


M00054778B:A02 


UC2-ColonMetLiver 


3175 


517146 


1982.P08.gz43_248271 


F 


M00054806C:C12 


UC2-ColonMetLiver 


3176 


402147 


1982.B09.gz43_248273 


F 


M00054773A:H05 


UC2-ColonMetLiver 


3177 


557269 


1982.D09.gz43_248275 


F 


M00054776A:D10 


UC2-ColonMetLiver 


3178 


539353 


1982.N09.gz43_248285 


F 


M00054804A:H04 


UC2-ColonMetLiver 


3179 


560520 


1982.D10.gz43_248291 


F 


M00054776A:F01 


UC2-ColonMetLiver 


3180 


575924 


1982.110.gz43_248296 


F 


M00054782D:D12 


UC2-ColonMetLiver 


3181 


554582 


1982J10.gz43_248297 


F 


M00054784D:B01 


UC2-ColonMetLiver 


3182 


555696 


1982.M10.gz43_248300 


F 


M00054802D:A09 


UC2-ColonMetLiver 


3183 


556856 


1982.O10.gz43_248302 


F 


M00054805B:E06 


UC2-ColonMetLiver 


3184 


447926 


1982.Dll.gz43_248307 


F 


M00054776A:F07 


UC2-ColonMetLiver 


3185 


553743 


1982.Ell.gz43_248308 


F 


M00054776D:G09 


UC2-ColonMetLiver 


3186 


553979 


1982.Nll.gz43_248317 


F 


M00054804B:E07 


UC2-ColonMetLiver 


3187 


497912 


1982.01 l.gz43_248318 


F 


M00054805B:E11 


UC2-ColonMetLiver 


3188 


412621 


1982.D12.gz43_248323 


F 


M00054776A:G01 


UC2-ColonMetLiver 


3189 


497477 


1982.M12.gz43_248332 


F 


M00054802D:C02 


UC2-ColonMetLiver 


3190 


552188 


1982J13.gz43_248345 


F 


M00054785B:B07 


UC2-ColonMetLiver 


3191 


456052 


1982.M13.gz43_248348 


F 


M00054802D:C03 


UC2-ColonMetLiver 


3192 


143346 


1982.H14.gz43_248359 


F 


M00054780C:G11 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3193 


553237 


1982N14.gz43_248365 


F 


M00054804C:F04 


UC2-ColonMetLiver 


3194 


556856 


1982.014.gz43_248366 


F 


M00054805B:G02 


UC2-ColonMetLiver 


3195 


586600 


1982.A15.gz43_248368 


F 


M00054772B:B01 


UC2-ColonMetLiver 


3196 


562263 


1982J15.gz43_248377 


F 


M00054785C:C02 


UC2-ColonMetLiver 


3197 


555462 


1982.C16.gz43_248386 


F 


M00054775A:D07 


UC2-ColonMetLiver 


3198 


394168 


1982.016.gz43_248398 


F 


M00054805B:G10 


UC2-ColonMetLiver 


3199 


472672 


1982.A17.gz43_248400 


F 


M00054772B:D04 


UC2-ColonMetLiver 


3200 


595181 


1982.C17.gz43_248402 


F 


M00054775A:G03 


UC2-ColonMetLiver 


3201 


554627 


1982G17.gz43_248406 


F 


M00054779D:F07 


UC2-ColonMetLiver 


3202 


552325 


1982.117.gz43_248408 


F 


M00054783C:C03 


UC2-ColonMetLiver 


3203 


462511 


1982J17.gz43_248409 


F 


M00054785C:G1 1 


UC2-ColonMetLiver 


3204 


552813 


1982.K17.gz43_248410 


F 


M00054788C:G04 


UC2-ColonMetLiver 


3205 


175758 


1982.M17.gz43_248412 


F 


M00054803A:D08 


UC2-ColonMetLiver 


3206 


493085 


1982.A18.gz43_248416 


F 


M00054772B:F03 


UC2-ColonMetLiver 


3207 


557834 


1982.B18.gz43_248417 


F 


M00054773C:F09 


UC2-ColonMetLiver 


3208 


548998 


1982.F18.gz43_248421 


F 


M00054778C:F09 


UC2-ColonMetLiver 


3209 


551425 


1982.G18.gz43_248422 


F 


M00054779D:F08 


UC2-ColonMetLiver 


3210 


490414 


1982.K18.gz43_248426 


F 


M00054789A:A02 


UC2-ColonMetLiver 


3211 


551553 


1982.M18.gz43_248428 


F 


M00054803A:E10 


UC2-ColonMetLiver 


3212 


555571 


1982.N18.gz43_248429 


F 


M00054804D:D07 


UC2-ColonMetLiver 


3213 


448629 


1982.B19.gz43_248433 


F 


M00054773C:F10 


UC2-ColonMetLiver 


3214 


552005 


1982J19.gz43_248441 


F 


M00054785D:A07 


UC2-ColonMetLiver 


3215 


554116 


1982.M20.gz43_248460 


F 


M00054803B:B12 


UC2-ColonMetLiver 


3216 


505792 


1982.H21.gz43_248471 


F 


M00054781A:H09 


UC2-ColonMetLiver 


3217 


561422 


1982.J21.gz43_248473 


F 


M00054786A:G1 1 


UC2-ColonMetLiver 


3218 


551755 


1982.M24.gz43_248524 


F 


M00054803C:G01 


UC2-ColonMetLiver 


3219 


550063 


1945J03.gz43_248963 


F 


M00054513A:B08 


UC2-ColonMetLiver 


3220 


388055 


1945.D04.gz43_248973 


F 


M00054506B:A07 


UC2-ColonMetLiver 


3221 


572807 


1945.C05.gz43_248988 


F 


M00054505A:G12 


UC2-ColonMetLiver 


3222 


455814 


1945.J05.gz43_248995 


F 


M00054513A:F09 


UC2-ColonMetLiver 


3223 


452471 


1945.H07.gz43_249025 


F 


M00054510C:G07 


UC2-ColonMetLiver 


3224 


551518 


1945J07.gz43_249027 


F 


M00054513B:E11 


UC2-ColonMetLiver 


3225 


555660 


1945J09.gz43_249059 


F 


M00054513C:A01 


UC2-ColonMetLiver 


3226 


524721 


1945.Bll.gz43_249083 


F 


M00054504B:A04 


UC2-ColonMetLiver 


3227 


550986 


1945.Dll.gz43_249085 


F 


M00054506B:H01 


UC2-ColonMetLiver 


3228 


499696 


1945.Gll.gz43_249088 


F 


M00054509C:G01 


UC2-ColonMetLiver 


3229 


554611 


1945.Nll.gz43_249095 


F 


M00054519A:C04 


UC2-ColonMetLiver 


3230 


557681 


1945.F12.gz43_249103 


F 


M00054508B:E08 


UC2-ColonMetLiver 


3231 


550678 


1945J13.gz43_249123 


F 


M00054513D:F04 


UC2-ColonMetLiver 


3232 


554246 


1945.K13.gz43_249124 


F 


M00054515B:E11 


UC2-ColonMetLiver 


3233 


550129 


1945.F15.gz43_249151 


F 


M00054508C:B04 


UC2-ColonMetLiver 


3234 


555771 


1945.F17.gz43_249183 


F 


M00054508C:C08 


UC2-ColonMetLiver 



149 




Table 2 



SEQ 
ID 
NO 


CLUSTEI 


? SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3235 


572992 


1945.O18.gz43_249208 


F 


M00054521D:H01 


UC2-ColonMetLiver 


3236 


461313 


1945J21.gz43_249251 


F 


M00054514C:B01 


UC2-ColonMetLiver 


3237 


554456 


1945.L21.gz43 249253 


F 


M00054516D:F09 


UC2-ColonMetLiver 


3238 


547866 


1945.F23.gz43 249279 


F 


M00054509A:C01 


UC2-ColonMetLiver 


3239 


458974 


1945.P23.gz43_249289 


F 


M00054523C:A11 


UC2-ColonMetLiver 


3240 


554839 


1 945 .C24 .gz43_249292 


F 


M00054506A:D05 


UC2-ColonMetLiver 


3241 


487658 


1945.124.gz43_249298 


F 


M00054512D:H08 


UC2-ColonMetLiver 


3242 


556940 


1 946.H0 1 .gz43_2493 13 


F 


M00054533C:B04 


UC2-ColonMetLiver 


3243 


493575 


1946.N03.gz43_249351 


F 


M00054541C:C10 


UC2-ColonMetLiver 


3244 


466020 


1946.H05.gz43_249377 


F 


M00054533C:E07 


UC2-ColonMetLiver 


3245 


552006 


1946.C06.gz43_249388 


F 


M00054527B:A07 


UC2-ColonMetLiver 


3246 


551157 


1946.E06.gz43_249390 


F 


M00054529C:D11 


UC2-ColonMetLiver 


3247 


559574 


1946.G06.gz43 249392 


F 


M00054532D:A01 


UC2-ColonMetLiver 


3248 


556216 


1946.106.gz43_249394 


F 


M00054534D:D02 


UC2-ColonMetLiver 


3249 


559676 


1 946.B07.gz43_249403 


F 


M00054525D:H05 


UC2-ColonMetLiver 


3250 


550018 


1946.J07.gz43_249411 


F 


M00054536B:A03 


UC2-ColonMetLiver 


3251 


526984 


1 946.G08 .gz43_249424 


F 


M00054532D:E07 


UC2-ColonMetLiver 


3252 


481594 


1 946.N09.gz43_249447 


F 


M00054541C:F11 


UC2-ColonMetLiver 


3253 


550618 


1946.P09.gz43 249449 


F 


M00054544B:E03 


UC2-ColonMetLiver 


3254 


454336 


1946.A10.gz43_249450 


F 


M00054523D:G09 


UC2-ColonMetLiver 


3255 


550370 


1946.J10.gz43_249459 


F 


M00054536B:D02 


UC2-ColonMetLiver 


3256 


464154 


1 946.M 1 0 gz43_249462 


F 


M00054540A:H07 


UC2-ColonMetLiver 


3257 


550475 


1946.Bll.gz43_249467 


F 


M00054526A:E04 


UC2-ColonMetLiver 


3258 


584071 


1946.Fll.gz43_249471 


F 


M00054531D:F05 


UC2-ColonMetLiver 


3259 


550730 


1946.Jll.gz43_249475 


F 


M00054536B:F08 


UC2-ColonMetLiver 


3260 


552638 


1946.012.gz43_249496 


F 


M00054543B:E06 


UC2-ColonMetLiver 


3261 


560014 


1 946.L 1 3 .gz43_249509 


F 


M00054538D:H11 


UC2-ColonMetLiver 


3262 


553338 


1946.I14.gz43 249522 


F 


M00054535A:G03 


UC2-ColonMetLiver 


3263 


556809 


1946.B15.gz43 249531 


F 


M00054526B:C09 


UC2-ColonMetLiver 


3264 


498433 


1946H15.gz43_249537 


F 


M00054534A:B06 


UC2-ColonMetLiver 


3265 


455821 


1946.K16.gz43 249556 


F 


M00054538A:E10 


UC2-ColonMetLiver 


3266 


82864 


1946J17.gz43_249571 


F 


M00054536C:D12 


UC2-ColonMetLiver 


3267 


560519 


1946.L17.gz43 249573 


F 


M00054539B:D06 


UC2-ColonMetLiver 


3268 


409262 


1946.N17.gz43_249575 


F 


M00054542B:B01 


UC2-ColonMetLiver 


3269 


560717 


1946.P17.gz43 249577 


F 


M00054544C:F04 


UC2-CoIonMetLiver 


3270 


550172 


1946.K18.gz43 249588 


F 


M00054538B:B05 


UC2-ColonMetLiver 


3271 


552753 


1946.018.gz43 249592 


F 


M00054543C:F01 


UC2-ColonMetLiver 


3272 


549734 


1946.L19.gz43 249605 


F 


M00054539B:G03 


UC2-ColonMetLiver 


3273 


550714 


1946.G20.gz43 249616 


F 


M00054533B:F04 


UC2-CoIonMetLiver 


3274 


585380 


1946.D21.gz43 249629 


F 


M00054529A:H01 


UC2-ColonMetLiver 


3275 


550204 


1946.F21.gz43_249631 


F 


M00054532B:C05 


UC2-ColonMetLiver 


3276 


560205 


1946.I22.gz43 249650 


F 


M00054535B:H08 


UC2-ColonMetLiver 
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Table 2 



CT7H 

NO 




crA XT A IV /f 17 

aby JNAJVLb 


UKlli 
IN I 


CLONE ID 


LIBRARY 


3277 


312036 


1946.E23.gz43_249662 


F 


M00054530D:C10 


UC2-CoIonMetLiver 


3278 


553002 


1946.E24.gz43_249678 


F 


M00054530D:H07 


UC2-ColonMetLiver 


3279 


394567 


1948.001 .gz43_249704 


F 


M00054583D:E04 


UC2-ColonMetLiver 


3280 


468613 


1948.C02.gz43_249708 


F 


M00054569D:E01 


UC2-ColonMetLiver 


3281 


378610 


1948.102.gz43_249714 


F 


M00054576D:C07 


UC2-ColonMetLiver 


3282 


559752 


1 948 . M02 .gz43__2497 1 8 


F 


M00054581B:A01 


UC2-ColonMetLiver 


3283 


550804 


1 948 .K03 .gz43_249732 


F 


M00054579A:G10 


UC2-ColonMetLiver 


3284 


549956 


1948.003 .gz43_249736 


F 


M00054584A:A07 


UC2-ColonMetLiver 


3285 


556065 


1 948 . A04 .gz43_24973 8 


F 


M00054567C:B03 


UC2-CoIonMetLiver 


3286 


550397 


1948.M04.gz43__249750 


F 


M00054581B:D03 


UC2-ColonMetLiver 


3287 


550874 


1 948.M05 .gz43_249766 


F 


M00054581B:G10 


UC2-ColonMetLiver 


3288 


533812 


1 948.005 ,gz43_249768 


F 


M00054584A:B03 


UC2-ColonMetLiver 


3289 


549597 


1948.C08.gz43_249804 


F 


M00054570A:F02 


UC2-CoIonMetLiver 


3290 


553749 


1948.K08.gz43_249812 


F 


M00054579B:D10 


UC2-ColonMetLiver 


3291 


446814 


1 948 .G 1 0 .gz43_249840 


F 


M00054574D:F11 


UC2-ColonMetLiver 


3292 


555000 


1 948 .M 1 0.gz43_249846 


F 


M00054581D:C12 


UC2-ColonMetLiver 


3293 


554887 


1948.O10.gz43_249848 


F 


M00054584B:A03 


UC2-ColonMetLiver 


3294 


561876 


1948.Ell.gz43_249854 


F 


M00054572C:E09 


UC2-ColonMetLiver 


3295 


556488 


1948.M1 Lgz43_249862 


F 


M00054581D:D01 


UC2-ColonMetLiver 


3296 


560080 


1948.G12.gz43_249872 


F 


M00054575A:B06 


UC2-ColonMetLiver 


3297 


556336 


1 948 .M 1 2 .gz43_2498 78 


F 


M00054581D:E04 


UC2-ColonMetLiver 


3298 


558729 


1 948 . 1 1 3 .gz43_249890 


F 


M00054577B:A09 


UC2-ColonMetLiver 


3299 


549716 


1948.A14.gz43_249898 


F 


M00054568A:G02 


UC2-ColonMetLiver 


3300 


568467 


1948.M14.gz43_249910 


F 


M00054582A:A05 


UC2-ColonMetLiver 


3301 


550694 


1948.N14.gz43_249911 


F 


M00054583A:F05 


UC2-ColonMetLiver 


3302 


549994 


1 948 . M 1 5 .gz43_249926 


F 


M00054582A:A07 


UC2-ColonMetLiver 


3303 


549858 


1948.A16.gz43_249930 


F 


M00054568A:H03 


UC2-ColonMetLiver 


3304 


554793 


1 948 .A 1 7 .gz43_249946 


F 


M00054568B:A07 


UC2-ColonMetLiver 


3305 


466020 


1 948.0 1 7.gz43_249960 


F 


M00054584D:C01 


UC2-ColonMetLiver 


3306 


509202 


1948.119.gz43_249986 


F 


M00054577B:F01 


UC2-ColonMetLiver 


3307 


450963 


1948.A21.gz43_250010 


F 


M00054568C:D06 


UC2-ColonMetLiver 


3308 


451049 


1948.C2Lgz43_250012 


F 


M00054571A:B10 


UC2-ColonMetLiver 


3309 


528404 


1 948 .K2 1 .gz43_25 0020 


F 


M00054580A:B11 


UC2-ColonMetLiver 


3310 


160320 


1 948 .02 1 .gz43_25 0024 


F 


M00054585A:E07 


UC2-ColonMetLiver 


3311 


492139 


1948.K22.gz43_250036 


F 


M00054580A:C10 


UC2-ColonMetLiver 


3312 


557656 


1 948 . A23 .gz43_25 0042 


F 


M00054568C:G12 


UC2-ColonMetLiver 


i5YS 


570248 


1 948. K24.gz43_25 0068 


F 


M00054580A:D04 


UC2-ColonMetLiver 


3314 


561836 


1957.G01.gz43_250080 


F 


M00054594A:D04 


UC2-CoIonMetLiver 


3315 


558332 


1957J01.gz43_250083 


F 


M00054597A:C11 


UC2-ColonMetLiver 


3316 


558981 


1957.J02.gz43_250099 


F 


M00054597A:D08 


UC2-ColonMetLiver 


3317 


535129 


1957.B03.gz43_250107 


F 


M00054587C:G02 


UC2-ColonMetLiver 


3318 


549781 


1957.J03.gz43_250115 


F 


M00054597A:G08 


UC2-ColonMetLiver 
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LxJ 

NO 


r~*j T TCTC1 


SEQ NAME 


OR1I 
NT 


CLONE ID 


LIBRARY 


3319 


555920 


1957.C04.gz43 250124 


F 


M00054589A:D09 


UC2-ColonMetLiver 


3320 


550044 


1957.E04.gz43 250126 


F 


M00054591C:A01 


UC2-ColonMetLiver 


3321 


447356 


1957.I04.gz43 250130 


F 


M00054596B:C01 


UC2-ColonMetLiver 


3322 


560751 


1957.A05.gz43 250138 


F 


M00054586C.H02 


UC2-ColonMetLiver 


3323 


550322 


1957.B05.gz43 250139 


F 


M00054587D:D04 


UC2-ColonMetLiver 


3324 


416884 


1957.N05.gz43 250151 


F 


M00054602A:E06 


UC2-ColonMetLiver 


3325 


559857 


1957.A06.gz43 250154 


F 


M00054586D:A03 


UC2-ColonMetLiver 


3326 


446164 


1957.C06.gz43 250156 


F 


M00054589B:A07 


UC2-ColonMetLiver 


3327 


550088 


1957.D06.gz43 250157 


F 


M00054590B:B11 


UC2-ColonMetLiver 


3328 


554764 


1957.E06.gz43 250158 


F 


M00054591C:H09 


UC2-ColonMetLiver 


3329 


552907 


1957.O07.gz43 250184 


F 


M00054603A.G06 


UC2-ColonMetLiver 


3330 


498827 


1957.D08.gz43 250189 


F 


M00054590C:A03 


UC2-ColonMetLiver 


3331 


498777 


1957.G08.gz43 250192 


F 


M00054594C:E06 


UC2-ColonMetLiver 


3332 


559883 


1957.L08.gz43 250197 


F 


M00054599D:B07 


UC2-ColonMetLiver 


3333 


554869 


1957.M08.gz43 250198 


F 


M00054600D:H07 


UC2-ColonMetLiver 


3334 


559419 


1957.A09.gz43 250202 


F 


M00054586D:G01 


UC2-ColonMetLiver 


3335 


473859 


1957.F09.gz43 250207 


F 


M00054593B:B03 


UC2-ColonMetLiver 


3336 


555949 


1957.M09.gz43 250214 


F 


M00054601A:E08 


UC2-ColonMetLiver 


3337 


552913 


1957.O09.gz43 250216 


F 


M00054603A:G12 


UC2-ColonMetLiver 


3338 


551463 


1957.C10.gz43 250220 


F 


M00054589B.F05 


UC2-ColonMetLiver 


3339 


550855 


1957.Bll.gz43 250235 


F 


M00054588A:G08 


UC2-ColonMetLiver 


3340 


526733 


1957.OH.gz43 250248 


F 


M00054603B:B04 


UC2-ColonMetLiver 


3341 


557970 


1957.F12.gz43 250255 


F 


M00054593B:E03 


UC2-ColonMetLiver 


3342 


549665 


1957.H12.gz43 250257 


F 


M00054595C:G06 


UC2-ColonMetLiver 


3343 


446503 


1957.M12.gz43 250262 


F 


M00054601B:D08 


UC2-ColonMetLiver 


3344 


549599 


1957.P12.gz43 250265 


F 


M00054604C:F03 


UC2-ColonMetLiver 


3345 


558413 


1957.A13.gz43 250266 


F 


M00054587A:A08 


UC2-ColonMetLiver 


3346 


553331 


1957.B13.gz43 250267 


F 


M00054588B:B12 


UC2-ColonMetLiver 


3347 


549609 


1957.I14.gz43 250290 


F 


M00054596C.F09 


UC2-ColonMetLiver 


3348 


455298 


1957.I15.gz43 250306 


F 


M00054596C:F10 


UC2-ColonMetLiver 


3349 


563514 


1957.P15.gz43 250313 


F 


M00054604D:A04 


UC2-ColonMetLiver 


3350 


471181 


1957.A16.gz43 250314 


F 


M00054587A:F06 


UC2-ColonMetLiver 


3351 


487176 


1957.N16.gz43 250327 


F 


M00054602C:C12 


UC2-ColonMetLiver 


3352 


561130 


1957.117.gz43_250338 


F 


M00054596C:G09 


UC2-ColonMetLiver 


3353 


461718 


1957.J17.gz43_250339 


F 


M00054597D:E12 


UC2-ColonMetLiver 


3354 


559554 


1957.L17.gz43 250341 


F 


M00054600A:G12 


UC2-ColonMetLiver 


JJJJ 




IVD /.Alo.gz43_230346 


F 


M00054587A.F09 


UC2-ColonMetLiver 


3356 


551975 


1957.D18.gz43_250349 


F 


M00054590D:C12 


UC2-ColonMetLiver 


3357 


485029 


1957.H18.gz43 250353 


F 


M00054595D:A10 


UC2-ColonMetLiver 


3358. 


549816 


1957.L18.gz43 250357 


F 


M00054600A:H03 


UC2-ColonMetLiver 


3359 


408130 


1957.018.gz43 250360 


F 


M00054603C:G09 


UC2-CoIonMetLiver 


3360 


558927 


1957.P18.gz43 250361 


F 


M00054605A:A03 


UC2-ColonMetLiver 
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Table 2 



ID 
NO 


PI TTQTTM 
ULUo 1 HJ 


< bbQ NAME 


URIE 
IN 1 


CLONE ID 


LIBRARY 


3361 


550217 


1957.B19.gz43 250363 


F 


M00054588C:F04 


UC2-ColonMetLiver 


3362 


556376 


1957.C20.gz43 250380 


F 


M00054589D:C06 


UC2-ColonMetLiver 


3363 


549964 


1957.E20.gz43 250382 


F 


M00054592C:A05 


UC2-ColonMetLiver 


3364 


549228 


1957.K20.gz43 250388 


F 


M00054599B:C06 


UC2-ColonMetLiver 


3365 


459581 


1957.L20.gz43 250389 


F 


M00054600C:B10 


UC2-ColonMetLiver 


3366 


562722 


1957.N20.gz43 250391 


F 


M00054602D.A06 


UC2-ColonMetLiver 


3367 


552121 


1957.O20.gz43 250392 


F 


M00054603D:B02 


UC2-ColonMetLiver 


33611 


447311 


1957.C23.gz43 250428 


F 


M00054590A:A01 


UC2-ColonMetLiver 


3369 


467035 


1957.I23.gz43 250434 


F 


M00054597A:A07 


UC2-ColonMetLiver 


3370 


549611 


1957.K23.gz43 250436 


F 


M00054599B:F09 


UC2-ColonMetLiver 


337j 


549731 


1957.H24.gz43 250449 


F 


M00054596A:G1 1 


UC2-ColonMetLiver 


3372 


380127 


1957.P24.gz43 250457 


F 


M00054605B:F10 


UC2-ColonMetLiver 


3373 


405042 


1958.H01.gz43 250465 


F 


M00054614B:E01 


UC2-ColonMetLiver 


337^ 


552418 


1958J01.gz43 250467 


F 


M00054616D:C10 


UC2-ColonMetLiver 


3375 


558642 


1958.N01.gz43 250471 


F 


M00054621D:C06 


UC2-ColonMetLiver 


3376 


549649 


1958.A02.gz43 250474 


F 


M00054605B:G01 


UC2-ColonMetLiver 


3377 


557190 


1958.B02.gz43 250475 


F 


M00054606D:B05 


UC2-ColonMetLiver 


3378 


549320 


1958.N02.gz43 250487 


F 


M00054621D:D11 


UC2-ColonMetLiver 


3379 


549739 


1958J03.gz43 250499 


F 


M00054616D:G09 


UC2-ColonMetLiver 


3380 


561116 


1958.M03.gz43 250502 


F 


M00054620D:D06 


UC2-ColonMetLiver 


3381 


473111 


1958.M04.gz43__250518 


F 


M00054620D:D11 


UC2-ColonMetLiver 


3382 


558103 


1958.P04.gz43 250521 


F 


M00054625A:D07 


UC2-ColonMetLiver 


3383 


548864 


1958.A05.gz43 250522 


F 


M00054605C:A04 


UC2-CoIonMetLiver 


3384 


560862 


1958.B05.gz43 250523 


F 


M00054607A:B06 


UC2-ColonMetLiver 


3385 


557199 


1958.P05.gz43 250537 


F 


M00054625A:E05 


UC2-ColonMetLiver 


3386 


188753 


1958.A06.gz43 250538 


F 


M00054605C:D03 


UC2-ColonMetLiver 


3387 


550177 


1958.G06.gz43 250544 


F 


M00054613A:D07 


UC2-ColonMetLiver 


3388 


556896 


1 95 8 .M06.gz43_25 05 5 0 


F 


M00054620D:F11 


UC2-ColonMetLiver 


3389 


552823 


1958.B07.gz43 250555 


F 


M00054607A:G02 


UC2-ColonMetLiver 


3390 


395411 


1958.M07.gz43 250566 


F 


M00054620D:G11 


UC2-ColonMetLiver 


3391 


10397 


1958.N07.gz43 250567 


F 


M00054622A:D06 


UC2-ColonMetLiver 


3392 


551250 


1958.P07.gz43_250569 


F 


M00054625B:B02 


UC2-ColonMetLiver 


3393 


561892 


1958.C08.gz43 250572 


F 


M00054609A.F01 


UC2-ColonMetLiver 


3394 


549347 


1958.F08.gz43 250575 


F 


M00054612A:D12 


UC2-ColonMetLiver 


3395 


561068 


1 95 8 N08 .gz43_25 05 83 


F 


M00054622A:H01 


UC2-ColonMetLiver 


3396 


549912 


1958.C09.gz43_250588 


F 


M00054609A:H04 


UC2-ColonMetLiver 


3397 


5491 14 


loss Pin q'tA'x *)^c\£s\i 


r 


MUU054612B:B1 1 


UC2-CoIonMetLiver 


3398 


548965 


1958.J10.gz43_250611 


F 


M00054617B:A09 


UC2-ColonMetLiver 


3399 


491644 


1958.N10.gz43_250615 


F 


M00054622B:F05 


UC2-ColonMetLiver 


3400 


550018 


1958.Gll.gz43_250624 


F 


M00054613B;H04 


UC2-ColonMetLiver 


3401 


556530 


1958.C12.gz43 250636 


F 


M00054609B:E01 


UC2-ColonMetLiver 


3402 


562749 


1958.L12.gz43 250645 


F 


M00054619D:C10 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3403 


556308 


1958.N12.gz43_250647 


F 


M00054622B:H09 


UC2-ColonMetLiver 


3404 


551305 


1958.O12.gz43_250648 


F 


M00054623D:C12 


UC2-ColonMetLiver 


3405 


507349 


1958.C13.gz43_250652 


F 


M00054609B:H1 1 


UC2-ColonMetLiver 


3406 


549626 


1958.E13.gz43_250654 


F 


M00054611B:F12 


UC2-ColonMetLiver 


3407 


549388 


1958.J13.gz43_250659 


F 


M00054617B:D06 


UC2-ColonMetLiver 


3408 


478511 


1958.E14.gz43_250670 


F 


M00054611B:G09 


UC2-ColonMetLiver 


3409 


555371 


1958.K14.gz43_250676 


F 


M00054618C:H02 


UC2-ColonMetLiver 


3410 


553758 


1958I15.gz43_250690 


F 


M00054616A:H01 


UC2-ColonMetLiver 


3411 


553204 


1958.L15.gz43_250693 


F 


M00054620A:C09 


UC2-ColonMetLiver 


3412 


497477 


1958.H16.gz43_250705 


F 


M00054615B:E03 


UC2-ColonMetLiver 


3413 


481360 


1958.116.gz43_250706 


F 


M00054616A:H03 


UC2-ColonMetLiver 


3414 


552437 


1958.K16.gz43_250708 


F 


M00054618D:D04 


UC2-ColonMetLiver 


3415 


550164 


1958.0 16.gz43_2507 12 


F 


M00054624A:B11 


UC2-ColonMetLiver 


3416 


555660 


1958.E18.gz43_250734 


F 


M00054611C:E01 


UC2-ColonMetLiver 


3417 


562932 


1958.N18.gz43_250743 


F 


M00054622D:C02 


UC2-ColonMetLiver 


3418 


490393 


1958.B19.gz43_250747 


F 


M00054608B:D08 


UC2-ColonMetLiver 


3419 


490890 


1958K19.gz43_250756 


F 


M00054618D:E06 


UC2-ColonMetLiver 


3420 


451009 


1958.K20.gz43_250772 


F 


M00054618D:E11 


UC2-ColonMetLiver 


3421 


552194 


1958.L20.gz43_250773 


F 


M00054620B:B10 


UC2-ColonMetLiver 


3422 


451025 


1958.M20.gz43_250774 


F 


M00054621C:G03 


UC2-ColonMetLiver 


3423 


551463 


1958.N20.gz43_250775 


F 


M00054622D:D10 


UC2-ColonMetLiver 


3424 


553028 


1958.G21.gz43_250784 


F 


M00054613D:H09 


UC2-ColonMetLiver 


3425 


485504 


1958.N21.gz43_250791 


F 


M00054622D:F05 


UC2-ColonMetLiver 


3426 


562712 


1958.P21.gz43_250793 


F 


M00054625D:F06 


UC2-ColonMetLiver 


3427 


490152 


1958.C22.gz43_250796 


F 


M00054609D:E12 


UC2-ColonMetLiver 


3428 


552364 


1958.F22.gz43_250799 


F 


M00054612D:B12 


UC2-ColonMetLiver 


3429 


448276 


1958.L22.gz43_250805 


F 


M00054620C:C08 


UC2-ColonMetLiver 


3430 


556835 


1958.N22.gz43_250807 


F 


M00054622D:G11 


UC2-ColonMetLiver 


3431 


461734 


1958.P22.gz43_250809 


F 


M00054625D:H07 


UC2-ColonMetLiver 


3432 


557947 


1958.H23.gz43_250817 


F 


M00054615C:D09 


UC2-ColonMetLiver 


3433 


560491 


1958.L23.gz43_250821 


F 


M00054620C:C12 


UC2-ColonMetLiver 


3434 


554810 


1958.O23.gz43_250824 


F 


M00054624D:B06 


UC2-ColonMetLiver 


3435 


549829 


1958.C24.gz43_250828 


F 


M00054609D:H06 


UC2-ColonMetLiver 


3436 


518049 


1958.M24.gz43_250838 


F 


M00054621D:A10 


UC2-ColonMetLiver 


3437 


484669 


1 777.00 l.gz43_25 1861 


F 


M00042831D:G06 


UC2-PrimColon 


3438 


537506 


1777.C03.gz43_251881 


F 


M00042811C:B06 


UC2-PrimColon 


3439 


545162 


1777.F03.gz43_251884 


F 


M00042816B:C08 


UC2-PrimColon 


3440 


456793 


1777.Cll.gz43_252009 


F 


M00042812B:C01 


UC2-PrimColon 


3441 


448340 


1777.Ell.gz43_252011 


F 


M00042815C:C02 


UC2-PrimColon 


3442 


448793 


1777.E13.gz43_252043 


F 


M00042815C:F10 


UC2-PrimColon 


3443 


449718 


1777.H14.gz43_252062 


F 


M00042819C:H02 


UC2-PrimColon 


3444 


735534 


1777.E17.gz43_252107 


F 


M00042816A:C09 


UC2-PrimColon 
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Table 2 



ID 


LLUSTEJ 


< SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3445 


449258 


1777.K17.gz43 252113 


F 


M00042825A:B05 


UC2-PrimColon 


3446 


485020 


1777.018.gz43 252133 


F 


M00042833C:G05 


UC2-PrimColon 


3447 


649735 


1777.A23.gz43 252199 


F 


M00042809D:C12 


UC2-PrimColon 


3448 


498504 


1923.B01.gz43 252616 


F 


M00042352D:F11 


UC2-ColonMetLiver 


3449 


424723 


1923.C01.gz43 252617 


F 


M00042355A:H09 


UC2-ColonMetLiver 


3450 


451124 


1923.E01.gz43 252619 


F 


M00042444C:E02 


UC2-ColonMetLiver 


3451 


553312 


1923.O01.gz43 252629 


F 


M00042516D:H08 


UC2-ColonMetLiver 


3452 


257547 


1923.D02.gz43 252634 


F 


M00042442B:A07 


UC2-ColonMetLiver 


3453 


492292 


1923.J02.gz43 252640 


F 


M00042455B.G09 


UC2-ColonMetLiver 


3454 


638444 


1923.O02.gz43 252645 


F 


M00042516D:H09 


UC2-ColonMetLiver 


3455 


451009 


1923.C03.gz43 252649 


F 


M00042355B:B07 


UC2-ColonMetLiver 


3456 


451518 


1923. 103 .gz43 252655 


F 


M00042453C.D12 


UC2-ColonMetLiver 


3457 


450255 


1923.J03.gz43 252656 


F 


M00042455C:D11 


UC2-ColonMetLiver 


3458 


556804 


1923.M03.gz43 252659 


F 


M00042460B:D10 


UC2-ColonMetLiver 


3459 


497400 


1923 C04.gz43_252665 


F 


M00042355B:E10 


UC2-ColonMetLiver 


3460 


451618 


1923.H04.gz43 252670 


F 


M00042451D:H10 


UC2-ColonMetLiver 


3461 


490890 


1923.J04.gz43_252672 


F 


M00042455C:E04 


UC2-ColonMetLiver 


3462 


492982 


1923.P04.gz43 252678 


F 


M00042519D:H07 


UC2-ColonMetLiver 


3463 


498777 


1923.A05.gz43 252679 


F 


M00042351C:G01 


UC2-ColonMetLiver 


3464 


493575 


1923.B05.gz43 252680 


F 


M00042353B:A11 


UC2-ColonMetLiver 


3465 


556916 


1923.L05.gz43 252690 


F 


M00042459B.F03 


UC2-ColonMetLiver 


3466 


551755 


1923.N05.gz43_252692 


F 


M00042460D:H06 


UC2-ColonMetLiver 


3467 


495074 


1923.O05.gz43_252693 


F 


M00042517C:B04 


UC2-ColonMetLiver 


3468 


487745 


1923.E06.gz43 252699 


F 


M00042445A:B04 


UC2-ColonMetLiver 


3469 


448663 


1923.F06.gz43 252700 


F 


M00042447C.H10 


UC2-ColonMetLiver 


3470 


468729 


1923.L06.gz43 252706 


F 


M00042459B:F12 


UC2-ColonMetLiver 


3471 


451470 


1923.C07.gz43 252713 


F 


M00042355C:G09 


UC2-ColonMetLiver 


3472 


489426 


1923 K07.gz43_252721 


F 


M00042457D:D02 


UC2-ColonMetLiver 


3473 


450302 


1923F08.gz43_252732 


F 


M00042447D:E04 


UC2-ColonMetLiver 


3474 


559575 


1923.M08.gz43 252739 


F 


M00042460B:G12 


UC2-ColonMetLiver 


3475 


496446 


1923.A09.gz43 252743 


F 


M00042351D:D02 


UC2-ColonMetLiver 


3476 


492110 


1923.F09.gz43 252748 


F 


M00042447D:G10 


UC2-ColonMetLiver 


3477 


495941 


1923.G09.gz43 252749 


F 


M00042450A:D02 


UC2-ColonMetLiver 


3478 


446621 


1923.J09.gz43_252752 


F 


M00042455D.H08 


UC2-ColonMetLiver 


3479 


562115 


1923.N09.gz43 252756 


F 


M00042516A:A10 


UC2-ColonMetLiver 


3480 


494300 


1923.O09.gz43 252757 


F 


M00042517D:A12 


UC2-ColonMetLiver 






1923.A10.gz43_252759 


F 


M00042351D.F08 


UC2-ColonMetLiver 


3482 


486912 


1923.H10.gz43 252766 


F 


M00042452C:A09 


UC2-ColonMetLiver 


3483 


456492 


1923.J10.gz43_252768 


F 


M00042456A:C08 


UC2-ColonMetLiver 


3484 


562060 


1923.N10.gz43_252772 


F 


M00042516A:C08 


UC2-ColonMetLiver 


3485 


4244 


1923.P10.gz43 252774 


F 


M00042520A.F09 


UC2-ColonMetLiver 


3486 


451429 


1923.All.gz43_252775 


F 


M00042351D:H05 | UC2-ColonMetLiver | 
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Table 2 



SEQ 
ID 
NO 


CLUSTEI 


H SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3487 


449974 


1923.Hll.gz43 252782 


F 


M00042452C:C10 


UC2-ColonMetLiver 


3488 


59512 


1923.Kll.gz43 252785 


F 


M00042458B:B08 


UC2-ColonMetLiver 


3489 


550362 


1923.Mll.gz43 252787 


F 


M00042460C:A02 


UC2-ColonMetLiver 


3490 


211273 


1923.OH.gz43 252789 


F 


M00042517D:G04 


UC2-ColonMetLiver 


3491 


450400 


1923.I12.gz43 252799 


F 


M00042454A:F02 


UC2-ColonMetLiver 


3492 


562627 


1923.L12.gz43 252802 


F 


M00042459C.E06 


UC2-ColonMetLiver 


3493 


494393 


1923.B13.gz43 252808 


F 


M00042353D:B08 


UC2-ColonMetLiver 


3494 


451049 


1923.F13.gz43 252812 


F 


M00042448B:C04 


UC2-ColonMetLiver 


3495 


558729 


1923.K13.gz43 252817 


F 


M00042458B:G05 


UC2-ColonMetLiver 


3496 


526733 


1923.L13.gz43 252818 


F 


M00042459C:G02 


UC2-ColonMetLiver 


3497 


558559 


1923.N13.gz43 252820 


F 


M00042516B:E03 


UC2-ColonMetLiver 


3498 


451518 


1923.A14.gz43 252823 


F 


M00042352A:G05 


UC2-ColonMetLiver 


3499 


349 


1923.M14.gz43 252835 


F 


M00042460C:B02 


UC2-ColonMetLiver 


3500 


456672 


1923.A15.gz43_252839 


F 


M00042352A:G09 


UC2-ColonMetLiver 


3501 


497101 


1923.D15.gz43 252842 


F 


M00042443D:E01 


UC2-ColonMetLiver 


3502 


488680 


1923.H15.gz43_252846 


F 


M00042452D:C04 


UC2-ColonMetLiver 


3503 


552236 


1923L15.gz43_252850 


F 


M00042459D:B07 


UC2-ColonMetLiver 


3504 


450823 


1923.B17.gz43_252872 


F 


M00042354B:A07 


UC2-ColonMetLiver 


3505 


446263 


1923.D17.gz43_252874 


F 


M00042444A:C10 


UC2-ColonMetLiver 


3506 


492094 


1923.H17.gz43 252878 


F 


M00042452D:G06 


UC2-ColonMetLiver 


3507 


552581 


1923.K17.gz43_252881 


F 


M00042458C:E06 


UC2-ColonMetLiver 


3508 


494362 


1923.G18.gz43_252893 


F 


M00042450C:B09 


UC2-ColonMetLiver 


3509 


560420 


1923.L18.gz43_252898 


F 


M00042459D:E03 


UC2-ColonMetLiver 


3510 


562399 


1923.N18.gz43_252900 


F 


M00042516C:C04 


UC2-ColonMetLiver 


3511 


487522 


1923.O18.gz43_252901 


F 


M00042518D:A08 


UC2-ColonMetLiver 


3512 


498194 


1923.A19.gz43 252903 


F 


M00042352B:F03 


UC2-ColonMetLiver 


3513 


451126 


1 923 .B 1 9gz43^252904 


F 


M00042354C:F04 


UC2-ColonMetLiver 


3514 


450507 


1923.E19.gz43_252907 


F 


M00042446B:G02 


UC2-ColonMetLiver 


3515 


450723 


1923.I19.gz43 252911 


F 


M00042454D:H10 


UC2-ColonMetLiver 


3516 


492981 


1923.J19.gz43_252912 


F 


M00042456C:H07 


UC2-ColonMetLiver 


3517 


451054 


1923.F20.gz43_252924 


F 


M00042448C:C09 


UC2-ColonMetLiver 


3518 


453667 


1923.J20.gz43_252928 


F 


M00042456D:B06 


UC2-ColonMetLiver 


3519 


562588 


1923.K20.gz43_252929 


F 


M00042458D:E06 


UC2-ColonMetLiver 


3520 


527679 


1923.L20.gz43_252930 


F 


M00042459D:G04 


UC2-ColonMetLiver 


3521 


494133 


1923.B22.gz43_252952 


F 


M00042355A:A12 


UC2-ColonMetLiver 


3522 


492304 


1923.122.gz43_252959 


F 


M00042455A:G12 


UC2-ColonMetLiver 


3523 


100821 


1923J22.gz43_252960 


F 


M00042456D.D07 


UC2-ColonMetLiver 


3524 


562603 


1923.M22.gz43_252963 


F 


M00042460C:G08 


UC2-ColonMetLiver 


3525 


562275 


1923.N22.gz43_252964 


F 


M00042516D:B11 


UC2-ColonMetLiver 


3526 


451302 


1923.A23.gz43 252967 


F 


M00042352C:G01 


UC2-ColonMetLiver 


3527 


451126 


1923.B23.gz43_252968 


F 


M00042355A:C03 


UC2-ColonMetLiver 


3528 


494130 


1923.C23.gz43_252969 


F 


M00042442A:A12 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3529 


562587 


1923.K23.gz43_252977 


F 


M00042459A:E04 


UC2-ColonMetLiver 


3530 


562543 


1923.M23.gz43_252979 


F 


M00042460C:H10 


UC2-ColonMetLiver 


3531 


562398 


1923.N23.gz43_252980 


F 


M00042516D:C01 


UC2-ColonMetLiver 


3532 


497517 


1923P23.gz43_252982 


F 


M00043296A.E08 


UC2-ColonMetLiver 


3533 


450902 


1923.A24.gz43_252983 


F 


M00042352D:A11 


UC2-ColonMetLiver 


3534 


499178 


1923.C24.gz43_252985 


F 


M00042442A:G04 


UC2-ColonMetLiver 


3535 


450507 


1923.E24.gz43_252987 


F 


M00042446D:F04 


UC2-ColonMetLiver 


3536 


448556 


1923.F24.gz43_252988 


F 


M00042449A:H10 


UC2-ColonMetLiver 


3537 


454575 


1923.124.gz43_252991 


F 


M00042455B:D05 


UC2-ColonMetLiver 


3538 


275 


1923.L24.gz43_252994 


F 


M00042460B:C06 


UC2-ColonMetLiver 


3539 


452719 


1934.C01.gz43_253001 


F 


M00043360C:E07 


UC2-ColonMetLiver 


3540 


449842 


1934.G02.gz43_253021 


F 


M00043369D:B01 


UC2-ColonMetLiver 


3541 


489506 


1934.N02.gz43_253028 


F 


M00043384D:D05 


UC2-ColonMetLiver 


3542 


453893 


1934.P02.gz43_253030 


F 


M00043388A:D05 


UC2-ColonMetLiver 


3543 


452586 


1934A03.gz43_25303 1 


F 


M00043356B:E12 


UC2-ColonMetLiver 


3544 


491177 


1934C03.gz43_253033 


F 


M00043360C:F11 


UC2-ColonMetLiver 


3545 


496586 


1934.M03.gz43_253043 


F 


M00043382C:D07 


UC2-ColonMetLiver 


3546 


490393 


1934.O03.gz43_253045 


F 


M00043387A:E02 


UC2-ColonMetLiver 


3547 


450756 


1934.D04.gz43_253050 


F 


M00043363B:A11 


UC2-ColonMetLiver 


3548 


456492 


1934.104.gz43_253055 


F 


M00043373D:G04 


UC2-ColonMetLiver 


3549 


490395 


1934.O04.gz43_253061 


F 


M00043387A:E03 


UC2-ColonMetLiver 


3550 


489275 


1934.P04.gz43_253062 


F 


M00043388B:C02 


UC2-ColonMetLiver 


3551 


489368 


1934.C05.gz43_253065 


F 


M00043360D:D01 


UC2-ColonMetLiver 


3552 


450756 


1934E05.gz43_253067 


F 


M00043364D:H11 


UC2-ColonMetLiver 


3553 


496084 


1934.K05.gz43_253073 


F 


M00043378D:D12 


UC2-ColonMetLiver 


3554 


450193 


1934.L05.gz43_253074 


F 


M00043380D:E10 


UC2-ColonMetLiver 


3555 


499240 


1934.M05.gz43_253075 


F 


M00043382C:G09 


UC2-ColonMetLiver 


3556 


448230 


1934.A06.gz43_253079 


F 


M00043356C:F03 


UC2-ColonMetLiver 


3557 


455707 


1934.H06.gz43_253086 


F 


M00043372B:B01 


UC2-ColonMetLiver 


3558 


496760 


1934.N06.gz43_253092 


F 


M00043385A:E01 


UC2-ColonMetLiver 


3559 


447429 


1934.A07.gz43_253095 


F 


M00043356C:H07 


UC2-ColonMetLiver 


3560 


450551 


1934.G07.gz43_253101 


F 


M00043369D:G10 


UC2-ColonMetLiver 


3561 


451144 


1934.J07.gz43_253104 


F 


M00043376B:C03 


UC2-ColonMetLiver 


3562 


491402 


1934.C08.gz43_253113 


F 


M00043361B:F02 


UC2-ColonMetLiver 


3563 


449956 


1934E08.gz43_253115 


F 


M00043365A:C06 


UC2-ColonMetLiver 


3564 


449959 


1934.G08.gz43_253117 


F 


M00043370A:C08 


UC2-ColonMetLiver 


3565 


560213 


1934.108.gz43_253119 


F 


M00043374A:E05 


UG2-ColonMetLiver 


3566 


450482 


1934.J08.gz43_253120 


F 


M00043376B.F06 


UC2-ColonMetLiver 


3567 


449795 


1934.B09.gz43_253128 


F 


M00043359B:A04 


UC2-ColonMetLiver 


3568 


486856 


1934.O09.gz43_253141 


F 


M00043387B:A03 


UC2-ColonMetLiver 


3569 


450080 


1934.C10.gz43_253145 


F 


M00043361D:A02 


UC2-ColonMetLiver 


3570 


510272 


1934.110.gz43_253151 


F 


M00043374A:G04 


UC2-ColonMetLiver 



157 



# • 



Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3571 


450583 


1934.All.gz43_253159 


F 


M00043357A:G04 


UC2-ColonMetLiver 


3572 


450193 


1934.Bll.gz43_253160 


F 


M00043359B:D06 


UC2-ColonMetLiver 


3573 


134392 


1934.Mll.gz43_253171 


F 


M00043383B:F12 


UC2-ColonMetLiver 


3574 


496234 


1934.Nll.gz43_253172 


F 


M00043385C:D06 


UC2-ColonMetLiver 


3575 


487893 


1934.A12.gz43_253175 


F 


M00043357B:B02 


UC2-ColonMetLiver 


3576 


491507 


1934.D12.gz43_253178 


F 


M00043363D:F07 


UC2-ColonMetLiver 


3577 


233814 


1934J12.gz43_253184 


F 


M00043376D:A12 


UC2-ColonMetLiver 


3578 


450189 


1934.A13.gz43_253191 


F 


M00043357B:D01 


UC2-ColonMetLiver 


3579 


488613 


1934.E13.gz43_253195 


F 


M00043365B:C08 


UC2-ColonMetLiver 


3580 


451011 


1934.K13.gz43_253201 


F 


M00043379C:B09 


UC2-ColonMetLiver 


3581. 


450207 


1934.G14.gz43_253213 


F 


M00043370B:D08 


UC2-ColonMetLiver 


3582 


451185 


1934.J14.gz43_253216 


F 


M00043376D:D12 


UC2-ColonMetLiver 


3583 


551380 


1934.L14.gz43_253218 


F 


M00043381B:E10 


UC2-ColonMetLiver 


3584 


495143 


1934.M14.gz43_253219 


F 


M00043383D:C07 


UC2-ColonMetLiver 


3585 


487183 


1934.014.gz43_253221 


F 


M00043387C:A11 


UC2-ColonMetLiver 


3586 


631526 


1934.115.gz43_253231 


F 


M00043374D:D07 


UC2-ColonMetLiver 


3587 


449780 


1934.E16.gz43_253243 


F 


M00043365C:A09 


UC2-ColonMetLiver 


3588 


491492 


1934.F16.gz43_253244 


F 


M00043368A:F03 


UC2-ColonMetLiver 


3589 


494099 


1934.K16.gz43_253249 


F 


M00043379D:A05 


UC2-ColonMetLiver 


3590 


450819 


1934.N16.gz43_253252 


F 


M00043386A:A11 


UC2-ColonMetLiver 


3591 


488349 


1934.P16.gz43_253254 


F 


M00043389D:B10 


UC2-CoIonMetLiver 


3592 


515423 


1934.1 17gz43_253263 


F 


M00043374D:H09 


UC2-ColonMetLiver 


3593 


490846 


1934.C18.gz43_253273 


F 


M00043362C:E02 


UC2-ColonMetLiver 


3594 


556511 


1934.L18.gz43_253282 


F 


M00043381C:E10 


UC2-ColonMetLiver 


3595 


487437 


1934.M18.gz43_253283 


F 


M00043384B:A04 


UC2-ColonMetLiver 


3596 


491240 


1934.E19.gz43_253291 


F 


M00043365C:F06 


UC2-ColonMetLiver 


3597 


452710 


1934.H19.gz43_253294 


F 


M00043373A:G11 


UC2-ColonMetLiver 


3598 


449959 


1934J20.gz43_253312 


F 


M00043377C:A11 


UC2-ColonMetLiver 


3599 


449978 


1934.P20.gz43_253318 


F 


M00043390B:C04 


UC2-ColonMetLiver 


3600 


450242 


1934.A21.gz43_253319 


F 


M00043358A:D02 


UC2-ColonMetLiver 


3601 


451294 


1934.121. gz43_253327 


F 


M00043375A:E02 


UC2-ColonMetLiver 


3602 


450080 


1934.G22.gz43_253341 


F 


M00043371B:C10 


UC2-ColonMetLiver 


3603 


451361 


1934.J22.gz43_253344 


F 


M00043377D:E01 


UC2-ColonMetLiver 


3604 


489207 


1934.M22.gz43_253347 


F 


M00043384C:C02 


UC2-ColonMetLiver 


3605 


488030 


1934.022.gz43_253349 


F 


M00043387D:B02 


UC2-ColonMetLiver 


3606 


449908 


1934.G23.gz43_253357 


F 


M00043371C:B02 


UC2-ColonMetLiver 


3607 


450211 


1934.023.gz43_253365 


F 


M00043387D:D06 


UC2-ColonMetLiver 


3608 


402916 


1934.124.gz43_253375 


F 


M00043375B:A04 


UC2-ColonMetLiver 


3609 


494450 


1934.J24.gz43_253376 


F 


M00043378B:B05 


UC2-ColonMetLiver 


3610 


492627 


1934.M24.gz43_253379 


F 


M00043384C:G01 


UC2-ColonMetLiver 


3611 


550246 


1947.J01.gz43_253404 


F 


M00054555D:C03 


UC2-ColonMetLiver 


3612 


550952 


1947.106.gz43_253483 


F 


M00054555A:H09 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3613 


555502 


1947.A10.gz43_253539 


F 


M00054545C:B09 


UC2-ColonMetLiver 


3614 


496586 


1947.P21.gz43_253730 


F 


M00054566D:G08 


UC2-ColonMetLiver 


3615 


515707 


1947.G22.gz43_253737 


F 


M00054553D:E09 


UC2-ColonMetLiver 


3616 


552092 


1959.F03.gz43_253816 


F 


M00054631D:C02 


UC2-ColonMetLiver 


3617 


549210 


1959.M04.gz43_253839 


F 


M00054640B:C05 


UC2-ColonMetLiver 


3618 


555923 


1959.D08.gz43_253894 


F 


M00054629C:G06 


UC2-ColonMetLiver 


3619 


461 


1959.P08,gz43_253906 


F 


M00054644B:F02 


UC2-ColonMetLiver 


3620 


551975 


1959.D19.gz43_254070 


F 


M00054630B:A06 


UC2-ColonMetLiver 


3621 


549516 


1959.N22.gz43_254128 


F 


M00054642D:F11 


UC2-ColonMetLiver 


3622 


555949 


1959.K24.gz43_254157 


F 


M00054638D:F02 


UC2-ColonMetLiver 


3623 


193991 


1960L03.gz43_254206 


F 


M00054661A:B03 


UC2-ColonMetLiver 


3624 


561590 


1960.107.gz43_254267 


F 


M00054656C:E10 


UC2-ColonMetLiver 


3625 


450867 


1960.C12.gz43_254341 


F 


M00054648C:H10 


UC2-ColonMetLiver 


3626 


264575 


1960.N14.gz43_254384 


F 


M00054664D:A01 


UC2-ColonMetLiver 


3627 


502683 


1960.B23.gz43_254516 


F 


M00054647D:H02 


UC2-ColonMetLiver 


3628 


452662 


1960.E24.gz43_254535 


F 


M00054651D:D02 


UC2-ColonMetLiver 


3629 


551269 


1969.F03.gz43_254584 


F 


M00054674B:B03 


UC2-ColonMetLiver 


3630 


579481 


1969C07.gz43_254645 


F 


M00054671A:H07 


UC2-ColonMetLiver 


3631 


450624 


1969A10.gz43_254691 


F 


M00054668A:D01 


UC2-ColonMetLiver 


3632 


415326 


1969.Nll.gz43_254720 


F 


M00054685D:D09 


UC2-ColonMetLiver 


3633 


573733 


1969.C24.gz43_254917 


F 


M00054671D:A12 


UC2-ColonMetLiver 


3634 


551659 


1983 D01.gz43_254934 


F 


M00054811D:F01 


UC2-ColonMetLiver 


3635 


549816 


1983.E06.gz43_255015 


F 


M00054812D:H05 


UC2-ColonMetLiver 


3636 


557525 


1983.F09.gz43_255064 


F 


M00054814A:F07 


UC2-ColonMetLiver 


3637 


447405 


1983.Kll.gz43_255101 


F 


M00054823B:E07 


UC2-ColonMetLiver 


3638 


494471 


1983.C19.gz43_255221 


F 


M00054811B:H12 


UC2-ColonMetLiver 


3639 


456520 


1983E19.gz43_255223 


F 


M00054813B:D11 


UC2-ColonMetLiver 


3640 


533520 


1984.N02.gz43_255344 


F 


M00054848B:E02 


UC2-ColonMetLiver 


3641 


560984 


1984P06.gz43_255410 


F 


M00054851B:E03 


UC2-ColonMetLiver 


3642 


562302 


1984.G07.gz43_255417 


F 


M00054839D:F08 


UC2-ColonMetLiver 


3643 


554196 


1984.A12.gz43_255491 


F 


M00054831A:E11 


UC2-ColonMetLiver 


3644 


557703 


1984.G21.gz43_255641 


F 


M00054840D:B03 


UC2-ColonMetLiver 


3645 


562243 


1994E15.gz43_255927 


F 


M00054877A:F01 


UC2-ColonMetLiver 


3646 


559531 


1994A19.gz43_255987 


F 


M00054872A:H10 


UC2-ColonMetLiver 


3647 


447483 


1994.F22.gz43_256040 


F 


M00054878B:G03 


UC2-ColonMetLiver 


3648 


549391 


1994.K24.gz43_256077 


F 


M00054885C:G06 


UC2-ColonMetLiver 


3649 


550204 


1995.C02.gz43_256101 


F 


M00054893D:F07 


UC2-ColonMetLiver 


3650 


554604 


1995.K02.gz43_256109 


F 


M00054904A:F07 


UC2-ColonMetLiver 


3651 


558326 


1995.N02.gz43_256112 


F 


M00054908D-.A07 


UC2-ColonMetLiver 


3652 


557218 


1995A03.gz43_256115 


F 


M00054891D:H05 


UC2-ColonMetLiver 


3653 


562152 


1995.C03.gz43_256117 


F 


M00054893D:G09 


UC2-ColonMetLiver 


3654 


562679 


1995.F03.gz43_256120 


F 


M00054897C:F03 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEI 


? SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3655 


543855 


1995.G03.gz43 256121 


F 


M00054898D:H12 


UC2-ColonMetLiver 


3656 


555701 


1995.J03.gz43_256124 


F 


M00054903A:G07 


UC2-ColonMetLiver 


3657 


549376 


1995.K03.gz43_256125 


F 


M00054904A:H10 


UC2-ColonMetLiver 


3658 


522220 


1995.M03.gz43 256127 


F 


M00054908A:F10 


UC2-ColonMetLiver 


3659 


550710 


1995.O03.gz43 256129 


F 


M00054922A:C11 


UC2-ColonMetLiver 


3660 


562672 


1995.E04.gz43_256135 


F 


M00054896B:E09 


UC2-ColonMetLiver 


3661 


559088 


1995.104.gz43_256139 


F 


M00054901C:A04 


UC2-ColonMetLiver 


3662 


447858 


1995D05.gz43_256150 


F 


M00054895B:D09 


UC2-ColonMetLiver 


3663 


492256 


1995.L05.gz43 256158 


F 


M00054906D:D06 


UC2-ColonMetLiver 


3664 


550047 


1995.B06.gz43 256164 


F 


M00054893A:F08 


UC2-ColonMetLiver 


3665 


562579 


1995.P06.gz43_256178 


F 


M00054923D:A08 


UC2-ColonMetLiver 


3666 


556804 


1995.F07.gz43 256184 


F 


M00054897D:E08 


UC2-ColonMetLiver 


3667 


553102 


1995.107.gz43_256187 


F 


M00054901C:G01 


UC2-ColonMetLiver 


3668 


559118 


1995.P07.gz43_256194 


F 


M00054923D:D04 


UC2-ColonMetLiver 


3669 


559554 


1995.H08.gz43 256202 


F 


M00054900B:D12 


UC2-ColonMetLiver 


3670 


510309 


1995.K08.gz43_256205 


F 


M00054904C:B06 


UC2-ColonMetLiver 


3671 


562492 


1 995 .N08 .gz43_25 6208 


F 


M00054910A:H12 


UC2-ColonMetLiver 


3672 


553975 


1995.O08.gz43_256209 


F 


M00054922B:A08 


UC2-ColonMetLiver 


3673 


561525 


1995.N09.gz43_256224 


F 


M00054910B:F07 


UC2-ColonMetLiver 


3674 


150839 


1995.O09.gz43 256225 


F 


M00054922B:A10 


UC2-ColonMetLiver 


3675 


451049 


1995.E10.gz43_256231 


F 


M00054896C:F01 


UC2-ColonMetLiver 


3676 


560668 


1995.H10.gz43_256234 


F 


M00054900B:G10 


UC2-ColonMetLiver 


3677 


555260 


1995.110.gz43_256235 


F 


M00054901D:B01 


UC2-ColonMetLiver 


3678 


555785 


1995J10.gz43_256236 


F 


M00054903C:F03 


UC2-ColonMetLiver 


3679 


82864 


1995.Dll.gz43_256246 


F 


M00054895C:B07 


UC2-ColonMetLiver 


3680 


559554 


1995Hll.gz43_256250 


F 


M00054900C:B04 


UC2-ColonMetLiver 


3681 


556404 


1995.C12.gz43_256261 


F 


M00054894C:B11 


UC2-ColonMetLiver 


3682 


516691 


1995.H12.gz43_256266 


F 


M00054900C:C03 


UC2-ColonMetLiver 


3683 


556499 


1995.E13.gz43 256279 


F 


M00054896D:C01 


UC2-ColonMetLiver 


3684 


453715 


1995.F13.gz43_256280 


F 


M00054898A:G09 


UC2-ColonMetLiver 


3685 


556753 


1995H13.gz43_256282 


F 


M00054900C:D08 


UC2-ColonMetLiver 


3686 


549847 


1995J13.gz43_256284 


F 


M00054903C:G03 


UC2-CoIonMetLiver 


3687 


549822 


1995.K13.gz43_256285 


F 


M00054904C:G09 


UC2-ColonMetLiver 


3688 


489001 


1995.L13.gz43 256286 


F 


M00054907B:E07 


UC2-ColonMetLiver 


3689 


562989 


1995.P13.gz43_256290 


F 


M00054924B:E10 


UC2-ColonMetLiver 


3690 


555998 


1995.A14.gz43_256291 


F 


M00054892C:G05 


UC2-ColonMetLiver 


3691 


562106 


1995.B14.gz43_256292 


F 


M00054893C:A06 


UC2-ColonMetLiver 


3692 


557326 


1995.E14.gz43_256295 


F 


M00054896D:C11 


UC2-ColonMetLiver 


3693 


558935 


1995.J14.gz43_256300 


F 


M00054903C:G04 


UC2-ColonMetLiver 


3694 


399766 


1995.L14.gz43_256302 


F 


M00054907B.F01 


UC2-ColonMetLiver 


3695 


553976 


1995.0 14.gz43_256305 


F 


M00054922C:A1 1 


UC2-ColonMetLiver 


3696 


554002 


1995.B15.gz43 256308 


F 


M00054893C:A07 


UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTE1 


SEQ NAME 


ORII 
NT 


CLONE ID 


LIBRARY 


3697 


553691 


1995.P15.gz43 256322 


F 


M00054924B:F07 


UC2-ColonMetLiver 


3698 


556365 


1995.B16.gz43 256324 


F 


M00054893C:B02 


UC2-ColonMetLiver 


3699 


549228 


1995.D16.gz43 256326 


F 


M00054895D:B02 


UC2-ColonMetLiver 


3700 


553534 


1995.G16.gz43 256329 


F 


M00054899C:D11 


UC2-ColonMetLiver 


3701 


557004 


1995.H16.gz43 256330 


F 


M00054900D:F11 


UC2-ColonMetLiver 


3702 


555082 


1995.M17.gz43 256351 


F 


M00054908C:A10 


UC2-ColonMetLiver 


3703 


562714 


1995.P17.gz43 256354 


F 


M00054924D:E01 


UC2-ColonMetLiver 


3704 


552122 


1995.D18.gz43 256358 


F 


M00054895D.C10 


UC2-ColonMetLiver 


3705 


411272 


1995.K18.gz43 256365 


F 


M00054904D.F07 


UC2-ColonMetLiver 


3706 


454812 


1995.A19.gz43 256371 


F 


M00054892D:D06 


UC2-ColonMetLiver 


3707 


558177 


1995.J19.gz43 256380 


F 


M00054903D:G01 


UC2-ColonMetLiver 


3708 


556852 


1995.P19.gz43 256386 


F 


M00054925A:B08 


UC2-ColonMetLiver 


3709 


551463 


1995.B20.gz43 256388 


F 


M00054893C:E04 


UC2-ColonMetLiver 


3710 


558011 


1995.G20.gz43 256393 


F 


M00054899D:C11 


UC2-ColonMetLiver 


3711 


559088 


1995.H20.gz43 256394 


F 


M00054901A:B04 


UC2-ColonMetLiver 


3712 


595066 


1995.I20.gz43 256395 


F 


M00054902B:G02 


UC2-ColonMetLiver 


3713 


558868 


1995.K20.gz43 256397 


F 


M00054904D:G04 


UC2-ColonMetLiver 


3714 


458035 


1995.F21.gz43 256408 


F 


M00054898B:D04 


UC2 -ColonMetLi ver 


3715 


555837 


1995H21.gz43_256410 


F 


M00054901A:F01 


UC2-ColonMetLiver 


3716 


452182 


1995.121.gz43_256411 


F 


M00054902C:A12 


UC2-ColonMetLiver 


3717 


555967 


1995.J21.gz43 256412 


F 


M00054903D:G10 


UC2-ColonMetLiver 


3718 


474163 


1995.B22.gz43 256420 


F 


M00054893C:G06 


UC2 -CoIonMetLiver 


3719 


556169 


1995.H22.gz43 256426 


F 


M00054901A:H11 


UC2-ColonMetLiver 


3720 


558779 


1995.C23.gz43 256437 


F 


M00054895B:A02 


UC2-ColonMetLiver 


3721 


555830 


1995.G23.gz43 256441 


F 


M00054899D:F07 


UC2-ColonMetLiver 


3722 


974 


1995.N23.gz43 256448 


F 


M00054921C:G01 


UC2-ColonMetLiver 


3723 


556632 


1995.B24.gz43 256452 


F 


M00054893D:D12 


UC2-ColonMetLiver 


3724 


596809 


1995.C24.gz43 256453 


F 


M00054895B:A08 


UC2-ColonMetLiver 


3725 


495365 


1995.N24.gz43_256464 


F 


M00054921C:G06 


UC2-ColonMetLiver 


3726 


447639 


1995.024.gz43 256465 


F 


M00054923A:H02 


UC2-ColonMetLiver 


3727 


418512 


1996.K02.gz43 256517 


F 


M00054938A:B08 


UC2-CoIonMetLiver 


3728 


559728 


1996.A04.gz43 256539 


F 


M00054925B.B01 


UC2-ColonMetLiver 


3729 


67060 


1996.P05.gz43 256570 


F 


M00054942C:B04 


UC2-ColonMetLiver 


3730 


549171 


1996.E06.gz43 256575 


F 


M00054931C:F10 


UC2-ColonMetLiver 


3731 


562550 


1996.O06.gz43 256585 


F 


M00054941C:G04 


UC2-CoIonMetLiver 


3732 


552566 


1996.M08.gz43 256615 


F 


M00054939D:E03 


UC2-CoIonMetLiver 


3733 


473512 


1996B13.gz43_256684 


F 


M00054927C:C03 


UC2-ColonMetLiver 


3734 


553535 


1996.N16.gz43 256744 


F 


M00054940D:E06 


UC2-ColonMetLiver 


3735 


555739 


1996.118.gz43_256771 


F 


M00054936B:G02 


UC2-ColonMetLiver 


3736 


873 


1996.N18.gz43_256776 


F 


M00054941A:A08 


UC2-ColonMetLiver 


3737 


549805 


1996.1 19.gz43 256787 


F 


M00054936C:B08 


UC2-ColonMetLiver 


3738 


550782 


1996.B20.gz43 256796 


F 


M00054928A:C11 


UC2-CoIonMetLiver 
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3739 562106 

3740 481166 



1996.F20.gz4 3 256800 F I M00054933B.C02 I UC2-ColonMetLiver 

1996.P20.gz4 3_256810 | F | M00054943A:H1 1 I UC2-ColnnM*tT .i ™- 

3741 447585 

3742 561878 
| 3743 1 554833 
3744 



1996.I21.gz43 ,256819 I F I M00054936C:G03 ^-ColonMetLiver 
2005.K03.gz43 ,256919 I F M00054954A:F12 I UC2-Cnlr>nMfstT .iwr 



2005.C07.gz43_256975lF I M00054945B:G09 UC2-ColonMetLiver 

126 I 2005.J08.gz 43_256998 I F M00054952D:A1 1 | UC2-ColonM etLiver 

37451 496870 2005.I10.gz43 257029 | F I M00054952A:B12 I UC2-ColonM^TW 

3746 " ' 

3747 
| 3748 I 
3749 



I 3750 1 
3751 



555526 200 5.J12.gz43 257062 I F I M00054953B.D07 1 UC2-ColonMetLiver 

562053 I 200 5.A14.gz43 257085 I F I M00054943D.F12 | UC2-Colo^fetL i^T 

450524 2006: B02.gz43 257278 F M00054961C:H07 1 UC2-ColonMe tLiVe7 

557780 I 2006.P02.gz43 257292 | F | M00054974C:A04 | UC2-ColonMetLiver 



456795 2006.A03.gz4 3_257293 I F I M00054960C.C03 | UC2-ColonMetLivcr 
556954 I 20Q6.E03.gz4 3_257297 I F M00054964A.F09 | UC2-ColonMetL iver 
37521 553021 I 2006 G03.gz43_257299 I F M00054966A:C11 I UC2-ColoT^Th^ 



3753 494306 2006.H03.gz43 ,257300 I F I M00054966D:H12 1 UC2-Colo^TT^r 



3754 

13755 



208278 2006.103 .gz4 3_ 257301 I F I M00054967D:F08 | UC2-ColonMetLiver 

504400 I 20O6.L03.gz43_257304 | F | M00054970C:C0 8 I UC2-ColonM ft tT .iwr 

3756 1 
3757 



456203 200 6.H05.gz43 257332 I F I M00054967A:D11 | UC2-ColonMetLiver 
599012 I 2006.C06.gz43 257343 | F | M00054962C:D02 I UC2-CnlonMprT 



3758 555394 2006.006^43,257344 I F M00054963C:C08 | UC2-ColonMetLive r 

3759 593173 I 2006.G06.gz43_ 257347 I F M00054966A:D09 1 UC2-ColonMetL iver 

3760 592941 I 2006.I06.gz43 ,257349 | F I M00054968A:C1 1 I UC2-ColonM^ri^r 



3761 554379 2006.K06.gz43_25735 11 F I M00054969D:D04 1 UC2-ColonMetLiver 

3762 555103 I 20Q6.O06.gz43_257355 I F M00054973B:A10 | UC2-Colo"riVfe^e7 

3763 556123 2006.J08.gz43_257382 I F M00054969A:E05 | UC2-ColonMetL iver 

3764 557421 I 2006 B09.gz43_257390 I F M00054961D:H08 | UC2-ColoriVfcE i^r 

3765 4509321 2006.K09.gz43 257399 | F I M00054969D:G1 1 I UC2-ColonM^Li^T 



3766 553204 20Q6.E10.gz43_257409 I F I M00054964C:B07 | UC2-ColonMetLiver 

3767 55612 6 I 2006.G10.gz43_257411 | F | M00054966A:G0 2 1 UC2-ColonMetI,iver 
3768 1 ' 

3769 

3770 

3771 

3772 

3773 



554204 2006 .M 11 .gz43 257433 1 F I M00054971C:C1 1 I UC2-ColonMetLiver 

556552 I 200 6.P 11. gz43 257436 F M00054974D:C11 | UC2-ColonMrtLi^T 

554524 I 200 6.N12.gz43 257450 F M00054972B:E06 | UC2-ColonMetLi^e7 

559047 I 2006. P12.gz43 257452 F M00054975A:C08 UC2-ColonMetLiVe7 

554365 I 200 6.P15.gz43 257500 F M00054975B:BQ6 \ UC2-ColonMrtLi^r 

556740 I 2006.A17.gz43 257517 | F I M00054961B:D07 | UC2-ColonMeZi^r 



3774 531145 2006.C17.gz43 ,257519 I F I M00054963A:C12 1 UC2-ColonMetLiver 

3775 557883 I 20Q6.F18.gz43_257538 | F | M00054965C:C04 1 UC2-CololdVtofw' 
37761 - " 

3777 



470386 2006.K18.gz43 257543 I F I M00054970A:G03 I UC2-ColonMetLiver 
596152 I 2006.M18.gz43 257545 | F | M00054971C:H06 I UC2-ColoiridetLi^T 



3778 556552 2006.119^43 257558 I F M00054969B:F12 | UC2-ColonMetLiver 

3779 560696 I 2006.C20.gz43_257567 I F | M00054963A:E02 | UC2-ColonMetLiver 
3780| 2181131 2006.K20.gz43 257575 | F I M00054970B:B02 I UC2-ColonMetLiver 
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SEQ 
ID 
NO 


CLUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


3781 


557559 


2006.L20.gz43_257576 


F 


M00054971A:D04 


UC2-ColonMetLiver 


3782 


129715 


2006B22.gz43_257598 


F 


M00054962A:H07 


UC2-ColonMetLiver 


3783 


553976 


2006.C22.gz43_257599 


F 


M00054963A:G12 


UC2-ColonMetLiver 


3784 


561984 


2006.D22.gz43_257600 


F 


M00054964A:B09 


UC2-ColonMetLiver 


3785 


497938 


2006.G22.gz43_257603 


F 


M00054966D:C05 


UC2-ColonMetLiver 


3786 


556951 


2006J22.gz43_257606 


F 


M00054969C:G09 


UC2-ColonMetLiver 


3787 


553983 


2006.N22.gz43_2576 10 


F 


M00054972D.A10 


UC2-ColonMetLiver 


3788 


549579 


2006.P22.gz43_257612 


F 


M00054975C:D08 


UC2-ColonMetLiver 


3789 


555135 


2006.B23 .gz43_2576 14 


F 


M00054962B:A07 


UC2-ColonMetLiver 


3790 


471257 


2006.C23.gz43_257615 


F 


M00054963A:H04 


UC2-ColonMetLiver 


3791 


557080 


2006.J23.gz43_257622 


F 


M00054969C:H05 


UC2-ColonMetLiver 


3792 


554643 


2006.M23.gz43_257625 


F 


M00054971D:F04 


UC2-ColonMetLiver 


3793 


558707 


2007.A01 .gz43_257645 


F 


M00054975C:E02 


UC2-ColonMetLiver 


3794 


557420 


2007.M02.gz43_257673 


F 


M00054993D:F04 


UC2-ColonMetLiver 


3795 


482509 


2007. A03 .gz43_257677 


F 


M00054975C:G06 


UC2-ColonMetLiver 


3796 


471827 


2007.J03.gz43_257686 


F 


M00054989B:C11 


UC2-ColonMetLiver 


3797 


556841 


2007.K03 gz43_257687 


F 


M00054991B:E11 


UC2-ColonMetLiver 


3798 


597957 


2007.L03.gz43_257688 


F 


M00054992D:C12 


UC2-ColonMetLiver 


3799 


556768 


2007.C04.gz43_257695 


F 


M00054978C:E07 


UC2-ColonMetLiver 


3800 


170450 


2007.F04.gz43_257698 


F 


M00054983C:F05 


UC2-ColonMetLiver 


3801 


555593 


2007.104.gz43_257701 


F 


M00054987B:D06 


UC2-ColonMetLiver 


3802 


556839 


2007.J04.gz43_257702 


F 


M00054989B:E04 


UC2-ColonMetLiver 


3803 


551967 


2007.A05.gz43_257709 


F 


M00054976A:A03 


UC2-CoIonMetLiver 


3804 


550814 


2007.105. gz43_25 77 17 


F 


M00054987B:G05 


UC2-ColonMetLiver 


3805 


557811 


2007.L05.gz43_257720 


F 


M00054992D:F06 


UC2-ColonMetLiver 


3806 


559580 


2007.J06.gz43_257734 


F 


M00054989C:C12 


UC2-ColonMetLiver 


3807 


554764 


2007.N06.gz43_257738 


F 


M00054995B:G12 


UC2-ColonMetLiver 


3808 


554395 


2007.P06.gz43_257740 


F 


M00054997D:E01 


UC2-ColonMetLiver 


3809 


555639 


2007.H07.gz43_257748 


F 


M00054986B:D07 


UC2-ColonMetLiver 


3810 


556561 


2007.E08.gz43_257761 


F 


M00054981D:C06 


UC2-ColonMetLiver 


3811 


454463 


2007.108.gz43_257765 


F 


M00054987C:A1 1 


UC2-ColonMetLiver 


3812 


454463 


2007.K08.gz43_257767 


F 


M00054991C:E01 


UC2-ColonMetLiver 


3813 


562750 


2007.C09.gz43_257775 


F 


M00054978D:H09 


UC2-ColonMetLiver 


3814 


560652 


2007.F09.gz43_257778 


F 


M00054983D:D05 


UC2-ColonMetLiver 


3815 


553615 


2O07.I09.gz43_257781 


F 


M00054987C:B12 


UC2-ColonMetLiver 


3816 


555512 


2007.F10.gz43_257794 


F 


M00054984A:A05 


UC2-ColonMetLiver 


3817 


556598 


2007.110.gz43_257797 


F 


M00054987C:G04 


UC2-ColonMetLiver 


3818 


552188 


2007.J10.gz43_257798 


F 


M00054990A:F10 


UC2-ColonMetLiver 


3819 


558670 


2007.All.gz43_257805 


F 


M00054976C:A03 


UC2-ColonMetLiver 


3820 


452682 


2007.Mll.gz43_257817 


F 


M00054994B:D11 


UC2-CoIonMetLiver 


3821 


553915 


2007.Pll.gz43_257820 


F 


M00055000A:H10 


UC2-ColonMetLiver 


3822 


550886 


2007.C12.gz43_257823 


F 


M00054979A:H10 


UC2-ColonMetLiver 
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ID 


LLUSl EF 


SEQ NAME 


Okie 

JN 1 


CLONE ID 


LIBRARY 


3823 


553787 


2007H12.gz43_257828 


F 


M00054986C:D08 


UC2-ColonMetLiver 


3824 


475624 


2007I12.gz43_257829 


F 


M00054987D:D01 


UC2-ColonMetLiver 


3825 


554212 


2007.O12.gz43_257835 


F 


M00054997A:C02 


UC2-ColonMetLiver 


3826 


555883 


2007.P12.gz43_257836 


F 


M00055000B:B04 


UC2-CoIonMetLiver 


3827 


458257 


2007.B13.gz43_257838 


F 


M00054977C:A09 


UC2-ColonMetLiver 


3828 


555197 


2007.D13.gz43_257840 


F 


M00054980D:H07 


UC2-CoIonMetLiver 


3829 


553732 


2007.P13.gz43_257852 


F 


M00055000B:F02 


UC2-ColonMetLiver 


3830 


556416 


2007B14.gz43_257854 


F 


M00054977C:B08 


UC2-ColonMetLiver 


3831 


501056 


2007.B15.gz43_257870 


F 


M00054977C:D08 


UC2-ColonMetLiver 


3832 


552437 


2007.E15.gz43_257873 


F 


M00054982C:A02 


UC2-ColonMetLiver 


3833 


555658 


2O07.L15.gz43_257880 


F 


M00054993B:H06 


UC2-ColonMetLiver 


3834 


476732 


2007.O15.gz43_257883 


F 


M00054997A:G11 


UC2-ColonMetLiver 


3835 


556690 


2007.B16.gz43_257886 


F 


M00054977C:D12 


UC2-ColonMetLiver 


3836 


557615 


2007.H16.gz43 257892 


F 


M00054986D:D11 


UC2-ColonMetLiver 


3837 


543429 


2007.G17.gz43_257907 


F 


M00054985C:B12 


UC2-ColonMetLiver 


3838 


554793 


2007H17.gz43_257908 


F 


M00054986D:E03 


UC2-ColonMetLiver 


3839 


556396 


2007.11 7.gz43_257909 


F 


M00054988C:B08 


UC2-ColonMetLiver 


3840 


556288 


2007.J17.gz43_257910 


F 


M00054990C:A08 


UC2-ColonMetLiver 


3841 


552055 


2007.L17.gz43_257912 


F 


M00054993C:C10 


UC2-ColonMetLiver 


3842 


549516 


2007P17.gz43_257916 


F 


M00055000C:F08 


UC2-ColonMetLiver 


3843 


561112 


2007.B18.gz43_257918 


F 


M00054977C:F09 


UC2-ColonMetLiver 


3844 


557054 


2007.C18.gz43_257919 


F 


M00054979B:G12 


UC2-ColonMetLiver 


3845 


483410 


2007D18.gz43_257920 


F 


M00054981B:B09 


UC2-ColonMetLiver 


3846 


598589 


2007.G18.gz43_257923 


F 


M00054985C:D04 


UC2-ColonMetLiver 


3847 


559096 


2007.O18.gz43_257931 


F 


M00054997B:F10 


UC2-ColonMetLiver 


3848 


556759 


2007.119.gz43_257941 


F 


M00054988C:G02 


UC2-ColonMetLiver 


3849 


555277 


2007.K19.gz43_257943 


F 


M00054992A:C08 


UC2-ColonMetLiver 


3850 


551694 


2007.B20.gz43_257950 


F 


M00054977D:B06 


UC2-ColonMetLiver 


3851 


562655 


2007.E20.gz43_257953 


F 


M00054983A:F08 


UC2-ColonMetLiver 


3852 


559053 


2007.H20.gz43_257956 


F 


M00054987A:A09 


UC2-ColonMetLiver 


3853 


556286 


2007.J20.gz43_257958 


F 


M00054990D:A06 


UC2-ColonMetLiver 


3854 


551798 


2007.K20.gz43_257959 


F 


M00054992A:D11 


UC2-ColonMetLiver 


3855 


549214 


2007.N20.gz43_257962 


F 


M00054996A:D10 


UC2-ColonMetLiver 


3856 


467293 


2007.D21.gz43_257968 


F 


M00054981B:H12 


UC2-ColonMetLiver 


3857 


503452 


2007.J21.gz43_257974 


F 


M00054990D:F04 


UC2-ColonMetLiver 


3858 


556019 


2007.L22.gz43_257992 


F 


M00054993C:G12 


UC2-ColonMetLiver 


3859 


560748 


2007.G23 .gz43_258003 


F 


M00054985D:E03 


UC2-ColonMetLiver 


3860 


562323 


2007.L23.gz43_258008 


F 


M00054993C:H05 


UC2-ColonMetLiver 


3861 


2284 


2007.A24.gz43_258013 


F 


M00054976D:F06 


UC2-ColonMetLiver 


3862 


481243 


2007D24.gz43_258016 


F 


M00054981C:A11 


UC2-ColonMetLiver 


3863 


555958 


2007N24.gz43_258026 


F 


M00054996B:C11 


UC2-ColonMetLiver 


3864 


560984 


2007.O24.gz43_258027 


F 


M00054997C:H03 


UC2-ColonMetLiver 
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Table 2 



SEQ 


CLUSTEF 


SEQ NAME 


ORIE 

NT 


CLONE ID 


LIBRARY 


3865 


555967 


2008.N01 .gz43_258044 


F 


M00055016D:C07 


UC2-ColonMetLiver 


3866 


556488 


2008.P01.gz43_258046 


F 


M00055019B:B11 


UC2-ColonMetLiver 


3867 


553064 


2008.A02.gz43_258047 


F 


M00055001A:B10 


UC2-ColonMetLiver 


3868 


552006 


2008D02.gz43_258050 


F 


M00055005A:C07 


UC2-ColonMetLiver 


3869 


559676 


2008.E02.gz43 258051 


F 


M00055006A:D01 


UC2-ColonMetLiver 


3870 


557308 


2008J02.gz43_258056 


F 


M00055011B:F09 


UC2-ColonMetLiver 


3871 


549723 


2008.N02.gz43_258060 


F 


M00055016D:C11 


UC2-ColonMetLiver 


3872 


549912 


2O08.B03.gz43_258064 


F 


M00055002B:E07 


UC2-ColonMetLiver 


3873 


551444 


2008G03.gz43_258069 


F 


M00055008C:F09 


UC2-ColonMetLiver 


3874 


557240 


2008.P03.gz43_258078 


F 


M00055019B:H10 


UC2-ColonMetLiver 


3875 


559011 


2008.C04.gz43_258081 


F 


M00055003C:E08 


UC2-CoIonMetLiver 


3876 


555701 


2O08.D04.gz43_258082 


F 


M00055005A:E02 


UC2-ColonMetLiver 


3877 


556115 


2008.P04.gz43_258094 


F 


M00055019C:C01 


UC2-ColonMetLiver 


3878 


556042 


2O08.BO5.gz43_258096 


F 


M00055002B:G03 


UC2-ColonMetLiver 


3879 


558212 


2008.E05.gz43_258099 


F 


M00055006B:A09 


UC2-ColonMetLiver 


3880 


554932 


2008.G05 gz43_258 1 0 1 


F 


M00055008C:H10 


UC2-ColonMetLiver 


3881 


548858 


2008J05.gz43_258104 


F 


M00055011C:E04 


UC2-ColonMetLiver 


3882 


605117 


2008.L05.gz43_258 106 


F 


M00055014B:E08 


UC2-ColonMetLiver 


3883 


560525 


2008.M05.gz43_258107 


F 


M00055015D:C09 


UC2-CoIonMetLiver 


3884 


597780 


2008.106.gz43_258119 


F 


M00055010C:C10 


UC2-ColonMetLiver 


3885 


411113 


2008.N06.gz43_258 124 


F 


M00055017A:A11 


UC2-ColonMetLiver 


3886 


552674 


2008.E07.gz43_258131 


F 


M00055006B:C08 


UC2-ColonMetLiver 


3887 


495942 


2008.O07.gz43_258141 


F 


M00055018C:C06 


UC2-ColonMetLiver 


3888 


34381 


2008F08.gz43_258148 


F 


M00055007C:D01 


UC2-ColonMetLiver 


3889 


549195 


2008.H08.gz43_258150 


F 


M00055009C:F06 


UC2-ColonMetLiver 


3890 


554084 


2008L08.gz43_258154 


F 


M00055014C:F05 


UC2-ColonMetLiver 


3891 


557121 


2008.P08.gz43_258158 


F 


M00055019C:G06 


UC2-ColonMetLiver 


3892 


554693 


2008A09.gz43_258159 


F 


M00055001A:G05 


UC2-ColonMetLiver 


3893 


134237 


2008C09.gz43_258161 


F 


M00055003D:F09 


UC2-ColonMetLiver 


3894 


594994 


2008.H09.gz43_258166 


F 


M00055009C:F07 


UC2-ColonMetLiver 


3895 


555371 


2008.109.gz43_258167 


F 


M00055010D:A02 


UC2-ColonMetLiver 


3896 


347057 


2008.E10.gz43_258179 


F 


M00055006B:E05 


UC2-ColonMetLiver 


3897 


558378 


2008H10.gz43_258182 


F 


M00055009D:D12 


UC2-ColonMetLiver 


3898 


497086 


2008.K10.gz43_258185 


F 


M00055013B:B07 


UC2-ColonMetLiver 


3899 


554379 


2008.Bll.gz43_258192 


F 


M00055002D:A07 


UC2-ColonMetLiver 


3900 


451118 


2008.Lll.gz43_258202 


F 


M00055014D:A11 


UC2-ColonMetLiver 


3901 


557059 


2008.M1 l.gz43_258203 


F 


M00055015D:G05 


UC2-ColonMetLiver 


3902 


604179 


2008.01 l.gz43_258205 


F 


M00055018D:D01 


UC2-ColonMetLiver 


3903 


556487 


2008.Pll.gz43_258206 


F 


M00055019D:B02 


UC2-ColonMetLiver 


3904 


562550 


2008.B12.gz43_258208 


F 


M00055002D:E04 


UC2-ColonMetLiver 


3905 


556530 


2008.L12.gz43_258218 


F 


M00055015A:C08 


UC2-ColonMetLiver 


3906 


524897 


2008.O12.gz43_258221 


F 


M00055018D:E05 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 



CLUSTER 



3907 



SEQ NAME 



ORIE 
NT 



480960 I 2008.F13.gz43 258228 



CLONE ID 



F M00055007D.D11 | UC2-CotonMetl,h^r 



LIBRARY 



3908 



560563 | 2008.G13.gz43 258229 



M00055008D:F05 | UC2-ColonMetLi 



3909 



555512 | 2008.I13.gz43 25823 1 



M00055010D.DQ8 | UC2-ColonMetLiv e r 



39 



143218 2008.L13.gz43 258234 



M00055015A:E04 | UC2-ColonMetLiver 



391 



556616 2008.N13.gz43 258236 



M000550 1 7C:C 12 | UC2-ColonMrtI -i™- 



3912 



552613 2008.A14.gz43 258239 



M00055001B:H05 | UC2-ColonMetLiver 



3913 



3914 



555658 2008.I14.gz43 258247 



3915 



516018 2008.K14.gz43 258249 



M00055010D:E08 | UC2-ColonMctLiver 



604859 2008.M14.gz43 258251 



M00055013B H01 I UC2-ColonMf*I .iv*r 



M00055016A:E04 | UC2-ColonMetLiver 



3916 



553967 | 2008.A15.gz43 258255 



3917 



600021 | 2008.I15.gz43 258263 



M00055001C:A11 | UC2-ColonMetLiver 



M00055010D.F1 1 I UC2-ColonMetT .ivp-r 



3918 



437580 | 2008.B16.gz43 258272 



M00055003A:D1 1 I UC2-ColonMert .iv^r 



3919 



556616 I 2008.D16.gz43 25 8274 



M00055005C:G04 1 UC2-ColonMetLiver 



3920 



551268 | 2008.O16.gz43 258285 



M00055018D:H03 | UC2-ColonMetLiver 



3921 



335714 2008.A17.gz43 258287 



M00055001C:C10 I UC2-ColonMetT.iv CT 



3922 



460445 2008.D17.gz43 258290 



M00055005D;B08 I UC2-ColonMetLiver 



3923 



362109 I 2008.I17.gz43 258295 



M000550 1 1 A:B 1 1 1 UC2-Co1onMetI .irer 



3924 



556320 | 2008.J17.gz43 258296 



3925 



931 | 2008.L17.gz43 258298 



M00055012B.A10 | UC2-ColonMetLiver 



3926 



494300 I 2008.M17.gz43 258299 



M00055015B:A04 1 UC2-ColonMeH.iver 



M00055016B:E02 | UC2-ColonMetLiver 



3927 



453908 I 2008.N17.gz43 258300 



M00055017D:D05 I UC2-ColonMeM.iv^r 



3928 



549690 | 2008.A18.gz43 258303 



M00055001C.G07 1 UC2-ColonMetLiver 



3929 



555949 | 2008.B18.gz43 258304 



M00055003A:G04 I UC2-ColonMetT .hw 



3930 



555289 | 2008.D18.gz43 258306 



M00055005D:C 1 1 | UC2-ColonMetLiver 



3931 



550497 2008.F18.gz43 258308 



M00055008B:E08 I UC2-ColonMetI .irer 



3932 



559464 I 2008.I18.gz43 258311 



M00055011A:C06 1 UC2-ColonMetLiver 



3933 



601219 2008.B19.gz43 258320 



M00055003A.H0I | UC2-ColonMetLiver 



3934 



522762 | 2008.F19.gz43 258324 



M00055008B:F03 I UC2-ColonM fi fT ,iv CT 



3935 



558530 | 2008.F20.gz43 258340 



3936 



551656 2008.H20.gz43_258342 



M00055008B:F04 | UC2-ColonMetLiver 



M00055010A:H05 | UC2-ColonMetLiver 



3937 



554245 2008.N20.gz43 258348 



M00055017D:E08 I UC2-ColonMetLiver 



3938 



562142 | 2008.P20.gz43 258350 



M00055020A:G1 1 I UC2-ColonMetI .irer 



3939 



555639 2008.B21.gz43 258352 



M00055003B:E07 | UC2-ColonMetLiver 



3940 



388688 | 2008.12 l gz43 258359 



M00055011A:F06 | UC2-ColonMetLiver 



3941 



476438 | 2008.J21.gz43 258360 



M00055012B:H12 I UC2-ColonMetT,iv ft r 



3942 
3943 



560097 | 2008.L21.gz43 258362 



M00055015B:E09 I UC2-ColonMetLiver 



557308 |2008.M21.gz43 258363 



3944 
3945 



553523 | 2008.D22.gz43 258370 



M00055016C:G04 1 UC2-ColonMetLiver 



M00055005D:G04 1 UC2-ColonMetLiver 



460169 | 2008.H22.gz43 258374 



M00055010B:A06 1 UC2-ColonMetLiver 



3946 



558744 I 2008.B23.gz43 258384 



M00055003B:G11 I UC2-ColonMetLiver 



3947 
3948 



495984 I 2008.I23.gz43 258391 



M00055011A:G12 UC2-ColonMetLiver 



497513 | 2008.H24.gz43 258406 



F | M00055010B.C11 | UC2-ColonMetLiver 
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SEQ NAME 



I ORIE | 
NT 



CLONE ID 



LIBRARY 



3949 557910 

3950 557603 



2017.B02.gz4 3_258434 F I M00055020D:G06 1 UC2-ColorittetLl^r 

2017.I02.gz43 ,258441 I F I M00055029C:D05 I UC 2-CnlonM R tT i wr 

3951 523495 1 1 1 

13952 390017 

3953 610986" 

3954 551884 



2017.M03.az43 ,258461 1 F I M00055034B:G04 | UC2-ColoridetLi^r 
2017.H04.gz43 ,258472 I F I M00055028C:D05 I UCZ-CnlnnMrtT W 



3955 
3956 



2017.B05.gz43 _258482 I F I M00055021A:F08 UC2-ColonMetLiver 
2017.E05.gz43 ,258485 I F I M00055024D:E09 1 UC2-Co1onMeH .iv*r 



559447 2017. E06.gz43 258501 I F I M00055024D:F07 | UC2-ColonMetLiv er 
322123 2017:F06 g z43 258502 | F | M00055026A:G08 | UC2-ColonMetLiver 



3957 556258 2017.H06.gz43 ,258504 I F I M00055028D:A08 I UC2-ColonMetf.iv ft r 



3958 
3959 



561279 2017.K06.gz 43_258507 I F I M00055032B.D08 | UC2-ColonMetLiver 

550815 2017.E07.gz4 3_258517 I F M00055024D:G09 1 UC2-ColonMet Liv e r 

39601 561935 2017.D08.gz43 ,258532 I F I M00055023C:G1 1 | UC2-ColonMetLiv er 

3961 1 557733 2017J08.gz43 258538 | F | M00055031B:E05 | UC2-ColonMetLiver 



[3962 
3963 



139641 
3965 



556852 2017. N09.gz43 258558 I F I M00055035D:E07 | UC2-ColonMetLiv er 
557049 I 2017.D10.gz43 258564 | F | M00055023D:G02 I UC2-ColonMetLiver 



560250 2O17.E10.gz4 3_258565 I F I M00055025A.D11 | UC2-ColonMetLive r 

560294 2017.H10.gz43_258568 I F M00055028D:E10 | UC2-ColonMetLive r 

39661 481930 I 2017Dll gz43_258580 I F M00055024A:B04 | UC2-ColorivieZi^ r 

3967 550986 I 2017.E1 l.gz43_258581 I F M00055025B:A03 | UC2-Colo"^MetLfre 7 

3968 560621 I 2017.Hll.gz43_258584| F | M00055029A:A02 I UC2-Coln'^ri^ 



3969 549599 201711 l.gz43 258585 | F I M00055030B:E03 I UC2-ColonM^n^T 
3970 
3971 



558004 201 7.Jll.gz43 258586 I F I M00055031B:H08 I UC2-ColonMetLiver 
562590 2017.L12.gz43 258604 | F | M00055033B H01 I UC2-ColonMetLi^r 



3972 558755 2017 N12.gz43_258606 I F I M00055035D F09 | UC2-ColonMetLive r 

3973 556458 1 2Q17.M13 gz43_258621 I F M00055034D.A10 1 UC2-Coloritt eZ^e7 

3974 450059 I 2017.B14.gz43 ,258626 I F I M00055021B:G09 | UC2-ColonMe^e7 

3975 557353 I 2017.C14.gz43_258627 I F M00055022C:B12 | UC2-Colo^M^L i^r 

3976 560898 I 2017.D14.gz43 258628 I F M00055024A.C07 1 UC2-ColoriVletiLl^ 

3977 461850 I 2017.114.gz43_258633 I F M00055030B:H07 | UC2-ColorivietL i^e7 

3978 552848 I 2017.J14.gz43 258634 I F I M00055031C:B11 | UC2-ColonM etLi^r 

3979 552891 I 2017.F16.gz43 258662 I F I M00055026C:E07 | UC2-ColonMetL i^~ 

3980 454483H 2017.G16.gz43 ,258663 I F I M00055027D:E08 I UC2-ColonM^n^r 



3981 508671 2017 D18^z43_258692 I F M00055024B:E02 | UC2-ColonMe tLiver 

3982 456183 I 2017 N18.gz43_258702 I F I M00055036A:H10 1 UC2-Colon\toL i^T 

3983 557578 I 2Q17.O18.gz43_258703 I F I M00055037C:D01 | UC2-ColonMetLl^r 

3984 561868 I 2017.D19.gz43_258708 I F M00055024B:E03 | UC2-ColonMetLive r 
39851 556856 I 2017 F19.gz43_258710 F M00055026D:E12 | UC2-Coloriyjetl^ 
39861 557010 I 2017 L19.gz43_258716 F M00055033D:D03 | UC2-ColoriVtoL^e7 
39871 529356 I 2017.D21.gz43_258740 I F M00055024B:E06 | UC2-ColonMetLiver 
39881 557883 I 2017.L21.gz43_258748 I F M00055033D:G01 | UC2-Colo^etL^e7 
39891 549114 I 2017.B22.gz43_258754 I F M00055021D:F06 | UC2-ColonMetLiver 

[39901 556933 | 2Q17.D22.gz43_258756 | F | M00055024B:F02 I U C2-ColortVfertl^r 
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Table 2 



SEQ 
ID 

NO CLUSTEEf 



3991 494314 

3992 408586 




CLONE ID 



LIBRARY 



2Q17.F22.gz4 3_258758 I F M00055027A:E04 | UC2-ColoriVtoD^ r 

2017.M22.gz4 3_258765 1 F I M00Q55035B:DU I U r.2-rnln^T^7 

3993 460078 2017.L23.gz43 258 780 F M000 55034A:G01 1 UC 2-CnlonM^T W 

3994 561836 2017.N23.gz43 2 58782 F M00055036B:H02 I UC 2-ColonM^ 

3995 558185 2043.D03.gz43 259228 I F I M00055222D:B11 I UC2-ColnnM^T .i 



3^1 -,° 6 !- I 2043 F03 fi z43 2 5 9230 | F | M00055225C:E08 I UC2-ColnnlvtoT .i^r 
3998 



551677. 2043.J 03.gz43 259234 F M00055230A:H01 | UC2-Co lonMetLiver 
451720 204TN03.gz43 259238 | F | M00055236C:E05 | UC2-ColonMetLiver 



3999 559447 2043.L05.gz43 259268 | F | M00055233C:F09 I UC2-ColonlvfetTW 

4000 ■ 

4001 



558230 2043.P05.gz43 259272 | F | M00055239B:C05 1 UO-PnlnnM^T 



554627 
558422 



a 



2043.A06.gz 43_259273 F I M00055219B:B04 | UC2-C olonMetLiver ' 
140021 558422 I 2043.N06.gz43 259286 1 F I M00055236D:E04 I UC2- ColonM e fll^T 
4003 1 558679 I 2043.G08.gz43 259311 F | M00055226C:H05 I IIC?-Cn1nnlvWT ^ 



4004 
4005 



491240 2043.F09.gz43 2 59326 I F I M00055225D:G11 I UC2-ColonM etT W 
550815 2043.H09.gz43 259328 I F I M00055227C:D02 1 UC2-Colo nM e ^T 
40061 481362T2043.P10.gz43 259352 | F | M00055239C:F08 I UC2-ColonMetLiver 



557965 2043. B12.gz43 259370 I F I M00055220D:G07 1 UC2-ColonMe tLiTe7 
380127 2043X:i2.gz43 259371 | F | M00055222A:A06 | UC2-ColonMetLiver 



fa 



4007 467710 2043.CU.gz43 259355 I F I M00055221D:H08 I U C2-ColonMertT^ 
4008 ' ' 

[4009 

4010 1 
[4011 

4012 

4013 
14014 
1 4015 
14016 

4017 



374281 2043. M12.gz43 2593811 F I M00055235B:C01 | UC2-ColonMe tLi^r 

464067 2043. C13.gz43 259387 I F I M00055222A:C08 | UC2-ColonMetLiver ' 

552521 2043 .D13.gz43 259388 I F I M00055223B:C04 UC2-ColonMetLi ^e7 

552019 2043. P13.gz43 259400 F M00055239D:C01 | ^-ColonMetLiveT 

556790 2043 .A14.gz43 259401 F M00055219C:C10 UC2-ColonMetLi^e7 

557411 2043.K14.gz43 259411 | F | M00055232C:D04 1 UC2-ColonMetLiver 



4018 
4019 



378459 2043. L15.gz43 259428 I F I M00055233D:G1 1 UC2-ColonMetLiver 
550571 2043.H16.gz43 259440 | F | M00055227D:E07 | UC2-Coloi^MetL^e7 



557676 2043.P16.g z43_259448 I F I M00055239D:G04 UC2-ColonMetLiv"e7 

558652 I 2043.A17.g z43 259449 I F | M00055219C:H0 6 1 UC2-Co1onlvfetT iver 

|4020 | ' 

4021 , 
[4022 
4023 



559053 204 3.I17.gz43 259457 I F I M00055229C:D01 | UC2-ColonMetLiver 
552086 2043^1 8.gz43 259467 | F | M00055222B:A01 | UC2-ColonMetLiver 



552430 2043 .L18.gz43 259476 I F I M00055234A:G12 UC2-ColonMetLiver 
562317 2043.P19.gz43 259496 | F | M00055240A:B07 | UC2-ColonMettlve7 



) 4024 543540 2043.122.gz43_259537 I F [ M00055229D:D07 1 UC2-ColonM etLiver 

14025 489275 I 2043.F23.gz43 259550 I F I M00055226C:A11 | UC2-ColonMe^e7 

4026 450193 1 2Q43.M23.gz43_259557 1 F M00055236B.C07 | UC2-ColonMetLiver 

140271 55765 6 I 2043.O23.gz43_259559 F M00055238D:D10 1 UC2-ColonMetLiver 

40281 559764 I 2043.D24.gz43_259564 F M00055223D:H03 | UC2-ColonMetLiver 

[40291 560717 I 2043.E24.gz43__259565 F M00055225B.H01 | UC2-ColoriM^Ll^r 

4030 552249 I 2043.P24.gz43_259576 F M00055240B:E12 | UC2-ColoriyfoLl^r 

4031 556829 I 2044.G02.gz43_259603 F | M00055248D:B12 1 UC2-CololiMrtLi^ r 
4032| 558093 | 2044.A03.gz43_259613 F | M00055240C:A03 | UC2-Colo^Li^T 
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Table 2 



SEQ 
ID 
INU 


CLUSTEI 


? SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4033 


616134 


2044.B03.gz43 259614 


F 


M00055241D:F10 


UC2-ColonMetLiver 


4034 


558355 


2044.E03.gz43 259617 


F 


M00055245D:A04 


UC2-ColonMetLiver 


4035 


558785 


2044.J03.gz43 259622 


F 


M00055252B:D07 


UC2-ColonMetLiver 


4036 


607430 


2044.F04.gz43 259634 


F 


M00055247B:A11 


UC2-ColonMetLiver 


4037 


558512 


2044.L04.gz43 259640 


F 


M00055255B:F05 


UC2-ColonMetLiver 


4038 


488574 


2044.N04.gz43 259642 


F 


M00055257D.A11 


UC2-ColonMetLiver 


4039 


558098 


2044.P04.gz43_259644 


F 


M00055260B:A05 


UC2-ColonMetLiver 


4040 


557389 


2044.F05.gz43 259650 


F 


M00055247C:B01 


UC2-ColonMetLiver 


4041 


557928 


2044.K05.gz43 259655 


F 


M00055254A:E02 


UC2-ColonMetLiver 


4042 


551305 


2044.E06.gz43 259665 


F 


M00055245D:D01 


UC2-ColonMetLiver 


4043 


504415 


2044.H06.gz43 259668 


F 


M00055250B:A05 


UC2-ColonMetLiver 


4044 


549052 


2044.106.gz43_259669 


F 


M00055251C:D01 


UC2-CoIonMetLiver 


4045 


138927 


2044.O06.gz43_259675 


F 


M00055259A:G06 


UC2-ColonMetLiver 


4046 


554246 


2044.N07.gz43_259690 


F 


M00055257D:E02 


UC2-ColonMetLiver 


4047 


159419 


2044.J08.gz43_259702 


F 


M00055252C:E02 


UC2-ColonMetLiver 


4048 


460049 


2044.P08.gz43 259708 


F 


M00055260B:H06 


UC2-ColonMetLiver 


4049 


560317 


2044.D09.gz43_259712 


F 


M00055244D:C07 


UC2-CoIonMetLiver 


4050 


560317 


2044.E09.gz43_259713 


F 


M00055246A:C01 


UC2-ColonMetLiver 


4051 


552357 


2044.H09.gz43_259716 


F 


M00055250B:G09 


UC2-ColonMetLiver 


4052 


560144 


2044.P09.gz43_259724 


F 


M00055260C:A1 1 


UC2-ColonMetLiver 


4053 


560669 


2044.E10.gz43 259729 


F 


M00055246A:E10 


UC2-ColonMetLiver 


4054 


423884 


2044.N10.gz43_259738 


F 


M00055258A:C07 


UC2-CoIonMetLiver 


4055 


523732 


2044.11 l.gz43_259749 


F 


M00055251C:H07 


UC2-ColonMetLiver 


4056 


493359 


2044.Kll.gz43_259751 


F 


M00055254C:A06 


UC2-ColonMetLiver 


4057 


560204 


2044.E14.gz43_259793 


F 


M00055246B:C04 


UC2-ColonMetLiver 


4058 


558461 


2044.F14.gz43 259794 


F 


M00055248A:F10 


UC2-ColonMetLiver 


4059 


447380 


2044.J14.gz43_259798 


F 


M00055252D:C10 


UC2-ColonMetLiver 


4060 


558559 


2044.O14.gz43_259803 


F 


M00055259B:G1 1 


UC2-ColonMetLiver 


4061 


450025 


2044A15.gz43_259805 


F 


M00055241A:C06 


UC2-ColonMetLiver 


4062 


484086 


2044.E15.gz43_259809 


F 


M00055246B:D10 


UC2-ColonMetLiver 


4063 


558861 


2044.K15.gz43 259815 


F 


M00055254D:B10 


UC2-ColonMetLiver 


4064 


486787 


2044.C16.gz43_259823 


F 


M00055243D:C06 


UC2-ColonMetLiver 


4065 


562256 


2044.L16.gz43_259832 


F 


M00055256A.D12 


UC2-ColonMetLiver 


4066 


561579 


2044E17.gz43_259841 


F 


M00055246C:F08 


UC2-ColonMetLiver 


4067 


560205 


2044.J17.gz43_259846 


F 


M00055253A:C07 


UC2-ColonMetLiver 


4068 


558134 


2044.M17.gz43_259849 


F 


M00055257B:B10 


UC2-ColonMetLiver 


4069 


488143 


2044.H18.gz43_259860 


F 


M00055251A:B09 


UC2-ColonMetLiver 


4070 


500737 


2044.118.gz43_259861 


F 


M00055251D:H11 


UC2-ColonMetLiver 


4071 


238121 


2044C20.gz43_259887 


F 


M00055244A:C06 


UC2-ColonMetLiver 


4072 


629002 


2044.D20.gz43_259888 


F 


M00055245B:H01 


UC2-ColonMetLiver 


4073 


562256 


2044.L21.gz43_259912 


F 


M00055256B:C07 


UC2-CoIonMetLiver 


4074 


541784 


2044.B22.gz43_259918 


F 


M00055242D:E02 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4075 


560507 


2044.D22.gz43_259920 


F 


M00055245C:D04 


UC2-ColonMetLiver 


4076 


555340 


2044F22.gz43_259922 


F 


M00055248C:B11 


UC2-ColonMetLiver 


4077 


560059 


2044.G22.gz43_259923 


F 


M00055249C:B12 


UC2-ColonMetLiver 


4078 


494130 


2044K24.gz43_259959 


F 


M00055255A:F11 


UC2-ColonMetLiver 


4079 


557928 


2044.L24.gz43_259960 


F 


M00055256B:G05 


UC2-ColonMetLiver 


4080 


559321 


2053.K01.gz43_259984 


F 


M00055272D:E04 


UC2-ColcmMet:Liver 


4081 


34381 


2053.E02.gz43_259994 


F 


M00055265A:G01 


UC2-ColonMetLiver 


4082 


556447 


2053.H02.gz43_259997 


F 


M00055268B:D12 


UC2-ColonMetLiver 


4083 


555564 


2053.E03.gz43_260010 


F 


M00055265A:G07 


UC2-ColonMetLiver 


4084 


558360 


2053.M03.gz43_260018 


F 


M00055275B:H06 


UC2-ColonMetLiver 


4085 


558120 


2053.B05.gz43_260039 


F 


M00055262B:A11 


UC2-ColonMetLiver 


4086 


511276 


2053.D05.gz43_260041 


F 


M00055264A:F03 


UC2-ColonMetLiver 


4087 


550074 


2053.F05.gz43_260043 


F 


M00055266B:C12 


UC2-ColonMetLiver 


4088 


558867 


2053.L05.gz43_260049 


F 


M00055274C:C06 


UC2-ColonMetLiver 


4089 


555564 


2053.N05.gz43_260051 


F 


M00055277D:A02 


UC2-ColonMetLiver 


4090 


551798 


2053.O05.gz43_260052 


F 


M00055279A:E03 


UC2-ColonMetLiver 


4091 


556019 


205 3 .D08 gz43_260089 


F 


M00055264B:E06 


UC2-ColonMetLiver 


'4092 


480960 


2053.O09.gz43_260116 


F 


M00055279B:D02 


UC2-ColonMetLiver 


4093 


478087 


2053.CI0.gz43_260120 


F 


M00055263C:D02 


UC2-ColonMetLiver 


4094 


450507 


2053.D10.gz43_260121 


F 


M00055264B:H12 


UC2-ColonMetLiver 


4095 


512721 


2053.E10.gz43_260122 


F 


M00055265C:A04 


UC2-ColonMetLiver 


4096 


558052 


2053.Bll.gz43_260135 


F 


M00055262C:B08 


UC2-ColonMetLiver 


4097 


550863 


2053.A12.gz43_260150 


F 


M00055261B:G12 


UC2-ColonMetLiver 


4098 


560957 


2053.L12.gz43_260161 


F 


M00055274D:A11 


UC2-ColonMetLiver 


4099 


560621 


2053.M12.gz43_260162 


F 


M00055275D:E12 


UC2-ColonMetLiver 


4100 


450242 


2053.C13.gz43_260168 


F 


M00055263C:F10 


UC2-ColonMetLiver 


4101 


446557 


2053B14.gz43_260183 


F 


M00055262C:F05 


UC2-ColonMetLiver 


4102 


560678 


2053.E14.gz43_260186 


F 


M00055265C:E01 


UC2-ColonMetLiver 


4103 


557561 


2053F14.gz43_260187 


F 


M00055266D:C09 


UC2-ColonMetLiver 


4104 


505858 


2053.C15.gz43_260200 


F 


M00055263C:G09 


UC2-ColonMetLiver 


4105 


557561 


2053.E15.gz43_260202 


F 


M00055265C:F01 


UC2-ColonMetLiver 


4106 


448450 


2053.H15.gz43_260205 


F 


M00055268D:G09 


UC2-ColonMetLiver 


4107 


475578 


2053.115.gz43_260206 


F 


M00055270D:B05 


' UC2-ColonMetLiver 


4108 


549052 


2053.L15.gz43_260209 


F 


M00055274D:B10 


UC2-ColonMetLiver 


4109 


562871 


2053.M15.gz43_260210 


F 


M00055275D:H08 


UC2-ColonMetLiver 


4110 


558263 


2053.L18.gz43_260257 


F 


M00055274D:C10 


UC2-ColonMetLiver 


4111 


560538 


2053.O18.gz43_260260 


F 


M00055280A:C09 


UC2-ColonMetLiver 


4112 


554000 


2053.C19.gz43_260264 


F 


M00055263D:C03 


UC2-ColonMetLiver 


4113 


128962 


2053.A20.gz43_260278 


F 


M00055261D:D09 


UC2-ColonMetLiver 


4114 


560278 


2053.M20.gz43_260290 


F 


M00055276B:C09 


UC2-ColonMetLiver 


4115 


420892 


2053.F21.gz43_260299 


F 


M00055267A:D01 


UC2-ColonMetLiver 


4116 


558513 


2053.P21.gz43_260309 


F 


M00055281A:F08 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4117 


511847 


2053.F22.gz43_260315 


F 


M00055267A:G1 1 


UC2-ColonMetLiver 


4118 


481958 


2053.P22.gz43_260325 


F 


M00055281B:D04 


UC2-ColonMetLiver 


4119 


561666 


2053.F24.gz43_260347 


F 


M00055267B:B06 


UC2-ColonMetLiver 


4120 


553665 


2053.124.gz43_260350 


F 


M00055271B:C01 


UC2-ColonMetLiver 


4121 


643513 


1 669. AO 1 .gz43_260687 


F 


M00055651C:F07 


UC2-NormColon 


4122 


641658 


1669.J01.gz43_260696 


F 


M00055663A:H10 


UC2-NormColon 


4123 


639752 


1669M01.gz43_260699 


F 


M00055667B:C08 


UC2-NormColon 


4124 


518569 


1669.P01.gz43_260702 


F 


M00055671A:H03 


UC2-NormColon 


4125 


642068 


1 669.N02 .gz43_2607 1 6 


F 


M00055668C:A04 


UC2-NonnColon 


4126 


639743 


1669.O02.gz43_260717 


F 


M00055669D:B08 


UC2-NormColon 


4127 


642230 


1669.P03.gz43_260734 


F 


M00055671B:B02 


UC2-NormColon 


4128 


447585 


1669.104.gz43_260743 


F 


M00055662B:F11 


UC2-NormColon 


4129 


640078 


1669J04.gz43_260744 


F 


M00055663B:E02 


UC2-NormColon 


4130 


647333 


1669B06.gz43_260768 


F 


M00055653B:C03 


UC2-NormColon 


4131 


472226 


1669.C06.gz43_260769 


F 


M00055654B:G09 


UC2-NormColon 


4132 


455778 


1669.H06.gz43_260774 


F 


M00055661B:E07 


UC2-NortnColon 


4133 


523875 


1669I07.gz43_260791 


F 


M00055662C:C11 


UC2-NormColon 


4134 


658271 


1669J07.gz43_260792 


F 


M00055663C:D05 


UC2-NormColon 


4135 


465104 


1669.K07.gz43_260793 


F 


M00055665B:B10 


UC2-NormColon 


4136 


648352 


1669.O07.gz43_260797 


F 


M00055670A:B04 


UC2-NormColon 


4137 


641469 


1669C08.gz43_260801 


F 


M00055654C:A05 


UC2-NormColon 


4138 


450883 


1669E08.gz43_260803 


F 


M00055657A:B04 


UC2-NormColon 


4139 


638971 


1669.C09.gz43_260817 


F 


M00055654C:C10 


UC2-NormColon 


4140 


641379 


1669I09.gz43_260823 


F 


M00055662C:D12 


UC2-NormColon 


4141 


618998 


1669.A10.gz43_260831 


F 


M00055652B:B11 


UC2-NormColon 


4142 


644971 


1669B10.gz43_260832 


F 


M00055653B:G01 


UC2-NormColon 


4143 


556488 


1669.C10.gz43_260833 


F 


M00055654C:D03 


UC2-NonnColon 


4144 


452687 


1669G10.gz43_260837 


F 


M00055660A:A06 


UC2-NormColon 


4145 


643909 


1669.J10.gz43_260840 


F 


M00055663D:B05 


UC2-NormColon 


4146 


646894 


1669.M10.gz43_260843 


F 


M00055667C:F07 


UC2-NormColon 


4147 


556750 


1669.N10.gz43_260844 


F 


M00055668D:E11 


UC2-NormColon 


4148 


640259 


1669.Bll.gz43_260848 


F 


M00055653B:G1 1 


UC2-NormColon 


4149 


643804 


1669.Dll.gz43_260850 


F 


M00055656A:E09 


UC2-NormColon 


4150 


97909 


1669.Fll.gz43_260852 


F 


M00055659A:A08 


UC2-NormColon 


4151 


503275 


1669.Gll.gz43_260853 


F 


M00055660A:C05 


UC2-NormColon 


4152 


640678 


1669.Lll.gz43_260858 


F 


M00055666B:E06 


UC2-NormColon 


4153 


406734 


1669.A12.gz43_260863 


F 


M00055652B:F12 


UC2-NormColon 


4154 


643563 


1669.G12.gz43_260869 


F 


M00055660A:C08 


UC2-NormColon 


4155 


658447 


1669.H12.gz43_260870 


F 


M00055661C:E11 


UC2-NormColon 


4156 


446188 


1669I12.gz43_260871 


F 


M00055662C:H06 


UC2-NormColon 


4157 


603857 


1669.L12.gz43_260874 


F 


M00055666C:C11 


UC2-NormColon 


4158 


561894 


1669M12.gz43_260875 


F 


M00055667D:B01 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEJ 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4159 


639596 


1669.I15.gz43 260919 


F 


M00055662D:A09 


UC2-NormColon 


4160 


589483 


1669.G17.gz43_260949 


F 


M00055660B:H02 


UC2-NormColon 


4161 


639665 


1669.117.gz43_260951 


F 


M00055662D:B07 


UC2-NormColon 


4162 


641683 


1669O17.gz43_260957 


F 


M00055670D:F02 


UC2-NonnColon 


4163 


642111 


1669C18.gz43_260961 


F 


M00055655A:A09 


UC2-NormColon 


4164 


640179 


1669.K18.gz43_260969 


F 


M00055665D:F11 


UC2-NormColon 


4165 


648352 


1669.N18.gz43_260972 


F 


M00055669B:G02 


UC2-NormColon 


4166 


640018 


1669I20.gz43_260999 


F 


M00055662D:E05 


UC2-NormColon 


4167 


642332 


1 669.L20.gz43_26 1 002 


F 


M00055667A:B12 


UC2-NormColon 


4168 


639706 


1669F21.gz43_261012 


F 


M00055659C:B10 


UC2-NonnColon 


4169 


642411 


1669.C22.gz43_261025 


F 


M00055655B:B08 


UC2-NormColon 


4170 


641091 


1669.122.gz43_261031 


F 


M00055663A:B02 


UC2-NormColon 


4171 


463312 


1669.B23.gz43_261040 


F 


M00055653D:H02 


UC2-NormColon 


4172 


456561 


1669L23.gz43_261050 


F 


M00055667A:H10 


UC2-NormColon 


4173 


649631 


1669A24.gz43_261055 


F 


M00055653A:G08 


UC2-NormColon 


4174 


639674 


1669.F24.gz43_261060 


F 


M00055659C:D06 


UC2-NormColon 


4175 


640261 


1669.124.gz43_261063 . 


F 


M00055663A:G04 


UC2-NormColon 


4176 


640889 


1670N03.gz43_261116 


F 


M00055691B:E07 


UC2-NormColon 


4177 


426366 


1670D05.gz43_261138 


F 


M00055676D:H07 


UC2-NormColon 


4178 


643594 


1670L05.gz43_261 146 


F 


M00055687C:F01 


UC2-NonnColon 


4179 


446675 


1670.M05.gz43_261147 


F 


M00055689B:F04 


UC2-NormCplon 


4180 


640645 


1670H06.gz43_261158 


F 


M00055683B:D10 


UC2-NormColon 


4181 


645909 


1670.J06.gz43_261160 


F 


M00055685B:E12 


UC2-NormCoIon 


4182 


551287 


1670K06.gz43_261161 


F 


M00055686B:D03 


UC2-NormColon 


4183 


646638 


1670N06.gz43_261164 


F 


M00055691C:E02 


UC2-NormColon 


4184 


639805 


1670.B07.gz43_261168 


F 


M00055673D:C01 


UC2-NormColon 


4185 


640092 


1670.A08.gz43_261183 


F 


M00055672D:F06 


UC2-NormColon 


4186 


634122 


1670.M08.gz43_261 195 


F 


M00055689C:B03 


UC2-NormColon 


4187 


641618 


1670G09.gz43_261205 


F 


M00055682C:D06 


UC2-NormColon 


4188 


641287 


1670.L09.gz43_261210 


F 


M00055688A:A02 


UC2-NormColon 


4189 


557625 


1670.H10.gz43_261222 


F 


M00055683C:A03 


UC2-NormColon 


4190 


599838 


1670I10.gz43_261223 


F 


M00055684B:D06 


UC2-NormColon 


4191 


634122 


1670.M10.gz43_261227 


F 


M00055689C:C03 


UC2-NonnColon 


4192 


641726 


1670.K12.gz43_261257 


F 


M00055686D:E04 


UC2-NormColon 


4193 


649921 


1670.L12.gz43_261258 


F 


M00055688A:E04 


UC2-NormColon 


4194 


645431 


1670.P12.gz43_261262 


F 


M00055695D:E01 


UC2-NormColon 


4195 


641728 


1670.K13.gz43_261273 


F 


M00055686D:E10 


UC2-NormColon 


4196 


502609 


1670N13.gz43_261276 


F 


M00055691D:E04 


UC2-NormColon 


4197 


650673 


1670C14.gz43_261281 


F 


M00055675D:E04 


UC2-NormColon 


4198 


644699 


1670H14.gz43_261286 


F 


M00055683C:D11 


UC2-NormColon 


4199 


645146 


1670.A16.gz43_261311 


F 


M00055673B:A04 


UC2-NormColon 


4200 


647427 


1670.E17.gz43_261331 


F 


M00055680B:H04 


UC2-NormColon 
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Table 2 



SEQ 
ID 
JNO 


CLUSTEI 


^ SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4201 


553921 


1670.H17.gz43 261334 


F 


M00055683C:H11 


UC2-NormColon 


4202 


639469 


1670.A18.gz43 261343 


F 


M00055673B:B08 


UC2-NormColon 


4203 


359500 


1670.F18.gz43 261348 


F 


M00055681D:F11 


UC2-NormCoIon 


4204 


649558 


1670.J18.gz43 261352 


F 


M00055685D:B06 


UC2-NormColon 


4205 


561920 


1670.N18.gz43 261356 


F 


M00055692A:E05 


UC2-NormColon 


4206 


446242 


1670.C20.gz43 261377 


F 


M00055676B.D05 


UC2-NonnColon 


4207 


645968 


1670.B21.gz43 261392 


F 


M00055674C:E05 


UC2-NormColon 


4208 


637966 


1670.F22.gz43_261412 


F 


M00055682A:B07 


UC2-NormColon 


4209 


640826 


1670.M22.gz43_261419 


F 


M00055691A:D08 


UC2-NormColon 


4210 


642321 


1670.D23.gz43_261426 


F 


M00055679B:E03 


UC2-NormColon 


4211 


601137 


1670.123.gz43_261431 


F 


M00055685A:E10 


UC2-NormColon 


4212 


416808 


1670.E24.gz43_261443 


F 


M00055681A:F02 


UC2-NormColon 


4213 


647856 


1670.124.gz43_261447 


F 


M00055685A:F02 


UC2-NormColon 


4214 


554564 


1670.K24.gz43_261449 


F 


M00055687C:B04 


UC2-NormColon 


4215 


561422 


1671.107.gz43_261559 


F 


M00055711B:B08 


UC2-NormColon 


4216 


642288 


167 l.P07.gz43_26 1566 


F 


M00055721B:D08 


UC2-NonnColon 


4217 


642062 


1671.E08.gz43 261571 


F 


M00055704A:D09 


UC2-NormColon 


4218 


463368 


1671.K08.gz43_261577 


F 


M00055715A:D10 


UC2-NormColon 


4219 


646028 


1671.D09.gz43_261586 


F 


M00055702C:D01 


UC2-NormColon 


4220 


478192 


1671M09.gz43_261595 


F 


M00055718A:F01 


UC2-NormColon 


4221 


640947 


1671.D10.gz43_261602 


F 


M00055702C:F11 


UC2-NormColon 


4222 


463548 


1671.G10.gz43 261605 


F 


M00055706B:H12 


UC2-NormColon 


4223 


640282 


1671.M10.gz43_261611 


F 


M00055718A.H05 


UC2-NormColon 


4224 


41141 


1671.Bll.gz43_261616 


F 


M00055699C:D02 


UC2-NormColon 


4225 


463368 


1671.11 l.gz43 261623 


F 


M00055711C:A07 


UC2-NormColon 


4226 


558086 


1671.J11 gz43_261624 


F 


M00055713C:F12 


UC2-NormColon 


4227 


560546 


1671.C12.gz43 261633 


F 


M00055701A:B04 


UC2-NormColon 


4228 


551415 


1671.E12.gz43 261635 


F 


M00055704B:C04 


UC2-NormColon 


4229 


557710 


1671N12.gz43_261644 


F 


M00055719A:D11 


UC2-NormColon 


4230 


649429 


1671.G14.gz43_261669 


F 


M00055707A:B07 


UC2-NormColon 


4231 


451401 


1671.L14.gz43 261674 


F 


M00055717B:A03 


UC2-NormColon 


4232 


642039 


1671.C17.gz43 261713 


F 


M00055701B:C03 


UC2-NormColon 


4233 


471268 


1671.F17.gz43 261716 


F 


M00055705C:F12 


UC2-NormColon 


4234 


417259 


1671.H17.gz43 261718 


F 


M00055709D:G10 


UC2-NormColon 


4235 


421826 


1671.L17.gz43_261722 


F 


M00055717B:E04 


UC2-NormColon 


4236 


465589 


1671.M17.gz43 261723 


F 


M00055718B:H11 


UC2-NormColon 


4237 


645505 


1671.N17.gz43 261724 


F 


M00055719A:G04 


UC2-NormColon 


4238 


510195 


1671.N18.gz43 261740 


F 


M00055719A:G12 


UC2-NormColon 


4239 


492094 


1671.P18.gz43 261742 


F 


M00055722A:B05 


UC2-NormColon 


4240 


453006 


1671.G20.gz43_261765 


F 


M00055707A:F11 


UC2-NormColon 


4241 


600115 


1671P21.gz43_261790 


F 


M00055722A:C04 


UC2-NormColon 


4242 


641700 


1671.H22.gz43 261798 


F 


M00055710B:E04 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4243 


466265 


1671.O22.gz43_261805 


F 


M00055721A:A07 


UC2-NormColon 


4244 


553985 


1671.B23.gz43_261808 


F 


M00055700B:E10 


UC2-NormColon 


4245 


536415 


1671.L23.gz43_261818 


F 


M00055717C:B07 


UC2-NormColon 


4246 


640282 


1671.N23.gz43_261820 


F 


M00055719C:A09 


UC2-NormColon 


4247 


498194 


1672.A01.gz43_261841 


F 


M00055722B:A01 


UC2-NormColon 


4248 


641305 


1672.E02.gz43_261861 


F 


M00055725A:F12 


UC2-NormColon 


4249 


641070 


1672.H02.gz43_261864 


F 


M00055727D:G01 


UC2-NormColon 


4250 


556326 


1672B03.gz43_261874 


F 


M00055723A:B08 


UC2-NormColon 


4251 


522497 


1672.C03.gz43_261875 


F 


M00055723D:C04 


UC2-NormColon 


4252 


557833 


1672E03.gz43_261877 


F 


M00055725A:G07 


UC2-NormColon 


4253 


647688 


1672.B04.gz43 261890 


F 


M00055723A:C04 


UC2-NormColon 


4254 


416377 


1672.M04.gz43_261901 


F 


M00055733D:H01 


UC2-NormColon 


4255 


643897 


1672.O04.gz43_261903 


F 


M00055736B:C08 


UC2-NormColon 


4256 


645171 


1672.A06.gz43_261921 


F 


M00055722B:G12 


UC2-NormColon 


4257 


649106 


1672.C06.gz43_261923 


F 


M00055723D:E05 


UC2-NormColon 


4258 


454383 


1672.K06.gz43_261931 


F 


M00055731C:F09 


UC2-NormColon 


4259 


649360 


1672B07.gz43_261938 


F 


M00055723A:F04 


UC2-NormColon 


4260 


639458 


1672.K08.gz43__261963 


F 


M00055731C:H07 


UC2-NormColon 


4261 


642078 


1672.B 1 1 .gz43_262002 


F 


M00055723B:A09 


UC2-NormColon 


4262 


642631 


1672.Lll.gz43_262012 


F 


M00055733A:G04 


UC2-NormColon 


4263 


650303 


1672.A12.gz43_262017 


F 


M00055722C:E11 


UC2-NormColon 


4264 


284586 


1672.C12.gz43_262019 


F 


M00055724A:C12 


UC2-NormCoIon 


4265 


642535 


1672.G12.gz43_262023 


F 


M00055727B:F10 


UC2-NormColon 


4266 


642491 


1672.A13.gz43_262033 


F 


M00055722C:F11 


UC2-NormColon 


4267 


100821 


1672P13.gz43_262048 


F 


M00055738B:H11 


UC2-NormColon 


4268 


641056 


1672.A14.gz43_262049 


F 


M00055722D:A07 


UC2-NormColon 


4269 


642197 


1672.B14.gz43_262050 


F 


M00055723B:C03 


UC2-NoimColon 


4270 


801 


1672.E14.gz43_262053 


F 


M00055725D:B02 


UC2-NormColon 


4271 


449956 


1672.N14.gz43_262062 


F 


M00055735C:G08 


UC2-NormColon 


4272 


645018 


1672.C15.gz43_262067 


F 


M00055724A:E03 


UC2-NormColon 


4273 


640879 


1672.K15.gz43_262075 


F 


M00055731D:F09 


UC2-NormColon 


4274 


642263 


1672.O15.gz43_262079 


F 


M00055736D:D11 


UC2-NormColon 


4275 


647906 


1672.P15.gz43_262080 


F 


M00055738C:C02 


UC2-NormColon 


4276 


559776 


1672.G16.gz43_262087 


F 


M00055727C:B02 


UC2-NormColon 


4277 


647060 


1672.H16.gz43_262088 


F 


M00055728B:G11 


UC2-NormColon 


4278 


642315 


1 672.1 16.gz43_262089 


F 


M00055729C:D11 


UC2-NormColon 


4279 


504501 


1 672 .L 1 6.gz43_262092 


F 


M00055733B:F11 


UC2-NormColon 


4280 


460967 


1672.A17.gz43_262097 


F 


M00055722D:B10 


UC2-NormColon 


4281 


504568 


1 672.C 1 7.gz43_262099 


F 


M00055724A:G08 


UC2-NormColon 


4282 


559828 


1672.D17.gz43_262100 


F 


M00055724D:A02 


UC2-NormColon 


4283 


455113 


1672.H17.gz43_262104 


F 


M00055728C:B08 


UC2-NormColon 


4284 


477064 


1672.117.gz43_262105 


F 


M00055729C:E02 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4285 


1558 


1672.J17.gz43_262106 


F 


M00055731A.G05 


UC2-NormColon 


4286 


419443 


1672L17.gz43_262108 


F 


M00055733B:H05 


UC2-NormColon 


4287 


546705 


1672.118.gz43_262121 


F 


M00055729D:A06 


UC2-NormColon 


4288 


235397 


1672.J18.gz43_262122 


F 


M00055731A:H07 


UC2-NormColon 


4289 


651085 


1672.K18.gz43_262123 


F 


M00055732A:B04 


UC2-NormColon 


4290 


502826 


1672.N18.gz43_262126 


F 


M00055735D:C01 


UC2-NormColon 


4291 


544797 


1672.A19.gz43_262129 


F 


M00055722D:E05 


UC2-NormColon 


4292 


489426 


1672B19.gz43_262130 


F 


M00055723C:B02 


UC2-NonnColon 


4293 


452212 


1672.E19.gz43_262133 


F 


M00055725D:F05 


UC2-NormColon 


4294 


646609 


1672.M19.gz43_262141 


F 


M00055734D:D01 


UC2-NormColon 


4295 


556336 


1672N19.gz43_262142 


F 


M00055735D:E10 


UC2-NormColon 


4296 


467822 


1672.C20.gz43_262147 


F 


M00055724B:D04 


UC2-NormColon 


4297 


650397 


1672L20.gz43_262156 


F 


M00055733C:H12 


UC2-NormColon 


4298 


37862 


1672.P20.gz43_262160 


F 


M00055738D:G08 


UC2-NonnColon 


4299 


644609 


1672I21.gz43_262169 


F 


M00055729D:F05 


UC2-NormColon 


4300 


561968 


1672.L21.gz43_262172 


F 


M00055733D:B11 


UC2-NormColon 


4301 


550049 


1672.P21.gz43_262176 


F 


M00055739A:B02 


UC2-NormColon 


4302 


554708 


1672.H22.gz43_262184 


F 


M00055728D:F02 


UC2-NormColon 


4303 


608540 


1672.P22.gz43_262192 


F 


M00055739A:B04 


UC2-NormColon 


4304 


470769 


1672A23.gz43_262193 


F 


M00055722D:G12 


UC2-NormColon 


4305 


642722 


1672.C23.gz43_262195 


F 


M00055724B:G03 


UC2-NormColon 


4306 


645151 


1672.L23.gz43_262204 


F 


M00055733D:D04 


UC2-NormColon 


4307 


648265 


1672.N23.gz43_262206 


F 


M00055736A:B11 


UC2-NonnColon 


4308 


489249 


1672.F24.gz43_262214 


F 


M00055726D:H12 


UC2-NormColon 


4309 


644173 


1682.H02.gz43_262248 


F 


M00055770A:G08 


UC2-NormColon 


4310 


642417 


1682L02.gz43_262252 


F 


M00055774D:A05 


UC2-NormColon 


4311 


553877 


1682.M02.gz43_262253 


F 


M00055775C:B10 


UC2-NormColon 


4312 


530238 


1682.H03.gz43_262264 


F 


M00055770A:H11 


UC2-NormColon 


4313 


553380 


1682.L03.gz43_262268 


F 


M00055774D:B07 


UC2-NormColon 


4314 


637387 


1682.E04.gz43_262277 


F 


M00055766B:E11 


UC2-NormColon 


4315 


634660 


1682.F04.gz43_262278 


F 


M00055767A:D10 


UC2-NormColon 


4316 


642791 


1682J04.gz43_262282 


F 


M00055772A:H08 


UC2-NormColon 


4317 


446163 


1682.K04.gz43_262283 


F 


M00055773C:C09 


UC2-NonnColon 


4318 


641056 


1682M04.gz43_262285 


F 


M00055775C:E10 


UC2-NormColon 


4319 


550562 


1682.B05.gz43_262290 


F 


M00055763B:D07 


UC2-NormColon 


4320 


648320 


1682.A06.gz43_262305 


F 


M00055761D:C03 


UC2-NormColon 


4321 


446757 


1682H06.gz43_262312 


F 


M00055770B:D06 


UC2-NormColon 


4322 


454540 


1682.K06.gz43_262315 


F 


M00055773C:H12 


UC2-NormColon 


4323 


458979 


1682.L06.gz43_262316 


F 


M00055774D:E02 


UC2-NormColon 


4324 


641174 


1682.P06.gz43_262320 


F 


M00055780C:E02 


UC2-NormColon 


4325 


642852 


1682E08.gz43_262341 


F 


M00055766B:H10 


UC2-NormColon 


4326 


488680 


1682A09.gz43_262353 


F 


M00055761D:H08 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4327 


27083 


1682.K09.gz43_262363 


F 


M00055773D:G11 


UC2-NormColon 


4328 


524470 


1682.C10.gz43 262371 


F 


M00055765A:A04 


UC2-NormColon 


4329 


463951 


1682.J10.gz43_262378 


F 


M00055772C:E11 


UC2-NormColon 


4330 


584179 


1682.Cll.gz43_262387 


F 


M00055765A:B02 


UC2-NormColon 


4331 


642460 


1682Dll.gz43_262388 


F 


M00055765D:E06 


UC2-NormColon 


4332 


448944 


1682.Ell.gz43 262389 


F 


M00055766C:C11 


UC2-NormColon 


4333 


642082 


1682.Fll.gz43_262390 


F 


M00055767B:D02 


UC2-NormColon 


4334 


642095 


1682.Nll.gz43_262398 


F 


M00055778D:A03 


UC2-NormColon 


4335 


562876 


1682.H12.gz43 262408 


F 


M00055770C:H11 


UC2-NormColon 


4336 


453079 


1682.112.gz43_262409 


F 


M00055771C:A11 


UC2-NormColon 


4337 


101499 


1682.L12.gz43_262412 


F 


M00055775A:G09 


UC2-NormColon 


4338 


639829 


1682.N12.gz43_262414 


F 


M00055778D:C03 


UC2-NormColon 


4339 


642332 


1682.P12.gz43_262416 


F 


M00055780D:D09 


UC2-NormColon 


4340 


541499 


1682.G13.gz43_262423 


F 


M00055768B:H12 


UC2-NormColon 


4341 


509410 


1682.11 3. gz43_262425 


F 


M00055771C:D09 


UC2-NormColon 


4342 


89082 


1682.013.gz43_262431 


F 


M00055780A:C04 


UC2-NormColon 


4343 


453091 


1682.P13.gz43_262432 


F 


M00055780D:F08 


UC2-NormColon 


4344 


639981 


1682.A14.gz43_262433 


F 


M00055762B:B11 


UC2-NormColon 


4345 


642564 


1682.B14.gz43_262434 


F 


M00055763C:G08 


UC2-NormColon 


4346 


561558 


1682.014.gz43_262447 


F 


M00055780A:C06 


UC2-NormColon 


4347 


562870 


1682.B15.gz43_262450 


F 


M00055763D:B11 


UC2-NortnColon 


4348 


462742 


1682.D15.gz43_262452 


F 


M00055766A:A01 


UC2-NormColon 


4349 


451037 


1682.E15.gz43_262453 


F 


M00055766C:G12 


UC2-NormColon 


4350 


641563 


1682.J15.gz43_262458 


F 


M00055772D:C10 


UC2-NormColon 


4351 


643940 


1682.A16.gz43_262465 


F 


M00055762C:B04 


UC2-NormColon 


4352 


642204 


1682.C16.gz43_262467 


F 


M00055765A:C11 


UC2-NormColon 


4353 


517912 


1682.J16.gz43_262474 


F 


M00055772D:D03 


UC2-NormColon 


4354 


641576 


1682.D17.gz43_262484 


F 


M00055766A:D08 


UC2-NormColon 


4355 


642082 


1682.F17.gz43_262486 


F 


M00055767D:A12 


UC2-NormColon 


4356 


450211 


1682.017.gz43_262495 


F 


M00055780A:E11 


UC2-NormColon 


4357 


621636 


1682.B18.gz43_262498 


F 


M00055764A:F02 


UC2-NormColon 


4358 


639232 


1682.D18.gz43_262500 


F 


M00055766A:E08 


UC2-NormColon 


4359 


450553 


1682.J18.gz43_262506 


F 


M00055772D:F10 


UC2-NormColon 


4360 


647940 


1682.K18.gz43_262507 


F 


M00055774B:F07 


UC2-NormColon 


4361 


461835 


1682.M18.gz43_262509 


F 


M00055778A:F09 


UC2-NormColon 


4362 


641496 


1682.018.gz43_262511 


F 


M00055780A:F07 


UC2-NormColon 


4363 


464510 


1682.H19.gz43_262520 


F 


M00055770D:E10 


UC2-NormColon 


4364 


642564 


1682.M19.gz43_262525 


F 


M00055778A:F12 


UC2-NormColon 


4365 


629002 


1682.C20.gz43_262531 


F 


M00055765A:G02 


UC2-NormColon 


4366 


1093 


1682.D20.gz43_262532 


F 


M00055766A:H03 


UC2-NormColon 


4367 


50351 


1682.E20.gz43_262533 


F 


M00055766D:D05 


UC2-NormColon 


4368 


642417 


1682.F20.gz43_262534 


F 


M00055767D:E07 


UC2-NormColon 
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Table 2 



CCA 

lU 
VfA 


CLUSTE1 


R SEQ NAME 


ORII 
NT 


CLONE ID 


LIBRARY 


4369 


466971 


1682.J20.gz43 262538 


F 


M00055772D:H04 


UC2-NormColon 


4370 


641185 


1682.K20.gz43 262539 


F 


M00055774B:H01 


UC2-NormColon 


4371 


644240 


1682.L20.gz43 262540 


F 


M00055775B:G02 


UC2-NormColon 


4372 


509027 


1682.M20.gz43 262541 


F 


M00055778A:G02 


UC2-NormColon 


4373 


546740 


1682.F21.gz43 262550 


F 


M00055767D:F04 


UC2-NormColon 


4374 


559662 


1682.L21.gz43 262556 


F 


M00055775B:G04 


UC2-NormColon 


4375 


388688 


1682.B22.gz43 262562 


F 


M00055764B:F05 


UC2-NormColon 


4376 


555140 


1682.G22.gz43 262567 


F 


M00055768D:G02 


UC2-NormColon 


4377 


640617 


1682.K22.gz43 262571 


F 


M00055774C:D09 


UC2-NormColon 


4378 


447224 


1682.D23.gz43 262580 


F 


M00055766B:B01 


UC2-NormColon 


4379 


644765 


1682.B24.gz43 262594 


F 


M00055764C:C07 


UC2-NormColon 


4380 


648996 


1682.F24.gz43 262598 


F 


M00055768A:B05 


UC2-NormColon 


4381 


642036 


1682.I24.gz43 262601 


F 


M00055772A:C10 


UC2-NormColon 


4382 


644914 


1682.J24.gz43 262602 


F 


M00055773B:A07 


UC2-NormColon 


4383 


458425 


1682.M24.gz43 262605 


F 


M00055778B:E01 


UC2-NormColon 


4384 


556476 


1682.N24.gz43 262606 


F 


M00055779B.F06 


UC2-NormColon 


4385 


672601 


1683.E01.gz43 262613 


F 


M00055785D.C06 


UC2-NormColon 


4386 


463290 


1683 .HOI. gz43 262616 


F 


M00055789C:F10 


UC2-NormColon 


4387 


675768 


1683.D03.gz43 262644 


F 


M00055784C:H02 


UC2-NormColon 


4388 


642146 


1683.H03.gz43 262648 


F 


M00055789D:B04 


UC2-NormColon 


4389 


648905 


1683.F04.gz43 262662 


F 


M00055787B:F10 


UC2-NormColon 


4390 


467306 


1683.I04.gz43 262665 


F 


M00055790D.G10 


UC2-NormCoIon 


4391 


497971 


1683.C05.gz43 262675 


F 


M00055783D:B02 


UC2-NormColon 


4392 


645508 


1683.N05.gz43 262686 


F 


M00055795D:E09 


UC2-NormColon 


4393 


644047 


1 683.005 .gz43 262687 


F 


M00055796B:G05 


UC2-NormColon 


4394 


554117 


1683.K06.gz43 262699 


F 


M00055793A:H09 


UC2-NormColon 


4395 


641919 


1683.K08.gz43 262731 


F 


M00055793B.B06 


UC2-NormColon 


4396 


605761 


1683.L08.gz43 262732 


F 


M00055794A:D08 


UC2-NormColon 


4397 


549434 


1683.D09.gz43 262740 


F 


M00055785A:H08 


UC2-NormColon 


4398 


446595 


1683.M09.gz43 262749 


F 


M00055795A:F09 


UC2-NormColon 


4399 


643350 


1683.F10.gz43 262758 


F 


M00055787C.D09 


UC2-NormColon 


4400 


454485 


1683.M10.gz43 262765 


F 


M00055795A:F12 


UC2-NormColon 


4401 


647639 


1 683.0 10.gz43 262767 


F 


M00055796C:E1 1 


UC2-NormColon 


4402 


510545 


1683.11 l.gz43 262777 


F 


M00055791B:E02 


UC2-NonnColon 


4403 


77144 


1683.Nll.gz43 262782 


F 


M00055795D:H08 


UC2-NormColon 


4404 


140909 


1683.C12.gz43_262787 


F 


M00055784A:B07 


UC2-NormColon 


4405 


650076 


1683.D12.gz43 262788 


F 


M00055785B:B06 


UC2-NormColon 


4406 


449836 


1683.F12.gz43 262790 


F 


M00055787C:E12 


UC2-NormColon 


4407 


642184 


1683.B14.gz43 262818 


F 


M00055783A:C06 


UC2-NormColon 


4408 


638962 


1683.H14.gz43 262824 


F 


M00055790B:A08 


UC2-NormColon 


4409 


641700 


1683.M15.gz43 262845 


F 


M00055795B:F09 


UC2-NormColon 


4410 


437580 


1683.N15.gz43 262846 


F 


M00055796A:A08 


UC2-NormColon 
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ir\ 
IU 


CLUSTE] 


R SEQ NAME 


ORII 
NT 


CLONE ID 


LIBRARY 


4411 


456626 


1683.C16.gz43 262851 


F 


M00055784A:D05 


UC2-NormColon 


4412 


553979 


1683.016.gz43 262863 


F 


M00055796D:E06 


UC2-NormColon 


4413 


640525 


1683.017.gz43 262879 


F 


M00055796D:E10 


UC2-NormColon 


4414 


553546 


1683.H18.gz43 262888 


F 


M00055790B:D05 


UC2-NormColon 


4415 


643230 


1683.A19.gz43 262897 


F 


M00055782B:C08 


UC2-NormColon 


4416 


523590 


1683.B20.gz43 262914 


F 


M00055783B.A04 


UC2-NormColon 


4417 


388085 


1683.M20.gz43 262925 


F 


M00055795C:B10 


UC2-NormColon 


4418 


457396 


1683.L21.gz43 262940 


F 


M00055794C.D10 


UC2-NormColon 


4419 


417617 


1683.023.gz43 262975 


F 


M00055797B:A11 


UC2-NormColon 


4420 


463821 


1684.G01.gz43 262999 


F 


M00055804D:F02 


UC2-NormColon 


4421 


448787 


1684.M01.gz43 263005 


F 


M00055811C:A01 


UC2-NormColon 


4422 


32812 


1684.I03.gz43 263033 


F 


M00055807A:B10 


UC2-NormColon 


4423 


418340 


1684.A04.gz43 263041 


F 


M00055798B:D12 


UC2-NormColon 


4424 


462249 


1684.C04.gz43 263043 


F 


M00055800D:B03 


UC2-NonnColon 


4425 


642099 


1684.P04.gz43 263056 


F 


M00055815C:B03 


UC2-NormColon 


4426 


644915 


1684.A05.gz43 263057 


F 


M00055798B:F02 


UC2-NormColon 


4427 


640222 


1684.G05.gz43 263063 


F 


M00055805A:A02 


UC2-NormColon 


4428 


550370 


1684.K05.gz43 263067 


F 


M00055809A:B10 


UC2-NormColon 


4429 


645560 


1684.A06.gz43 263073 


F 


M00055798B:G04 


UC2-NormColon 


4430 


513238 


1684.E06.gz43 263077 


F 


M00055803A:C06 


UC2-NormColon 


4431 


650231 


1684.M07.gz43 263101 


F 


M00055811D:C02 


UC2-NormColon 


4432 


644314 


1684.H08.gz43 263112 


F 


M00055806B:B10 


UC2-NormColon 


4433 


650782 


1684.J08.gz43_263114 


F 


M00055808B:A04 


UC2-NormColon 


4434 


648221 


1684.P08.gz43 263120 


F 


M00055815C:G05 


UC2-NormColon 


4435 


554742 


1684.M09.gz43 263133 


F 


M00055811D:C12 


UC2-NormColon 


4436 


642962 


1684.P09.gz43 263136 


F 


M00055815D:A11 


UC2-NormColon 


4437 


455028 


1684.A10.gz43 263137 


F 


M00055798D:A10 


UC2-NormColon 


4438 


447015 


1684.P10.gz43 263152 


F 


M00055815D:B02 


UC2-NormColon 


4439 


642653 


1684.Ell.gz43 263157 


F 


M00055803A:G08 


UC2-NormColon 


4440 


647336 


1684.01 l.gz43 263167 


F 


M00055814C:C07 


UC2-NormColon 


4441 


549640 


1684.Pll.gz43 263168 


F 


M00055815D:B03 


UC2-NormColon 


4442 


642198 


1684.G12.gz43 263175 


F 


M00055805B:C08 


UC2-NormColon 


4443 


520320 


1684.I12.gz43 263177 


F 


M00055807B:F05 


UC2-NormColon 


4444 


644147 


1684.N12.gz43 263182 


F 


M00055813A:D10 


UC2-NormColon 


4445 


530774 


1684.E13.gz43 263189 


F 


M00055803B:A11 


UC2-NormColon 


4446 


375814 


1684.H14.gz43 263208 


F 


M00055806C:E09 


UC2-NormColon 


AAAH 


44o /ay 


1684.I14.gz43 263209 


F 


M00055807B:G10 


UC2-NormColon 


4448 


210839 


1684.J14.gz43 263210 


F 


M00055808B:H07 


UC2-NormColon 


4449 


490414 


1684.B15.gz43_263218 


F 


M00055800A:F02 


UC2-NormColon 


4450 


503491 


1684.M15.gz43 263229 


F 


M00055812A:E01 


UC2-NormColon 


4451 


486134 


1684.D17.gz43 263252 


F 


M00055802B:H03 


UC2-NormColon 


4452 


639441 


1684.I17.gz43 263257 


F 


M00055807C:F05 | 


UC2-NormColon 
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Table 2 



sec; 

ID 

INU 


CLUSTE1 


R SEQ NAME 


ORII 
NT 


CLONE ID 


LIBRARY 


4453 


551683 


1684.N17.gz43 263262 


F 


M00055813C:E03 


UC2-NormColon 


4454 


649035 


1684.P17.gz43 263264 


F 


M00055816A:C04 


UC2-NormColon 


4455 


639194 


1684.F18.gz43 263270 


F 


M00055804B:C02 


UC2-NormColon 


4456 


454772 


1684.K19.gz43 263291 


F 


M00055809C:E11 


UC2-NormColon 


4457 


649995 


1684.L19.gz43 263292 


F 


M00055810D:G04 


UC2-NonnColon 


4458 


466016 


1684.M19.gz43 263293 


F 


M00055812B:F03 


UC2-NormColon 


4459 


494271 


1684.N19.gz43 263294 


F 


M00055813D:B12 


UC2-NormColon 


4460 


460517 


1684.J20.gz43 263306 


F 


M00055808D:C12 


UC2-NormColon 


4461 


470602 


1 684.12 l.gz43 263321 


F 


M00055807D:C04 


UC2-NormColon 


4462 


640537 


1684.B22.gz43 263330 


F 


M00055800C:E11 


UC2-NormColon 


4463 


643991 


1684.D22.gz43 263332 


F 


M00055802C:E12 


UC2-NormColon 


4464 


564382 


1684.C23.gz43 263347 


F 


M00055801D:E06 


UC2-NormColon 


4465 


529325 


1684.G23.gz43 263351 


F 


M00055805D:H01 


UC2-NormColon 


4466 


455665 


1684.L23.gz43 263356 


F 


M00055811B:A03 


UC2-NormColon 


4467 


645700 


1684.A24.gz43 263361 


F 


M00055799C:D09 


UC2-NormColon 


4468 


520320 


1684.J24.gz43 263370 


F 


M00055808D:F09 


UC2-NormColon 


4469 


648752 


1684.M24.gz43 263373 


F 


M00055812C:B07 


UC2-NormColon 


4470 


642850 


1684.024.gz43 263375 


F 


M00055815A:H12 


UC2-NormColon 


4471 


556896 


1970.H01.gz43 263397 


F 


M00054702A:C06 


UC2-ColonMetLiver 


4472 


549238 


1 970.10 l.gz43 263398 


F 


M00054703D:E07 


UC2-ColonMetLiver 


4473 


551941 


1970.M01.gz43 263402 


F 


M00054710D:A02 


UC2-ColonMetLiver 


4474 


552055 


1970.G02.gz43 263412 


F 


M00054699D:A12 


UC2-ColonMetLiver 


4475 


483141 


1970.H02.gz43 263413 


F 


M00054702A:F03 


UC2-ColonMetLiver 


4476 


561434 


1970.102.gz43__263414 


F 


M00054703D:F04 


UC2-ColonMetLiver 


4477 


453708 


1970.K02.gz43 263416 


F 


M00054707B:B08 


UC2-ColonMetLiver 


4478 


557935 


1970.A03.gz43_263422 


F 


M00054689C:B11 


UC2-ColonMetLiver 


4479 


509202 


1970.H03.gz43_263429 


F 


M00054702A:H03 


UC2-ColonMetLiver 


4480 


549388 


1970.B04.gz43 263439 


F 


M00054690D:G03 


UC2-ColonMetLiver 


4481 


551243 


1970.C05.gz43 263456 


F 


M00054693A:C09 


UC2-ColonMetLiver 


4482 


549557 


1970.N05.gz43 263467 


F 


M00054712C:C07 


UC2-ColonMetLiver 


4483 


551527 


1970.A06.gz43_263470 


F 


M00054689D:E12 


UC2-ColonMetLiver 


4484 


553128 


1970E06.gz43_263474 


F 


M00054696B:H11 


UC2-ColonMetLiver 


4485 


549151 


1970.K06.gz43 263480 


F 


M00054707C:D02 


UC2-ColonMetLiver 


4486 


32021 


1970.O06.gz43_263484 


F 


M00054714B:F05 


UC2-ColonMetLiver 


4487 


552598 


1970.P07.gz43_263501 


F 


M00054715D:E11 


UC2-ColonMetLiver 


4488 


552051 


1970.D08.gz43 263505 


F 


M00054695A:A12 


UC2-ColonMetLiver 


A A oc\ 

4489 


491827 


1970.J09.gz43_263527 


F 


M00054706A:G10 


UC2-ColonMetLiver 


4490 


552625 


1970.O09.gz43 263532 


F 


M00054714C:E01 


UC2-ColonMetLiver 


4491 


552073 


1970.Mll.gz43 263562 


F 


M00054711C:A08 


UC2-ColonMetLiver 


4492 


552649 


1970.E12.gz43 263570 


F 


M00054697A:E03 


UC2-ColonMetLiver 


4493 


389201 


1970.H12.gz43 263573 


F 


M00054702D:E07 


UC2-ColonMetLiver 


4494 


552682 


1970.112.gz43_263574 


F 


M00054704D:F02 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4495 


552686 


1970.K12.gz43_263576 


F 


M00054708A:F11 


UC2-ColonMetLiver 


4496 


559389 


1970.A14.gz43_263598 


F 


M00054690B:B02 


UC2-ColonMetLiver 


4497 


562216 


1970.E14.gz43_263602 


F 


M00054697A:G06 


UC2-ColonMetLiver 


4498 


452392 


1970H14.gz43_263605 


F 


M00054702D:F05 


UC2-ColonMetLiver 


4499 


552314 


1970.114.gz43_263606 


F 


M00054705B:C08 


UC2-ColonMetLiver 


4500 


584499 


1970.F15.gz43_263619 


F 


M00054699A:G01 


UC2-ColonMetLiver 


4501 


552418 


1970.A16.gz43_263630 


F 


M00054690B:D10 


UC2-ColonMetLiver 


4502 


551996 


1970.F16.gz43_263635 


F 


M00054699B:A05 


UC2-ColonMetLiver 


4503 


549945 


1970.H16.gz43_263637 


F 


M00054702D:H10 


UC2-ColonMetLiver 


4504 


527679 


1970.1 16.gz43_263638 


F 


M00054705B:D02 


UC2-ColonMetLiver 


4505 


391511 


1970.K16.gz43_263640 


F 


M00054708B:F04 


UC2-ColonMetLiver 


4506 


490393 


1970.P16.gz43_263645 


F 


M00054716B:D06 


UC2-ColonMetLiver 


4507 


552197 


1970.L17.gz43_263657 


F 


M00054710B:B10 


UC2-ColonMetLiver 


4508 


550975 


1970.O17.gz43_263660 


F 


M00054715A:G02 


UC2-ColonMetLiver 


4509 


498454 


1970.118.gz43_263670 


F 


M00054705B:E04 


UC2-ColonMetLiver 


4510 


553938 


1970.N18.gz43_263675 


F 


M00054713C:D07 


UC2-ColonMetLiver 


4511 


448332 


1970.C19.gz43_263680 


F 


M00054693D:A08 


UC2-ColonMetLiver 


4512 


550053 


1970.J19.gz43_263687 


F 


M00054706C:B12 


UC2-ColonMetLiver 


4513 


550330 


1970.B20.gz43_263695 


F 


M00054692B:C06 


UC2-ColonMetLiver 


4514 


450755 


1970.C20.gz43_263696 


F 


M00054693D:C04 


UC2-ColonMetLiver 


4515 


420686 


1970.B21.gz43_263711 


F 


M00054692B:D01 


UC2-ColonMetLiver 


4516 


552581 


1970.E21.gz43_263714 


F 


M00054697C:E11 


UC2-ColonMetLiver 


4517 


551995 


1970.F21.gz43_263715 


F 


M00054699C:A04 


UC2-ColonMetLiver 


4518 


521552 


1970.H22.gz43_263733 


F 


M00054703C:F01 


UC2-ColonMetLiver 


4519 


553358 


1970.J22.gz43_263735 


F 


M00054706D:B03 


UC2-ColonMetLiver 


4520 


558768 


1970.L22.gz43_263737 


F 


M00054710C:A12 


UC2-ColonMetLiver 


4521 


42994 


1970.B23.gz43_263743 


F 


M00054692C:B02 


UC2-ColonMetLiver 


4522 


555255 


1970.G23.gz43_263748 


F 


M00054701D:H05 


UC2-ColonMetLiver 


4523 


451032 


1970.O23.gz43_263756 


F 


M00054715C:D05 


UC2-ColonMetLiver 


4524 


554477 


1970.124.gz43_263766 


F 


M00054705C:D11 


UC2-ColonMetLiver 


4525 


560317 


1970.K24.gz43_263768 


F 


M00054709A:A10 


UC2-ColonMetLiver 


4526 


554900 


1993.J01.gz43_263783 


F 


M00054863D:H05 


UC2-ColonMetLiver 


4527 


560254 


1993.L01.gz43_263785 


F 


M00054866B:C01 


UC2-ColonMetLiver 


4528 


555740 


1993.E02.gz43_263794 


F 


M00054857C:G09 


UC2-ColonMetLiver 


4529 


551212 


1993.F02.gz43_263795 


F 


M00054858D:C04 


UC2-ColonMetLiver 


4530 


554456 


1993.G03.gz43_263812 


F 


M00054859D:E09 


UC2-ColonMetLiver 


4531 


493303 


1993.M03.gz43_263818 


F 


M00054867B:D02 


UC2-ColonMetLiver 


4532 


522648 


1993.P03.gz43_263821 


F 


M00054870C:C05 


UC2-ColonMetLiver 


4533 


559885 


1993.A04.gz43_263822 


F 


M00054852C:G03 


UC2-ColonMetLiver 


4534 


554257 


1993.H04.gz43_263829 


F 


M00054860D:D01 


UC2-ColonMetLiver 


4535 


554524 


1993.B05.gz43_263839 


F 


M00054854A:H06 


UC2-ColonMetLiver 


4536 


554672 


1993.C05.gz43_263840 


F 


M00054855B:E04 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEI 


* SEQ NAME 


ORIE 

NT 


CLONE ID 


LIBRARY 


4537 


560714 


1993.D05.gz43 263841 


F 


M00054856D.E11 


UC2-ColonMetLiver 


4538 


557852 


1993.G05.gz43 263844 


F 


M00054859D.G04 


UC2-ColonMetLiver 


4539 


553295 


1993.J05.gz43 263847 


F 


M00054864B:B02 


UC2-ColonMetLiver 


4540 


559027 


1993.N06.gz43 263867 


F 


M00054868C:G11 


UC2-ColonMetLiver 


4541 


548861 


1993.B07.gz43 263871 


F 


M00054854B:C04 


UC2-ColonMetLiver 


4542 


554500 


1993.E07.gz43_263874 


F 


M00054857D:E12 


UC2-ColonMetLiver 


4543 


522507 


1993.I07.gz43 263878 


F 


M00054862C:D05 


UC2-ColonMetLiver 


4544 


501534 


1993.J08.gz43 263895 


F 


M00054864B:D08 


UC2-ColonMetLiver 


4545 


556715 


1993.A09.gz43 263902 


F 


M00054852D:D09 


UC2-ColonMetLiver 


4546 


496897 


1993.C09.gz43 263904 


F 


M00054855C:B06 


UC2-ColonMetLiver 


4547 


554084 


1993.D09.gz43_263905 


F 


M00054857A:B11 


UC2-ColonMetLiver 


4548 


554497 


1993.E10.gz43_263922 


F 


M00054858A:B05 


UC2-ColonMetLiver 


4549 


555968 


1993.F10.gz43 263923 


F 


M00054859A:D10 


UC2-ColonMetLiver 


4550 


43642 


1993.K10.gz43_263928 


F 


M00054865B:H04 


UC2-ColonMetLiver 


4551 


595506 


1993.L10.gz43 263929 


F 


M00054866C:G07 


UC2-ColonMetLiver 


4552 


594434 


1993.O10.gz43 263932 


F 


M00054869D.F02 


UC2-ColonMetLiver 


4553 


555702 


1993.P10.gz43_263933 


F 


M00054870D:G08 


UC2-ColonMetLiver 


4554 


556132 


1993.All.gz43 263934 


F 


M00054853A:A10 


UC2-ColonMetLiver 


4555 


555667 


1993.Fll.gz43_263939 


F 


M00054859A:E02 


UC2-ColonMetLiver 


4556 


519109 


1993Jll.gz43_263943 


F 


M00054864C:A01 


UC2-ColonMetLiver 


4557 


450829 


1993.B12.gz43_263951 


F 


M00054854C:H02 


UC2-ColonMetLiver 


4558 


463513 


1993.M13.gz43_263978 


F 


M00054867D:H03 


UC2-ColonMetLiver 


4559 


552396 


1993.N13.gz43 263979 


F 


M00054869A:A09 


UC2-ColonMetLiver 


4560 


557077 


1993.C14.gz43_263984 


F 


M00054855D:G12 


UC2-CoIonMetLiver 


4561 


554756 


1993.D14.gz43_263985 


F 


M00054857A:G09 


UC2-ColonMetLiver 


4562 


482306 


1993.E14.gz43_263986 


F 


M00054858A:D06 


UC2-ColonMetLiver 


4563 


560259 


1993.H14.gz43 263989 


F 


M00054861B:H08 


UC2-ColonMetLiver 


4564 


402742 


1993.L14.gz43 263993 


F 


M00054866D:C03 


UC2-ColonMetLiver 


4565 


390017 


1993.P14.gz43_263997 


F 


M00054871A:C04 


UC2-ColonMetLiver 


4566 


554989 


1993.D15.gz43_264001 


F 


M00054857A:G12 


UC2-ColonMetLiver 


4567 


550714 


1993.E15.gz43_264002 


F 


M00054858A.D08 


UC2-ColonMetLiver 


4568 


562962 


1993.I15.gz43 264006 


F 


M00054863A:A11 


UC2-ColonMetLiver 


4569 


394567 


1993.K16.gz43 264024 


F 


M00054865D:F05 


UC2-ColonMetLiver 


4570 


457825 


1993.N17.gz43_264043 


F 


M00054869A:D03 


UC2-ColonMetLiver 


4571 


553072 


1993.O17.gz43_264044 


F 


M00054870B:A12 


UC2-ColonMetLiver 


4572 


522703 


1 993 .A 1 8 gz43_264046 


F 


M00054853B:E07 


UC2-ColonMetLiver 


4573 


451491 


1993.E18.gz43 264050 


F 


M00054858B:A10 


UC2-ColonMetLiver 


4574 


551811 


1993.H18.gz43 264053 


F 


M00054861C:E10 


UC2-ColonMetLiver 


4575 


599433 


1993.A19.gz43_264062 


F 


M00054853B:E10 


UC2-ColonMetLiver 


4576 


593343 


1993.D19.gz43_264065 


F 


M00054857B:D08 


UC2-ColonMetLiver 


4577 


554613 


1993.C20.gz43_264080 


F 


M00054856A:F08 


UC2-ColonMetLiver 


4578 


497365 


1993.G21.gz43 264100 


F 


M00054860C:E04 


UC2-ColonMetLiver 
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Table 2 



ID 


CLUSTE] 


SEQ NAME 


ORII 
NT 


CLONE ID 


LIBRARY 


4579 


555553 


1993.021.gz43 264108 


F 


M00054870B:G12 


UC2-ColonMetLiver 


4580 


481641 


1993.P21.gz43 264109 


F 


M00054871A:H07 


UC2-ColonMetLiver 


4581 


557974 


1993.M23.gz43 264138 


F 


M00054868B:A05 


UC2-ColonMetLiver 


4582 


557719 


1993.F24.gz43 264147 


F 


M00054859D:D02 


UC2-ColonMetLiver 


4583 


551591 


1993.L24.gz43 264153 


F 


M00054867A:G01 


UC2-ColonMetLiver 


4584 


553586 


1993.P24.gz43 264157 


F 


M00054871B:E10 


UC2-ColonMetLiver 


4585 


553983 


2018.J01.gz43 264167 


F 


M00055049B:F05 


UC2-ColonMetLiver 


4586 


557939 


2018.N01.gz43 264171 


F 


M00055053D:G04 


UC2-ColonMetLiver 


4587 


549858 


2018.P01.gz43 264173 


F 


M00055056C:A05 


UC2-ColonMetLiver 


4588 


561838 


2018.F02.gz43 264179 


F 


M00055045D:F10 


UC2-ColonMetLiver 


4589 


561438 


2018.A03.gz43 264190 


F 


M00055039A:G06 


UC2-ColonMetLiver 


4590 


447555 


2018.N03.gz43 264203 


F 


M00055054A:A10 


UC2-ColonMetLiver 


4591 


556490 


2018.I04.gz43 264214 


F 


M00055048C:C06 


UC2-ColonMetLiver 


4592 


554372 


2018.L04.gz43 264217 


F 


M00055051C:F10 


UC2-CoIonMetLiver 


4593 


557504 


2018.P04.gz43 264221 


F 


M00055056C:C05 


UC2-ColonMetLiver 


4594 


555340 


2018.B05.gz43 264223 


F 


M00055041A:C02 


UC2-ColonMetLiver 


4595 


540000 


2018.L05.gz43 264233 


F 


M00055051D:D12 


UC2-ColonMetLiver 


4596 


528775 


2018.O05.gz43 264236 


F 


M00055055B:H11 


UC2-ColonMetLiver 


4597 


553012 


2018.P05.gz43 264237 


F 


M00055056C:D07 


UC2-ColonMetLiver 


4598 


386940 


2018.H06.gz43 264245 


F 


M00055047C:D11 


UC2-ColonMetLiver 


4599 


559775 


2018.K06.gz43 264248 


F 


M00055050D:C01 


UC2-ColonMetLiver 


4600 


493261 


2018.M06.gz43 264250 


F 


M00055052D:G12 


UC2-ColonMetLiver 


4601 


555660 


2018.A07.gz43 264254 


F 


M00055039B:E02 


UC2-ColonMetLiver 


4602 


561901 


2018.C07.gz43 264256 


F 


M00055042A:D09 


UC2-ColonMetLiver 


4603 


554936 


2018.H07.gz43 264261 


F 


M00055047C:F07 


UC2-ColonMetLiver 


4604 


555010 


2018.J07.gz43 264263 


F 


M00055049C:H12 


UC2-ColonMetLiver 


4605 


560801 


2018.L07.gz43 264265 


F 


M00055051D:F01 


UC2-ColonMetLiver 


4606 


555993 


2018.A08.gz43 264270 


F 


M00055039B:G11 


UC2-ColonMetLiver 


4607 


465207 


20 1 8 H08 .gz43_264277 


F 


M00055047C:F08 


UC2-ColonMetLiver 


4608 


419153 


2018.I08.gz43 264278 


F 


M00055048D:D08 


UC2-ColonMetLiver 


4609 


562808 


2018.P08.gz43 264285 


F 


M00055056C:F04 


UC2-ColonMetLiver 


4610 


601365 


2018.A09.gz43 264286 


F 


M00055039B:H10 


UC2-ColonMetLiver 


4611 


375380 


2018.F09.gz43 264291 


F 


M00055046B:C07 


UC2-ColonMetLiver 


4612 


561216 


2018.J09.gz43 264295 


F 


M00055049D:D09 


UC2-ColonMetLiver 


4613 


498629 


2018.O09.gz43 264300 


F 


M00055055C:C09 


UC2-ColonMetLiver 


4614 


555524 


2018.B10.gz43 264303 


F 


M00055041B:D11 


UC2-ColonMetLiver 


HOlJ 




zUl5.L10.gz43 264304 


F 


M00055042B:B09 


UC2-ColonMetLiver 


4616 


555771 


2018.E10.gz43 264306 


F 


M00055045A:F03 


UC2-ColonMetLiver 


4617 


515115 


2018.O10.gz43 264316 


F 


M00055055C:D02 


UC2-ColonMetLiver 


4618 


529733 


2018 All.gz43_264318 


F 


M00055039C:B05 


UC2-ColonMetLiver 


4619 


553850 


2018.Ell.gz43 264322 


F 


M00055045A:F12 


UC2-ColonMetLiver 


4620 


557735 


2018.Gll.gz43 264324 


F 


M00055047A:E07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 

NO CLUSTER! 



SEQ NAME 



ORIEl 

NT 



CLONE ID 



LIBRARY 



4621 555798 



2018.Hll.gz43 264325 



F M00055047D:C12 | UC2-ColonMetI A™ 



4622 553108 



2018.Mll.gz43 264330 



F M00055053B:A02 | UC2-ColonM e M W~ 



4623 555708 



2018.C12.gz43 264336 



F M00055042B:E02 | UC2-ColonMetT .irer 



4624 559699 



2018.E12.gz43 264338 



F M00055045B:A04 | UC2-ColonM e fTW' 



4625 551068 



2018.K12.gz43 264344 



F M00055051A.C09I UC2-ColonMetLiver 



4626 559246 



2018.L12.gz43 264345 



F M00055052A:F07 | UC2-ColonMetLiver 



4627 557401 



2018.P12.gz43 264349 



F | M00055056D:B06 | UC2-ColonMetLiver 



4628 557525 



2018.M13.gz43 264362 



F | M00055053B:C02 | UC2-ColonMetLiver 



4629 8997 



2018.N13.gz43 264363 



F | M00055054B:E10 | UC2-ColonMetLiver 



4630 560720 



2018.E14.gz43 264370 



F | M00055045B:C08 | UC2-ColonMetLiver 



4631 558120 



2018.G14.gz43 264372 



F | M00055047A:H05 | UC2-ColonMetLiver 



4632 349744 



2018.I14.gz43 264374 



F | M00055048D:H04 1 UC2-ColonMetLiver 



4633 561259 



2018.A15.gz43 264382 



F I M00055039D:D07 1 UC2-ColonMetLiver 



4634 557326 



2018.G15.gz43 264388 



F | M00055047B:A10 | UC2-ColonMetLiver 



4635 449437 



2018.J15.gz43 264391 



F | M00055050A:D11 I UC2-ColonMrtT i™- 



4636 555340 



2018.K15.gz43 264392 



F I M00055051A:G09 1 UC2-ColonMetLiver 



4637 555172 



2018.C16.gz43 264400 



4638 289316 



2018.G16.gz43 264404 



F M00055042C B05 | UC2-ColonMetLl^r 



4639 557279 



2018.H16.gz43 264405 



F I M00055047B:B03 [ UC2-ColonMetLiver 



F | M00055048A:A04 | UC2-ColonMetLiver 



4640 556169 



2018.J16.gz43 264407 



4641 551897 



2018.K16.gz43 264408 



F | M00055050A:H08 | UC2-ColonMetLiver 



F | M00055051A:H10| UC2-ColonMetLiver 



556391 
562949 



2018.P16.gz43 264413 



F | M00055056D:H12 1 UC2-ColonMetLiver 



2018.D17.gz43 264417 



F | M00055044A:A08 | UC2-ColonMetLiver 



551371 
562292 



2018.L17.gz43 264425 



F | M00055052B:E03 | UC2-ColonMetLiver 



2018.M17.gz43 264426 



F | M00055053C.B03 | UC2-ColonMetLiver 



555082 
562272 



2018.N17.gz43 264427 



F | M00055054B.G12 | UC2-ColonMetLiver 



2018.P17.gz43 264429 



F | M00055057A:A04 | UC2-ColonMetLiver 



555883 
558389 



2018.A18.gz43 264430 



F | M00055040A.F01 | UC2-ColonMetLiver 



2018.F18.gz43 264435 



F | M00055046C:C05 | UC2-ColonMetLiver 



463341 
553002 



2018.O18.gz43 264444 



F | M00055056A:H12 | UC2-ColonMetLiver 



2018.P18.gz43 264445 



F | M00055057A:A05 | UC2-ColonMetLiver 



4652 224812 



2018.E19.gz43 264450 



4653 206098 



2018.J19.gz43 264455 



F | M00055045C:H05 | UC2-ColonMetLT^T 



F | M00055050B:E11 | UC2-ColonMetLiver 



4654 453893 



2018.H20.gz43 264469 



F | M00055048A:F04 UC2-ColonMetLiver 



4655 557308 



2018.N21.gz43 26449 1 



F | M00055054D:A02 1 UC2-ColonMetLiver 



4656 470617 



2018.B22.gz43 264495 



F | M00055041C:H12 | UC2-ColonMetLiver 



4657 555057 



2018.N22.gz43 264507 



447815 
561963 



2018.A23.gz43 2645 10 



F | M00055054D:E12 | UC2-ColonMetLiver 



F | M00055040C:G08 [ UC2-ColonMetLiver 



2018.N24.gz43 264539 



F 1 M00055055A:B03 | UC2-ColonMetLiver 



551544 
555213 



2018.P24.gz43 264541 



F | M00055057B:B01 \ UC2-ColonMetLiver 



2018.D07.gz43 26464 1 



F | M00055043B:B01 | UC2-ColonMetLiver 



4662 555010 



2018.J07.gz43 264647 



F | M00055049C:H12 | UC2-ColonMetLiver 



183 



Table 2 



SEQ 
ID 
INU 


rtT T fill \ i 

CLUSTE1 


* SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4663 


505451 


2018.P07.gz43 264653 


F 


M00055056C:E10 


UC2-ColonMetLiver 


4664 


555993 


2018.A08.gz43 264654 


F 


M00055039B:G11 


UC2-ColonMetLiver 


4665 


465207 


2018.H08.gz43 264661 


F 


M00055047C:F08 


UC2-ColonMetLiver 


4666 


557834 


2018.K08.gz43 264664 


F 


M00055050D:F04 


UC2-ColonMetLiver 


4667 


557882 


2018.L08.gz43 264665 


F 


M00055051D:G01 


UC2-ColonMetLiver 


4668 


551711 


2018.B09.gz43 264671 


F 


M00055041B:C10 


UC2-ColonMetLiver 


4669 


555524 


2018.B10.gz43 264687 


F 


M00055041B:D11 


UC2-ColonMetLiver 


4670 


446739 


2018.J10.gz43 264695 


F 


M00055049D:D10 


UC2-ColonMetLiver 


4671 


385531 


2018.P10.gz43_264701 


F 


M00055056C:H07 


UC2-ColonMetLiver 


4672 


559052 


2018.Fll.gz43_264707 


F 


M00055046B:D02 


UC2-ColonMetLiver 


4673 


553108 


2018.Mll.gz43 264714 


F 


M00055053B:A02 


UC2-ColonMetLiver 


4674 


397338 


2018.A12.gz43 264718 


F 


M00055039C.D11 


UC2-ColonMetLiver 


4675 


559699 


2018.E12.gz43 264722 


F 


M00055045B:A04 


UC2-ColonMetLiver 


4676 


553975 


2018.O12.gz43 264732 


F 


M00055055C:E08 


UC2-ColonMetLiver 


4677 


561994 


2018.B13.gz43 264735 


F 


M00055041B:F04 


UC2-ColonMetLiver 


4678 


448098 


2018.C13.gz43_264736 


F 


M00055042B.E05 


UC2-ColonMetLiver 


4679 


558254 


2018.D13.gz43 264737 


F 


M00055043D:D10 


UC2-ColoflMetLiver 


4680 


557420 


2018.F13.gz43_264739 


F 


M00055046B:E08 


UC2-ColonMetLiver 


4681 


558007 


2018.G13.gz43_264740 


F 


M00055047A:H04 


UC2-ColonMetLiver 


4682 


557525 


20 1 8 .M 1 3 gz43_264746 


F 


M00055053B:C02 


UC2-ColonMetLiver 


4683 


450765 


2018.C14:gz43_264752 


F 


M00055042B:E08 


UC2-ColonMetLiver 


4684 


247 


2018.D14.gz43 264753 


F 


M00055043D:F07 


UC2-ColonMetLiver 


4685 


560720 


2018.E14.gz43_264754 


F 


M00055045B:C08 


UC2-ColonMetLiver 


4686 


491799 


2018.F14.gz43_264755 


F 


M00055046B:F06 


UC2-ColonMetLiver 


4687 


558120 


2018.G14.gz43_264756 


F 


M00055047A:H05 


UC2-ColonMetLiver 


4688 


432970 


20 1 8.K 1 4 gz43_264760 


F 


M00055051A:E11 


UC2-ColonMetLiver 


4689 


561259 


2018.A15.gz43 264766 


F 


M00055039D:D07 


UC2-ColonMetLiver 


4690 


551630 


2018.O15.gz43_264780 


F 


M00055055D:D11 


UC2-ColonMetLiver 


4691 


461 


2018.M16.gz43_264794 


F 


M00055053C:A12 


UC2-ColonMetLiver 


4692 


555399 


2018.A17.gz43_264798 


F 


M00055040A:C02 


UC2-ColonMetLiver 


4693 


554149 


2018.B17.gz43 264799 


F 


M00055041C:C10 


UC2-ColonMetLiver 


4694 


553603 


2018.G17.gz43_264804 


F 


M00055047B:B10 


UC2-ColonMetLiver 


4695 


552641 


2018.K17.gz43_264808 


F 


M00055051B:B08 


UC2-ColonMetLiver 


4696 


562292 


2018.M17.gz43 264810 


F 


M00055053C:B03 


UC2-ColonMetLiver 


4697 


562272 


2018.P17.gz43_264813 


F 


M00055057A:A04 


UC2-ColonMetLiver 


4698 


450755 


2018.D18.gz43 264817 


F 


M00055044A:C02 


UC2-ColonMetLiver 


4699 


c c corn 

555892 


2018.E18.gz43_264818 


F 


M00055045C:F09 


UC2-ColonMetLiver 


4700 


551793 


2018.G18.gz43_264820 


F 


M00055047B:C03 


UC2-ColonMetLiver 


4701 


553002 


2018.P18.gz43_264829 


F 


M00055057A:A05 


UC2-ColonMetLiver 


4702 


487182 


2018.119.gz43_264838 


F 


M00055049A:F10 


UC2-ColonMetLiver 


4703 


206098 


2018.J19.gz43_264839 


F 


M00055050B:E11 


UC2-ColonMetLiver 


4704 


557572 


2018.K19.gz43_264840 


F 


M00055051B:D07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
1NU 


CLUSTEF 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4705 


551334 


2018.M19.gz43 264842 


F 


M00055053C:F06 


UC2-ColonMetLiver 


4706 


561046 


2018.120.gz43_264854 


F 


M00055049A:G03 


UC2-ColonMetLiver 


4707 


454664 


20 1 8L20.gz43_264857 


F 


M00055052C:B12 


UC2-ColonMetLiver 


4708 


551616 


2018.N20.gz43_264859 


F 


M00055054C:G10 


UC2-ColonMetLiver 


4709 


559294 


20 1 8P20.gz43_26486 1 


F 


M00055057A:D10 


UC2-ColonMetLiver 


4710 


548935 


2018.C21.gz43_264864 


F 


M00055042D:G03 


UC2-ColonMetLiver 


4711 


488970 


2018.F21.gz43_264867 


F 


M00055046C:E07 


UC2-ColonMetLiver 


4712 


555751 


20 1 8A22.gz43_264878 


F 


M00055040B:F02 


UC2-ColonMetLiver 


4713 


423588 


2018.C22.gz43_264880 


F 


M00055042D.H02 


UC2-ColonMetLiver 


4714 


551578 


2018.E22.gz43_264882 


F 


M00055045D:A07 


UC2-ColonMetLiver 


4715 


556171 


2018.G22.gz43_264884 


F 


M00055047B:G06 


UC2-ColonMetLiver 


4716 


451172 


2018.P22.gz43_264893 


F 


M00055057A:F10 


UC2-ColonMetLiver 


4717 


555250 


2018.B23.gz43_264895 


F 


M00055041D:B07 


UC2-ColonMetLiver 


4718 


476268 


20 1 8.G23 .gz43_264900 


F 


M00055047B:G10 


UC2-ColonMetLiver 


4719 


553349 


2018.K23.gz43_264904 


F 


M00055051C:B05 


UC2-ColonMetLiver 


4720 


551561 


201 8.023 .gz43_264908 


F 


M00055056B:G01 


UC2-ColonMetLiver 


4721 


555173 


2018.C24.gz43_264912 


F 


M00055043A:B06 


UC2-ColonMetLiver 


4722 


557382 


2018.L24.gz43_264921 


F 


M00055052D:B05 


UC2-ColonMetLiver 


4723 


555616 


2020.H01 gz43_264933 


F 


M00055081A:E08 


UC2-ColonMetLiver 


4724 


549607 


2020.J01.gz43_264935 


F 


M00055083B:E05 


UC2-ColonMetLiver 


4725 


492893 


2020L01.gz43_264937 


F 


M00055086A.B10 


UC2-ColonMetLiver 


4726 


556471 


2020E02.gz43_264946 


F 


M00055077D:B01 


UC2-ColonMetLiver 


4727 


556542 


2020.L02.gz43_264953 


F 


M00055086A:C09 


UC2-ColonMetLiver 


4728 


551283 


2020.N02.gz43_264955 


F 


M00055088D:A01 


UC2-ColonMetLiver 


4729 


558720 


2020C03.gz43_264960 


F 


M00055076A:C06 


UC2-ColonMetLiver 


4730 


555512 


2020.A04.gz43_264974 


F 


M00055073D:F06 


UC2-ColonMetLiver 


4731 


549129 


2020D04.gz43_264977 


F 


M00055077A:B07 


UC2-ColonMetLiver 


4732 


556497 


2020.J04.gz43_264983 


F 


M00055083C:C05 


UC2-ColonMetLiver 


4733 


561422 


2020.N04.gz43_264987 


F 


M00055088D:B08 


UC2-ColonMetLiver 


4734 


556408 


2020.O04.gz43 264988 


F 


M00055090C:B03 


UC2-ColonMetLiver 


4735 


556635 


2020.B05.gz43_264991 


F 


M00055075A:C09 


UC2-ColonMetLiver 


4736 


555368 


2020.C05.gz43_264992 


F 


M00055076A:D11 


UC2-ColonMetLiver 


4737 


495408 


2020.F06.gz43_265011 


F 


M00055078D:G04 


UC2-ColonMetLiver 


4738 


552361 


2020.H06.gz43_265013 


F 


M00055081B:E10 


UC2-ColonMetLiver 


4739 


556446 


2020.K06.gz43_265016 


F 


M00055084D:B01 


UC2-ColonMetLiver 


4740 


557476 


2020.L06.gz43_265017 


F 


M00055086B:D10 


UC2-ColonMetLiver 


4741 


556559 


2020.O06.gz43_265020 


F 


M00055090C:C12 


UC2-ColonMetLiver 


4742 


548943 


2020.K07.gz43_265032 


F 


M00055084D:C09 


UC2-ColonMetLiver 


4743 


555126 


2020.P07.gz43_265037 


F 


M00055091B:A07 


UC2-ColonMetLiver 


4744 


142614 


2020.E08.gz43_265042 


F 


M00055078A:C05 


UC2-ColonMetLiver 


4745 


453756 


2020.M08.gz43_265050 


F 


M00055087D:D08 


UC2-ColonMetLiver 


4746 


556040 


2020.G09.gz43_265060 


F 


M00055080B:G10 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4747 


557454 


2020.N09.gz43_265067 


F 


M00055089A:H06 


UC2-ColonMetLiver 


4748 


452822 


2O20.Cll.gz43_265088 


F 


M00055076B:E08 


UC2-ColonMetLiver 


4749 


464905 


2020.Ell.gz43_265090 


F 


M00055078A:E10 


UC2-ColonMetLiver 


4750 


509505 


2020.Fll.gz43_265091 


F 


M00055079A:H05 


UC2-ColonMetLiver 


4751 


555061 


2020.Hll.gz43_265093 


F 


M00055081C:A12 


UC2-ColonMetLiver 


4752 


556668 


2020.Jll.gz43_265095 


F 


M00055083D:D08 


UC2-ColonMetLiver 


4753 


549810 


2020.C12.gz43_265104 


F 


M00055076B:F04 


UC2-ColonMetLiver 


4754 


557760 


2020.E12.gz43_265106 


F 


M00055078A:F01 


UC2-ColonMetLiver 


4755 


554084 


2020.N12.gz43_265115 


F 


M00055089B:C01 


UC2-ColonMetLiver 


4756 


16092 


2020.P12.gz43_265117 


F 


M00055091C:B04 


UC2-ColonMetLiver 


4757 


551342 


2020.L13.gz43_265129 


F 


M00055086D:C07 


UC2-ColonMetLiver 


4758 


455820 


2020.A14.gz43_265134 


F 


M00055074B:E05 


UC2-ColonMetLiver 


4759 


555710 


2020.114.gz43_265142 


F 


M00055082D:E08 


UC2-ColonMetLiver 


4760 


497086 


2020.O14.gz43_265148 


F 


M00055090D:E03 


UC2-ColonMetLiver 


4761 


561489 


2020.E15.gz43_265154 


F 


M00055078B:F05 


UC2-ColonMetLiver 


4762 


558616 


2020.F15.gz43_265155 


F 


M00055079C:G06 


UC2-ColonMetLiver 


4763 


559071 


2020.P15.gz43_265165 


F 


M00055091C:D11 


UC2-ColonMetLiver 


4764 


550701 


2020.C16.gz43_265168 


F 


M00055076B:H06 


UC2-ColonMetLiver 


4765 


551976 


2020.G16.gz43_265172 


F 


M00055080D:A01 


UC2-ColonMetLiver 


4766 


460244 


2020.116.gz43_265174 


F 


M00055082D:G01 


UC2-ColonMetLiver 


4767 


556802 


2020.J16.gz43_265175 


F 


M00055084A:E10 


UC2-ColonMetLiver 


4768 


454910 


2020.K16.gz43_265176 


F 


M00055085B:D02 


UC2-ColonMetLiver 


4769 


557214 


2020.L16.gz43_265177 


F 


M00055086D:H07 


UC2-ColonMetLiver 


4770 


612961 


2020.F17.gz43_265187 


F 


M00055079D:A03 


UC2-ColonMetLiver 


4771 


465446 


2020.117.gz43_265190 


F 


M00055082D:H02 


UC2-ColonMetLiver 


4772 


556925 


2020J17.gz43_265191 


F 


M00055084A:F10 


UC2-ColonMetLiver 


4773 


98869 


2020.O17.gz43_265196 


F 


M00055090D:F03 


UC2-ColonMetLiver 


4774 


553372 


2020.H18.gz43_265205 


F 


M00055081C:G01 


UC2-ColonMetLiver 


4775 


558231 


2020.J18.gz43_265207 


F 


M00055084B:A04 


UC2-ColonMetLiver 


4776 


553318 


2020.G19.gz43_265220 


F 


M00055080D:E07 


UC2-ColonMetLiver 


4777 


449613 


2020.H19.gz43_265221 


F 


M00055081C:H04 


UC2-ColonMetLiver 


4778 


560507 


2020.N19.gz43_265227 


F 


M00055089C:D06 


UC2-ColonMetLiver 


4779 


560069 


2020.B20.gz43_265231 


F 


M00055075D:D05 


UC2-ColonMetLiver 


4780 


557954 


2020.C20.gz43_265232 


F 


M00055076C:H07 


UC2-ColonMetLiver 


4781 


555996 


2020.E21.gz43_265250 


F 


M00055078D:A07 


UC2-ColonMetLiver 


4782 


494625 


2020.G21.gz43_265252 


F 


M00055080D:F01 


UC2-ColonMetLiver 


4783 


554828 


2020.P21.gz43_265261 


F 


M00055091D:A03 


UC2-ColonMetLiver 


4784 


562229 


2020.A22.gz43_265262 


F 


M00055074D:B04 


UC2-ColonMetLiver 


4785 


561975 


2020.M22.gz43_265274 


F 


M00055088C:D01 


UC2-ColonMetLiver 


4786 


560080 


2020.N22.gz43_265275 


F 


M00055090A:F02 


UC2-ColonMetLiver 


4787 


557783 


2020.C23.gz43_265280 


F 


M00055076D:F11 


UC2-ColonMetLiver 


4788 


556881 


2020.K23.gz43_265288 


F 


M00055085D:F03 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4789 


557747 


2020.M23 .gz43_265290 


F 


M00055088C.D02 


UC2-ColonMetLiver 


4790 


598101 


2020.023 .gz43_265292 


F 


M00055091A:C09 


UC2-ColonMetLiver 


4791 


451185 


2020.F24.gz43_265299 


F 


M00055080A:F05 


UC2-ColonMetLiver 


4792 


552019 


2020.H24.gz43_265301 


F 


M00055082A:A12 


UC2-ColonMetLiver 


4793 


555725 


2020.K24.gz43_265304 


F 


M00055085D:G09 


UC2-ColonMetLiver 


4794 


562748 


2029.P01.gz43_265325 


F 


M00055111A:F10 


UC2-ColonMetLiver 


4795 


562989 


2029.102.gz43_265334 


F 


M00055100C:H06 


UC2-ColonMetLiver 


4796 


562243 


2029.103 .gz43_265350 


F 


M00055100D:B02 


UC2-ColonMetLiver 


4797 


562768 


2029.B06.gz43_265391 


F 


M00055093A:F07 


UC2-ColonMetLiver 


4798 


491127 


2029.D06.gz43_265393 


F 


M00055094D:F09 


UC2-ColonMetLiver 


4799 


555330 


2029.H07.gz43_265413 


F 


M00055100A:C05 


UC2-ColonMetLiver 


4800 


552704 


2029.L07.gz43_265417 


F 


M00055105D:B06 


UC2-ColonMetLiver 


4801 


561426 


2029.008. gz43_265436 


F 


M00055110A:C03 


UC2-ColonMetLiver 


4802 


551967 


2029.E09.gz43_265442 


F 


M00055096A:CH)8 


UC2-ColonMetLiver 


4803 


453533 


2029.F09.gz43_265443 


F 


M00055097B:B12 


UC2-ColonMetLiver 


4804 


394189 


2029.O09.gz43_265452 


F 


M00055110A:C05 


UC2-ColonMetLiver 


4805 


562719 


2029.F10.gz43_265459 


F 


M00055097B:F08 


UC2-ColonMetLiver 


4806 


562307 


2029.Cll.gz43_265472 


F 


M00055094B:B11 


UC2-ColonMetLiver 


4807 


98484 


2029.Dll.gz43_265473 


F 


M00055095A:D08 


UC2-ColonMetLiver 


4808 


558118 


2029.Gll.gz43_265476 


F 


M00055099A:G05 


UC2-ColonMetLiver 


4809 


562881 


2029.B14.gz43_265519 


F 


M00055093B:G08 


UC2-ColonMetLiver 


4810 


553548 


2029.K14.gz43_265528 


F 


M00055104B:F09 


UC2-ColonMetLiver 


4811 


637966 


2029.A15.gz43_265534 


F 


M00055092B:G09 


UC2-ColonMetLiver 


4812 


560575 


2029.L15.gz43_265545 


F 


M00055106A:E04 


UC2-ColonMetLiver 


4813 


554496 


2029.E16.gz43_265554 


F 


M00055096C:C03 


UC2-ColonMetLiver 


4814 


561144 


2029.J16.gz43_265559 


F 


M00055103A:H11 


UC2-ColonMetLiver 


4815 


560628 


2029.120.gz43_265622 


F 


M00055102A:E11 


UC2-ColonMetLiver 


4816 


558511 


2029.J20.gz43_265623 


F 


M00055103C:B07 


UC2-ColonMetLiver 


4817 


402488 


2029.C21.gz43_265632 


F 


M00055094C:C10 


UC2-ColonMetLiver 


4818 


556613 


2029J21.gz43_265639 


F 


M00055103C:D05 


UC2-ColonMetLiver 


4819 


560898 


2029J22.gz43_265655 


F 


M00055103C:G03 


UC2-ColonMetLiver 


4820 


454499 


2029.L22.gz43_265657 


F 


M00055106C:B06 


UC2-ColonMetLiver 


4821 


559955 


2029.K23 .gz43_265672 


F 


M00055105A:A05 


UC2-ColonMetLiver 


4822 


554233 


2030.J01.gz43_265703 


F 


M00055128A:C10 


UC2-ColonMetLiver 


4823 


557615 


2030.K01.gz43_265704 


F 


M00055128D:D04 


UC2-ColonMetLiver 


4824 


449035 


2030.C02.gz43_265712 


F 


M00055115C:G09 


UC2-ColonMetLiver 


4825 


452525 


2030.E03.gz43_265730 


F 


M00055118C:B03 


UC2-ColonMetLiver 


4826 


452775 


2030.L03.gz43_265737 


F 


M00055129D:C02 


UC2-ColonMetLiver 


4827 


560868 


2030.E05 .gz43_265762 


F 


M00055118D:B04 


UC2-ColonMetLiver 


4828 


538830 


2030.H05.gz43_265765 


F 


M00055125A:A02 


UC2-ColonMetLiver 


4829 


486683 


2030.K05.gz43_265768 


F 


M00055129A:B03 


UC2-ColonMetLiver 


4830 


488030 


2030.L06.gz43_265785 


F 


M00055129D:F11 


UC2-ColonMetLiver 
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Table 2 



SEQ| 
ID 
NO 



XUSTER 



SEQ NAME 



ORIE 
NT 



CLONE ID 



LIBRARY 



4831 1 
4832 



66678 
528369 



2Q30.M06.gz43 265 786 



M00055130D.F08 



UC2-ColonMetLiver 



2030.J07.gz43 265799 



M00055128B:B12 



UC2-ColonMetLiver 



4833 479732 



2030.G08.gz43 265812 



M00055124A:F0 



UC2-ColonMetLiver 



4834 557747 



2030.O08.gz43 265820 



M00055133B:E08 



UC2-ColonMetLiver 



4835 558534 



2030.H09.gz43 265829 



4836 312036 



M00055125B:F01 



2030.D10.gz43 265841 



UC2-ColonMetLiver 



M00055117A.G08 



UC2-ColonMetLiver 



4837 461653 



2030.J10.gz43 265847 



M00055128B:E12 



UC2-ColonMetLiver 



4838 558060 



2030Gll.gz43_265860 



M00055124B:A11 



UC2-ColonMetLiver 



4839 557610 



2030.Lll.gz43 265865 



M00055130A:D10 



UC2-ColonMetLiver 



4840 557284 



2030.Pll.gz43 265869 



M00055134C:A01 



UC2-ColonMetLiver 



4841 562714 



2030.A12.gz43 265870 



M00055113B:F02 



UC2-ColonMetLiver 



4842 453274 



2030.C12.gz43 265872 



M00055116A:C07 



UC2-ColonMetLiver 



4843 551068 



2030.P12.gz43 265885 



M00055134C:B01 



UC2-ColonMetLiver 



4844 125543 



2030.J13.gz43 265895 



M00055128B:G01 



UC2-ColonMetLiver 



4845 449454 



2030.M13.gz43_265898 



M00055131A:D07 



UC2-ColonMetLiver 



4846 562760 



2030.O1 3 gz43 265900 



M00055133C:C06 



4847 557710 



2030.P13.gz43 265901 



UC2-ColonMetLiver 



M00055134C:E09 



UC2-ColonMetLiver 



4848 526334 



2030.E14.gz43 265906 



M00055119D:F08 



UC2-ColonMetLiver 



4849 451392 



2030.O1 5 gz43 265932 



M00055133C:G07 



UC2-ColonMetLiver 



4850 549591 



2030.D16.gz43 265937 



M00055117C:C03 



UC2-ColonMetLiver 



14851 557895 



2030.H16.gz43_265941 



M00055125C:H03 



UC2-ColonMetLiver 



4852 554989 



2030.J16.gz43 265943 



M00055128C:E03 



UC2-ColonMetLiver 



4853 561265 



203Q.D17gz43_265953 



M00055117C.F02 



UC2-ColonMetLiver 



4854 558015 



2030.O17.gz43 265964 



M00055133C:H11 



UC2-ColonMetLiver 



4855 561741 



2030.A18.gz43 265966 



M00055114A:E02 



UC2-ColonMetLiver 



4856 477295 



2030.O18.gz43_265980 



M00055133D:A02 



4857 556310 



UC2-ColonMetLiver 



2030.E19.gz43_ 265986 



M00055120B:F12 



UC2-ColonMetLiver 



4858 558452 



2030.H20.gz43 266005 



M00055125D:E02 



UC2-ColonMetLiver 



4859 610269 



2030.N2 1 ■gz43_266027 



M00055132D:E07 



UC2-ColonMetLiver 



4860 561279 



2030.J22.gz43 266039 



M00055128D:C11 



4861 557719 



2030.N22.gz43_266043 



UC2-ColonMetLiver 



M00055132D:E10 



UC2-ColonMetLiver 



4862 451391 



2030.O22.gz43 266044 



M00055133D:F02 



UC2-ColonMetLiver 



4863 554737 



2030.D24.gz43_266065 



M00055118B:A09 



UC2-ColonMetLiver 



4864 452759 



2030.K24.gz43 266072 



M00055129C:H08 



UC2-ColonMetLiver 



4865 562683 



2031.A01.gz43 266078 



M00055135A:E07 



UC2-ColonMetLiver 



4866 558463 



2031.C04.gz43 266128 



M00055138A:E08 



UC2-ColonMetLiver 



4867 211273 



2031.I04.gz43 266134 



M00055146A:B12 



UC2-ColonMetLiver 



4868 468257 



2031.N04.gz43_266139 



M00055152D:C03 



UC2-ColonMetLiver 



4869 558768 



2031.B05.gz43 266 1 43 



M00055136D.D09 



UC2-ColonMetLiver 



4870 561245 



2031.G05.gz43_266148 



M00055144A:A11 



UC2-ColonMetLiver 



4871 557783 



2031.K05.gz43 266152 



M00055149B.F09 



UC2-ColonMetLiver 



4872 549930 



2031.O05.gz43 266156 



M00055154D:F06 



UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 




3 ^Ffi XT A 1V/TI7 




CLONE ID 


LIBRARY 


A Q70 

48 15 


558619 


2031.C06.gz43 266160 


F 


M00055138A:G08 


UC2-ColonMetLiver 


A Q*7 A 

4874 


450630 


2031.G06.gz43 266164 


F 


M00055144A:E09 


UC2-ColonMetLiver 


4875 


524721 


2031.O06.gz43 266172 


F 


M00055154D:G12 


UC2-ColonMetLiver 


4876 


561624 


2031.P06.gz43 266173 


F 


M00055156A:D02 


UC2-ColonMetLiver 


4877 


497493 


2031.N07.gz43 266187 


F 


M00055152D:H09 


UC2-ColonMetLiver 


487c> 


490846 


2031.M08.gz43 266202 


F 


M00055152A:B05 


UC2-ColonMetLiver 


4879 


558645 


2031.P08.gz43 266205 


F 


M00055156B:C11 


UC2-ColonMetLiver 


4880 


553969 


2031.D09.gz43 266209 


F 


M00055139B:E10 


UC2-ColonMetLiver 


A QQ 1 

4881 


558007 


2031.P09.gz43 266221 


F 


M00055156C:A08 


UC2-ColonMetLiver 


/I 000 

4882 


550047 


2031.B10.gz43 266223 


F 


M00055137A:E05 


UC2-ColonMetLiver 


A OOO 

4883 


562000 


2031.I10.gz43 266230 


F 


M00055146B:E09 


UC2-ColonMetLiver 


4884 


160289 


2031.L10.gz43 266233 


F 


M00055150D:C06 


UC2-ColonMetLiver 


4885 


27586 


2031.Cll.gz43 266240 


F 


M00055138B:H12 


UC2-ColonMetLiver 


y| 00 zr 

4886 


558230 


2031.Dll.gz43 266241 


F 


M00055139B:G03 


UC2-ColonMetLiver 


4887 


549739 


2031.M12.gz43 266266 


F 


M00055152A.E09 


UC2-ColonMetLiver 


>i 0 0 0 
4888 


561558 


203 1 .P12.gz43_266269 


F 


M00055156C:D06 


UC2-ColonMetLiver 


a oon 

4889 


556630 


203 1 .B14.gz43 266287 


F 


M00055137B:B11 


UC2-ColonMetLiver 


4890 


561 178 


2031.P15.gz43 266317 


F 


M00055156D:A02 


UC2 -ColonMetLiver 


4891 


558477 


2031.B16.gz43 266319 


F 


M00055137B:F12 


UC2 -ColonMetLiver 


4892 


557250 


2031.M16.gz43 266330 


F 


M00055152B:A03 


UC2-CoIonMetLiver 


4893 


556288 


203 1 .P 1 7.gz43_266349 


F 


M00055157A:B04 


UC2-ColonMetLiver 


A QCkA 

4894 


610893 


2031.L18.gz43 266361 


F 


M00055151A:F10 


UC2-ColonMetLiver 


a one 
4895 


557708 


2031.O24.gz43 266460 


F 


M00055155D:B02 


UC2-ColonMetLiver 


4896 


562881 


2032.E01.gz43 266466 


F 


M00055162A:B03 


UC2-ColonMetLiver 


4897 


494198 


2032.L01.gz43 266473 


F 


M00055170D:B09 


UC2-ColonMetLiver 


4898 


f fl A*l C 

551475 


2032 .00 1 .gz43_266476 


F 


M00055177A:F05 


UC2-ColonMetLiver 


4899 


558720 


2032.K02.gz43 266488 


F 


M00055169D:A11 


UC2-ColonMetLiver 


4900 


2435 


2032.E03.gz43 266498 


F 


M00055162A:C12 


UC2-ColonMetLiver 


/I Oft 1 

4901 


c cn A £. A 

559464 


2032.I04.gz43 266518 


F 


M00055166D:F02 


UC2-ColonMetLiver 


/I OA'*) 

4902 


559389 


2032.A05.gz43_266526 


F 


M00055157C:C11 


UC2-ColonMetLiver 


4903 


559562 


2032.H05.gz43 266533 


F 


M00055165B:G09 


UC2-ColonMetLiver 


A C\f\ A 

4904 


402799 


2032 . M05 .gz43_2665 3 8 


F 


M00055172A:C09 


UC2-ColonMetLiver 


A OA? 

4905 


557316 


2032.J06.gz43 266551 


F 


M00055168B:F11 


UC2-ColonMetLiver 


4906 


557209 


2032.K06.gz43_266552 


F 


M00055170A:A09 


UC2-ColonMetLiver 


4907 


CCA /: (~\ A 

554604 


2032. M06.gz43 266554 


F 


M00055172A:F03 


UC2-ColonMetLiver 


4908 


c c on 1 *7 

558917 


2032.H07.gz43_266565 


F 


M00055165C:H08 


UC2-ColonMetLiver 


4909 


493261 




r 


MU0053 1 o /A: A02 


UC2-ColonMetLiver 


4910 


558755 


2032.K07.gz43_266568 


F 


M00055170A:A11 


UC2-ColonMetLiver 


4911 


558900 


2032.M09.gz43_266602 


F 


M00055172B:B04 


UC2-ColonMetLiver 


4912 


553537 


2032.C10.gz43 266608 


F 


M00055160C:D02 


UC2-ColonMetLiver 


4913 


561413 


2032.E10.gz43_266610 


F 


M00055162B:B04 


UC2-ColonMetLiver 


4914 


561876 


2032.F10.gz43 266611 


F 


M00055163B:F07 


UC2-ColonMetLiver 
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Table 2 



LU 


r»T T TPTT7T 

LLUSTEf 


1 SEQ NAME 


OKIE 

n r 


CLONE ID 


LIBRARY 


4915 


553797 


2032.H10.gz43_266613 


F 


M00055165D:C03 


UC2-ColonMetLiver 


4916 


558965 


2032.Gll.gz43__266628 


F 


M00055164C:C10 


UC2-ColonMetLiver 


4917 


561911 


2032:B12.gz43 266639 


F 


M00055159B:G09 


UC2-ColonMetLiver 


4918 


557961 


2032.1 12.gz43_266646 


F 


M00055167B:A08 


UC2-ColonMetLiver 


4919 


558022 


2032.K12.gz43 266648 


F 


M00055170B:B06 


UC2-ColonMetLiver 


4920 


409262 


2032.M12.gz43 266650 


F 


M00055172B:H07 


UC2-ColonMetLiver 


4921 


415538 


2032.115.gz43_266694 


F 


M00055167B:H07 


UC2-ColonMetLiver 


4922 


559057 


2032.M15.gz43_266698 


F 


M00055172D:D04 


UC2-ColonMetLiver 


4923 


215005 


2032.B16.gz43_266703 


F 


M00055159C:B02 


UC2-ColonMetLiver 


4924 


556542 


2032.M16.gz43_266714 


F 


M00055172D:D07 


UC2-ColonMetLiver 


4925 


562989 


2032.O17.gz43_266732 


F 


M00055177D:F07 


UC2-ColonMetLiver 


4926 


394772 


2032.A19.gz43_266750 


F 


M00055158D:C01 


UC2-ColonMetLiver 


4927 


553877 


2032.E19.gz43_266754 


F 


M00055162C:E12 


UC2-ColonMetLiver 


4928 


558858 


2032.119.gz43_266758 


F 


M00055167D.B05 


UC2-ColonMetLiver 


4929 


559355 


2032.J19.gz43_266759 


F 


M00055169B:F04 


UC2-ColonMetLiver 


4930 


551693 


2032.M19.gz43_266762 


F 


M00055172D:F12 


UC2-ColonMetI^iver 


4931 


323165 


2032.E20.gz43_266770 


F 


M00055162C:G03 


UC2-ColonMetLiver 


4932 


558981 


2032.O20.gz43_266780 


F 


M00055178A:C07 


UC2-ColonMetLiver 


4933 


561507 


2032.C21.gz43_266784 


F 


M00055161A:C02 


UC2-ColonMetLiver 


4934 


555883 


2032.E22.gz43_266802 


F 


M00055162D:B01 


UC2-ColonMetLiver 


4935 


559088 


2032.O22.gz43 266812 


F 


M00055178A:D03 


UC2-ColonMetLiver 


4936 


558395 


2032.C23.gz43_266816 


F 


M00055161A:E05 


UC2-ColonMetLiver 


4937 


447386 


2032.D23.gz43_266817 


F 


M00055161D:H03 


UC2-ColonMetLiver 


4938 


450566 


2032.L23.gz43_266825 


F 


M00055171C:C01 


UC2-ColonMetLiver 


4939 


561918 


2032.E24.gz43_266834 


F 


M00055162D:G04 


UC2-ColonMetLiver 


4940 


559389 


2032.G24.gz43_266836 


F 


M00055165A:F05 


UC2-ColonMetLiver 


4941 


555202 


2032.M24.gz43_266842 


F 


M00055174A:H12 


UC2-ColonMetLiver 


4942 


559752 


2032.N24.gz43_266843 


F 


M00055176D:H01 


UC2-ColonMetLiver 


4943 


478511 


2041.B01.gz43_266847 


F 


M00055181B:A10 


UC2-ColonMetLiver 


4944 


456517 


2041.D01.gz43_266849 


F 


M00055183A:C06 


UC2-ColonMetLiver 


4945 


557895 


2041.H01.gz43_266853 


F 


M00055187D:G11 


UC2-ColonMetLiver 


4946 


561975 


2041 .A02.gz43_266862 


F 


M00055179A:G08 


UC2-ColonMetLiver 


4947 


557714 


2041.D02.gz43_266865 


F 


M00055183A:E10 


UC2-ColonMetLiver 


4948 


550874 


2041.K02.gz43_266872 


F 


M00055192C:E04 


UC2-ColonMetLiver 


4949 


539142 


2041.L02.gz43_266873 


F 


M00055193C:C11 


UC2-ColonMetLiver 


4950 


549911 


2041.M02.gz43_266874 


F 


M00055194D:C05 


UC2-ColonMetLiver 


a nc i 

4951 


561830 


2041.N02.gz43_266875 


F 


M00055195C:H05 


UC2-ColonMetLiver 


4952 


479851 


2041.A03.gz43_266878 


F 


M00055179A:H11 


UC2-ColonMetLiver 


4953 


549472 


204 1 .B03 .gz43_266879 


F 


M00055181B:E06 


UC2-ColonMetLiver 


4954 


561687 


2041.F03.gz43_266883 


F 


M00055185D:A02 


UC2-ColonMetLiver 


4955 


504880 


2041.L03.gz43_266889 


F 


M00055193C:E10 


UC2-ColonMetLiver 


4956 


558212 


2041.E04.gz43_266898 


F 


M00055184C:C07 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ED 
NO 


CLUSTEE 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


4957 


559050 


2041A05.gz43_266910 


F 


M00055179B:D05 


UC2-ColonMetLiver 


4958 


554520 


2041.C05.gz43_266912 


F 


M00055182A.D07 


UC2-ColonMetLiver 


4959 


409612 


2041.E05.gz43_266914 


F 


M00055184C:D02 


UC2-ColonMetLiver 


4960 


488613 


2041.J05.gz43_266919 


F 


M00055191B:A10 


UC2-ColonMetLiver 


4961 


558326 


204 1 .K05 .gz43_266920 


F 


M00055192C.H06 


UC2-ColonMetLiver 


4962 


555394 


2041F06.gz43_266931 


F 


M00055185D:D11 


UC2-ColonMetLiver 


4963 


559610 


204 1 .H06.gz43_266933 


F 


M00055188A:G11 


UC2-ColonMetLiver 


4964 


560080 


2041.106.gz43_266934 


F 


M00055189B:B12 


UC2-ColonMetLiver 


4965 


555742 


2041.N06.gz43_266939 


F 


M00055195D:B10 


UC2-ColonMetLiver 


4966 


460727 


2041.P06.gz43_266941 


F 


M00055198A:E05 


UC2-ColonMetLiver 


4967 


476199 


2041.B07.gz43_266943 


F 


M00055181C:B07 


UC2-ColonMetLiver 


4968 


516484 


204 1 .G07.gz43_266948 


F 


M00055187A:F02 


UC2-ColonMetLiver 


4969 


187704 


204 1 ,H07.gz43_266949 


F 


M00055188A:H10 


UC2-ColonMetLiver 


4970 


559000 


2041.A08.gz43_266958 


F 


M00055179B:G07 


UC2-ColonMetLiver 


4971 


553002 


2041.B08.gz43_266959 


F 


M00055181C:B12 


UC2-ColonMetLiver 


4972 


226324 


2041.C08.gz43_266960 


F 


M00055182B:C07 


UC2-ColonMetLiver 


4973 


562840 


2041 D08.gz43_266961 


F 


M00055183C:A02 


UC2-ColonMetLiver 


4974 


561513 


204 1 E08 .gz43_266962 


F 


M00055184C:F01 


UC2-ColonMetLiver 


4975 


477046 


2041.F08.gz43_266963 


F 


M00055185D.F07 


UC2-ColonMetLiver 


4976 


492627 


2041G08.gz43_266964 


F 


M00055187A:F06 


UC2-ColonMetLiver 


4977 


559262 


2041H08.gz43_266965 


F 


M00055188B:E06 


UC2-ColonMetLiver 


4978 


561513 


204 1 B09.gz43_266975 


F 


M00055181C:C09 


UC2-ColonMetLiver 


4979 


556632 


2041 C09.gz43_266976 


F 


M00055182B:F05 


UC2-ColonMetLiver 


4980 


552629 


2041.G09.gz43_266980 


F 


M00055187A.G02 


UC2-ColonMetLiver 


4981 


549304 


2041.M09.gz43_266986 


F 


M00055195A:B08 


UC2-ColonMetLiver 


4982 


550652 


204 1 N09.gz43_266987 


F 


M00055195D:E11 


UC2-ColonMetLiver 


4983 


490154 


2041P09.gz43_266989 


F 


M00055198B:H08 


UC2-CoIonMetLiver 


4984 


559383 


2041.A10.gz43_266990 


F 


M00055179C:F11 


UC2-ColonMetLiver 


4985 


502343 


2041.F10.gz43_266995 


F 


M00055185D:H01 


UC2-ColonMetLiver 


4986 


413915 


2041 I10.gz43_266998 


F 


M00055189C:C01 


UC2-ColonMetLiver 


4987 


495591 


2041.L10.gz43_267001 


F 


M00055194A:A01 


UC2-ColonMetLiver 


4988 


481231 


2041.O10.gz43_267004 


F 


M00055196D:F07 


UC2-ColonMetLiver 


4989 


559710 


2041.All.gz43_267006 


F 


M00055179C:H02 


UC2-ColonMetLiver 


4990 


562569 


2041.Bll.gz43_267007 


F 


M00055181C:D06 


UC2-ColonMetLiver 


4991 


552629 


2041.Fll.gz43_267011 


F 


M00055186A:D04 


UC2-ColonMetLiver 


4992 


560003 


2041 Kll.gz43_267016 


F 


M00055193A:A08 


UC2-ColonMetLiver 


4993 


550704 


2041.Mll.gz43_267018 


F 


M00055195A:C10 


UC2-ColonMetLiver 


4994 


562569 


2041.01 l.gz43_267020 


F 


M00055196D:H02 


UC2-ColonMetLiver 


4995 


561963 


2041.D12.gz43_267025 


F 


M00055183C:D07 


UC2-ColonMetLiver 


4996 


557867 


2041.F12.gz43_267027 


F 


M00055186A:E08 


UC2-ColonMetLiver 


4997 


143218 


2041.M12.gz43_267034 


F 


M00055195A:E07 


UC2-ColonMetLiver 


4998 


495074 


2041.N12.gz43_267035 


F 


M00055196A:C04 


UC2-ColonMetLiver 
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Table 2 



NO 


HT TTCTEI 


< aEC^ NAME 


URlfc 
IN 1 


CLONE ID 


LIBRARY 


4999 


560213 


2041.J13.gz43 267047 


F 


M00055191D:C05 


UC2-ColonMetLiver 


5000 


558413 


2041.L13.gz43 267049 


F 


M00055194A:E07 


UC2-ColonMetLiver 


5001 


551977 


2041.N13.gz43_267051 


F 


M00055196A:E01 


UC2-ColonMetLiver 


5002 


558582 


2041.E14.gz43_267058 


F 


M00055185A:G11 


UC2-ColonMetLiver 


5003 


549665 


2041.H14.gz43 267061 


F 


M00055188C:F08 


UC2-ColonMetLiver 


5004 


451544 


2041.K14.gz43 267064 


F 


M00055193A:C06 


UC2-ColonMetLiver 


5005 


450623 


2041.M14.gz43_267066 


F 


M00055195B.B09 


UC2-ColonMetLiver 


5006 


551855 


2041.G15.gz43 267076 


F 


M00055187C:C02 


UC2-ColonMetLiver 


5007 


2284 


2041.I15.gz43 267078 


F 


M00055190A:A05 


UC2-ColonMetLiver 


5008 


559883 


2041.J15.gz43 267079 


F 


M00055192A:A09 


UC2-ColonMetLiver 


5009 


614369 


204 1 M 1 5 .gz43_267082 


F 


M00055195B:C04 


UC2-ColonMetLiver 


5010 


553237 


2041.N15.gz43_267083 


F 


M00055196A:H07 


UC2-ColonMetLiver 


5011 


559883 


2041E16.gz43_267090 


F 


M00055185B:B01 


UC2-ColonMetLiver 


5012 


481136 


2041.F16.gz43_267091 


F 


M00055186C:A02 


UC2-ColonMetLiver 


5013 


559885 


2041.O16.gz43_267100 


F 


M00055197B:A10 


UC2-ColonMetLiver 


5014 


551415 


2041.P16.gz43 267101 


F 


M00055198D:A12 


UC2-ColonMetLiver 


5015 


561351 


2041 B17.gz43_267103 


F 


M00055181C:H01 


UC2-ColonMetLiver 


5016 


556881 


2041.C17.gz43_267104 


F 


M00055182C:E09 


UC2-ColonMetLiver 


5017 


549588 


2041.G17.gz43_267108 


F 


M00055187C:E07 


UC2-ColonMetLiver 


5018 


365634 


2041.N17.gz43_267115 


F 


M00055196B:A09 


UC2-ColonMetLiver 


5019 


556343 


2041 P17.gz43_267117 


F 


M00055198D:B08 


UC2-ColonMetLiver 


5020 


624044 


2041 K18.gz43_267128 


F 


M00055193B:A08 


UC2-ColonMetLiver 


5021 


562336 


2041N18.gz43_267131 


F 


M00055196B:C06 


UC2-ColonMetLiver 


5022 


557935 


2041.D19.gz43 267137 


F 


M00055184A:G02 


UC2-ColonMetLiver 


5023 


557298 


2041.G19.gz43 267140 


F 


M00055187D:A08 


UC2-ColonMetLiver 


5024 


559146 


2041.H19.gz43 267141 


F 


M00055188D.D05 


UC2-ColonMetLiver 


5025 


455814 


2041 I19.gz43_267142 


F 


M00055190A:F11 


UC2-ColonMetLiver 


5026 


549829 


2041 .A20.gz43_267 150 


F 


M00055181A:E01 


UC2-ColonMetLiver 


5027 


473742 


2041.B21.gz43 267167 


F 


M00055181D:D05 


UC2-ColonMetLiver 


5028 


559333 


2041.C21.gz43_267168 


F 


M00055182D:E06 


UC2-ColonMetLiver 


5029 


559825 


2041.H21.gz43_267173 


F 


M00055188D:H03 


UC2-ColonMetLiver 


5030 


625988 


2041 O21.gz43_267180 


F 


M00055197C:D10 


UC2-Co!onMetLiver 


5031 


491260 


2041. P21.gz43_267 181 


F 


M00055198D:G03 


UC2-ColonMetLiver 


5032 


557644 


2041.A22.gz43 267182 


F 


M00055181A:G02 


UC2-ColonMetLiver 


5033 


411113 


2041.E22.gz43_267186 


F 


M00055185C:B01 


UC2-ColonMetLiver 


5034 


557961 


2041.C23.gz43_267200 


F 


M00055182D:H08 


UC2-ColonMetLiver 






204 1 . A24.gz43_2672 14 


F 


M00055181A:H01 


UC2-ColonMetLiver 


5036 


558463 


2041 .H24.gz43_26722 1 


F 


M00055189A:C11 


UC2-ColonMetLiver 


5037 


560859 


2041.124.gz43_267222 


F 


M00055190C:G08 


UC2-ColonMetLiver 


5038 


456697 


2041.N24.gz43_267227 


F 


M00055196C:B07 


UC2-ColonMetLiver 


5039 


559127 


2054.E01.gz43_267234 


F 


M00055288B:D01 


UC2-ColonMetLiver 


5040 


559460 


2054.L01.gz43_267241 


F 


M00055300C:F11 


UC2-ColonMetLiver 
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Table 2 

"seqI 



ID 

NO ICLUSTEI 



SEQ NAME 



ORIE 
NT 



CLONE ID 



LIBRARY 



5041 1 624133 
5042 



2054.M03.gz43 267274 



M00055302B:B10 | UC2-ColonMetLiver 



1079 | 2054.M05.gz43 267306 



M00055302B:F07 | UC2-ColonMetLiver 



5043 
5044 



559113 | 2054.A07.gz43 267326 



M00055282A:A01 I UC2-ColonMetT .iW 



500337 | 2054.J13.gz43 26743 1 



M00055297D:C02 | UC2-ColonMetLiver 



5045| 558730 | 2054.E14.gz43 267442 



M00055288D:A03 | UC2-ColonMetLiver 



5046| 561779 | 2054.I14.gz43 267446 



M00055296A:C05 | UC2-ColonMetLiver 



5047| 397581 | 2054.H17.gz43 267493 



M00055294B:C03 I UC2-ColonMetLiver 



5048| 560420 | 2054.O1 8 .gz43 267516 



M00055305C:D08 | UC2-ColonMetLiver 



50491 559728 | 2054.C19.gz43 267520 



M00055286A:H08 1 UC2-ColonMetLiver 



50501 559531 | 2054.H20.gz43_267541 



M00055294B:G01 | UC2-ColonMetLiver 



5051| 23961 | 2054.K20.gz43 267544 



M00055300A:B06 UC2-ColonMetLiver 



50521 558446 | 2054.12 1 .gz43 267558 



50531 562236 | 2055 .KOI .gz43 267624 



M00055296C:E08 UC2-ColonMetLiver 



M00055323D:A12 1 UC2-ColonMetLiver 



50541 559776 | 2055.N01.gz43 267627 



M00055330D:H12 UC2-ColonMetLiver 



5055 1 559574 | 2055.J04.gz43 267671 



5056| 558890 1 2055.105 .gz43 267686 



M00055322C:G11 | UC2-ColonMetLiver 



5057| 552673 | 2055.J05.gz43 267687 



M00055321B:B10 | UC2-ColonMetLiver 



M00055322D:A01 UC2-ColonMetLiver 



5058| 446900 | 2055.J07.gz43 2677 1 9 



M00055322D:C12 1 UC2-ColonMetLiver 



5059| 559112 | 2055.C08.gz43 267728 



M00055312A:D11 | UC2-ColonMetLiver 



50601 440707 | 2055.C09.gz43 267744 



5061 1 555359 | 2055.F12.gz43 267795 



M00055312A:E10 | UC2-ColonMetLiver 



M00055317C:D04 UC2-ColonMetLiver 



5062| 472188 | 2055.P12.gz43 267805 



M00055335D:A03 1 UC2-ColonMetLiver 



5063| 617813 | 2055.E13.gz43 267810 



M00055315C:A09 UC2-ColonMetLiver 



5064 1 559027 | 2055.113.gz43_267814 



5065| 448677 | 2055.K13.gz43 267816 



M00055321D:C12 | UC2-ColonMetLiver 



M00055324CH10 UC2-ColonMetLiver 



5066| 555277 | 2055.K15.gz43 267848 



5067 | 560538 | 2055.O16.gz43_267868 



M00055324D:B02 1 UC2-ColonMetLiver 



M00055334CE11 UC2-ColonMetLiver 



5068| 560369 | 2055.D17.gz43 267873 



M00055313D:E10 1 UC2-ColonMetLiver 



50691 559794 | 2055.G19.gz43_267908 



M00055319B:H06 UC2-ColonMetLiver 



5070| 138470 | 2055.B20.gz43._267919 



M00055310B:E02 | UC2-ColonMetLiver 



5071| 558105 | 2055.H20.gz43 267925 



M00055320D:E09 UC2-ColonMetLiver 



5072| 473343 | 2055.K21.gz43 267944 



M00055325A:E12 | UC2-ColonMetLiver 



5073| 554176 | 2055.O2 l.gz43__267948 



M00055334D:G07 1 UC2-ColonMetLiver 



5074| 477757 | 2055.E22.gz43 267954 



M00055316B:B10 | UC2-ColonMetLiver 



5075 1 553709 | 2055.G22.gz43 267956 



M00055319C.C03 | UC2-ColonMetLiver 



5076| 554585 | 2055.K22.gz43 267960 



M00055325A.H02 UC2-ColonMetLiver 



5077| 451429 | 2055.N22.gz43 267963 



M00055333C.F12 UC2-ColonMetLiver 



5078| 551811 | 2055.A23.gz43 267966 



M00055308D;C09 1 UC2-ColonMetLiver 



5079| 62458 | 2055.C23.gz43 267968 



M00055312D:A09 UC2-ColonMetLiver 



50801 551912 | 2055.D23.gz43 267969 



M00055314B:G07 1 UC2-ColonMetLiver 



5081 1 552857 | 2055.G23.gz43_267972 



M00055319C:C07 UC2-ColonMetLiver 



5082| 555349 | 2055.N23.gz43 267979 



M00055333C:H07 UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 

NO 


CLUSTE1 


? SEO NAMF 


ORTF 
NT 


CivUJNlr: ID 


LIBRARY 


50$n 


56095Q 


zUjo.rUl .gz43 zooUU3 


F 


M00055343D:C12 


UC2-ColonMetLiver 


5084 


55006** 


ZUDo.OUl.gz4 3 zooUlw 


F 


M0005 5 344D: A09 


UC2-ColonMetLiver 


5085 


560544 


ZU3o.lYlUl.gZ4o zooUlu 


F 


M00055352B:E06 


UC2-ColonMetLiver 


5086 


561 i na 
jo 1 lUo 


zU3o.FUl.gz4 3 zoo013 


F 


M00055356C:H02 


UC2-ColonMetLiver 


5087 
j wo / 


5 19014 


ZUJO.ljUz.gZ4j ZOoUzj 


F 


M00055351B:H12 


UC2-ColonMetLiver 


JV/OO 


697186 


9n^A r^ni n^Ai 'y&Qnii 

ZUJO.UUj.gZ4 J ZOoUj3 


T7 

r 


M00055341C:F04 


UC2-ColonMetLiver 


5089 


561 194 

J W 1 X Z*T 


9056 T7H1 rr-T41 9A°.n.'2/1 
ZUjO.XlUj.gZH-j ZOoUj4 


17 

r 


M00055342C:H06 


UC2-ColonMetLiver 


5090 


41 8489 


ZUjO.rlUj.gZ4j ZOoU3 / 


r 


M0005 5 3 46 A :E 1 2 


UC2-ColonMetLiver 


5091 


558569 


9056 A 04 tr-741 ")&Q.(\A£. 
ZwjO./\U t KgZ4j Z0oU40 


T7 

r 


M00055336D:B03 


UC2-ColonMetLiver 


5092 


551919 


9056 R04 *yt41 "5£ftfi/i"7 
ZUjO.tJU4.gZ4j ZOoU4 / 


T7 

r 


M00055338C:F03 


UC2-ColonMetLiver 


509^ 


55966Q 


9056 T\f\A ct'wA'X O^ftA/IO 
ZUJO.J_Jw4.gZ4j ZOoU4y 


T? 

r 


M00055341C:G12 


UC2-ColonMetLiver 


5094 


559857 


9056 T04 rr-y41 9£8n^ 
ZUJO.JU^f .gZ4J ZOoUjj 


X7 

r 


M00055348D:A01 


UC2-ColonMetLiver 


5095 


557179 


ZU J 0.1VlU4.gZ4 j zOoUjci 


r 


X Mf\f\f\ C C ^> P ^ T-\ T V n ^ 

M00055352B:H05 


UC2-CoIonMetLiver 


5096 


5581 89 


ZUJO.INU4.gZ4j ZOoUjy 


F 


M00055354A:A01 


UC2-ColonMetLiver 


5097 


559410 


ZUjO.I3Uj.gZ4 J ZOoUOj 


T7 

F 


M00055338C:G04 


UC2-ColonMetLiver 


5098 


560919 

JUU7JZ 


9056 F05 rrrrAI 0^9/1/^*7 

ZUJO.rUJ .gZ4j ZOoUO / 


T7 

r 


\f AAAf CO jITTV /™v <^ 

M00055343D:Cj03 


UC2-ColonMetLiver 


5099 


550040 


ZUJO.VjUJ.gZ4j ZOoUOo 


17 

r 


M00055345A:D05 


UC2-ColonMetLiver 


5100 


440405 


ZUJO.^UO.gZ4j_ZOoUoU 


1? 

r 


M0005 5340B : C03 


UC2-ColonMetLiver 


5101 


558760 


ZU J O.VJUO.gZ4 j ZOoUo4 


17 

F 


H if AAACO i( f T> t^* r\ ^> 

M00055345B:B03 


UC2-ColonMetLiver 


51 09 

J 1 WZ 


55691 6 
jjoz i o 


ZU JO.JvUO.gZ4j zooUoo 


F 


M00055350A:F01 


UC2-ColonMetLiver 


510^ 

J X Wj 


5515Q1 


lV/fOX /*-»/l'7 l/CQAAA 

zujo.JViuo.gz4j zoouyu 


17 

F 


M00055352C:A07 


UC2-ColonMetLiver 


51 04 

J X WH- 


561 094 


zUjo.t>U/.gz4j zooU9j 


F 


M00055338C:H06 


UC2-ColonMetLiver 


510*, 


jOUOjz 


zUjo.CU7.gz43 268096 


F 


M00055340B:E06 


UC2-ColonMetLiver 


5 1 06 


55Q9QA 


zujo.ru /.gz43 zooUyy 


F 


M00055343D:H04 


UC2-ColonMetLiver 


5107 
j i w i 


5601 74 


zUjO.rU /.gZ4j ZOolUy 


17 

F 


M00055356D:C1 1 


UC2-ColonMetLiver 


5108 

J 1 Wo 


559157 


on^A nn<i r***Ai o/cqi n 
ZU JO.UUo.gZ4j zoo 113 


17 

F 


M00055341D:H07 


UC2-ColonMetLiver 


5109 

J X W 37 


6971 10 

DZ / 1J7 


ZUJO.JlUo.gZ4j ZOoU4 


17 

F 


M00055342D:F07 


UC2-ColonMetLiver 


5110 


500505 


zujo.riUo.gz4j zooll/ 


17 

F 


MAAAf CI il/Tl /^v ^ 

M00055346B:G03 


UC2-CoIonMetLiver 


5111 
Jin 


560160 


9n^^ r^no ^^/iq o/cetoo 

ZUJO.l_,Uy.gZ4j zOolzo 


17 

F 


M00055340C:D01 


UC2-ColonMetLiver 


5112 


559175 


9056 Cl(\Q citA'X 0^8171 
ZUJO.OUV.gZ4J zooIjz 


17 

r 


Mu0055345B:F03 


UC2-ColonMetLiver 


51 n 


440097 


9056 \CC\Q rr-r/ll OAfil'JiC 

zujo.is.uy.gz4j zooljo 


17 

F 


M00055350B:B12 


UC2-ColonMetLiver 


5114 
jut 


440751 


ZUJO.!'lU.gZ4j ZOcSIj/ 


F 


M0005 5 3 5 7 A: A09 


UC2-ColonMetLiver 


5115 

J X x j 


560170 

JOuj /W 


ZUjO.L 1 1 .gZ4j ZOoloU 


F 


M00055340D:D0 1 


UC2-ColonMetLiver 


5116 

J 1 X w 


561 1 94 


9056 "PI 1 citAI i/;fiio 
ZUJO.X2fll.gZ4j ZOOlOZ 


17 

r 


H iTA AAC CI jl 1T\ t Tr\ £T 

M00055342D:H05 


UC2-ColonMetLiver 


51 17 


540041 


9056 Til rrrr/l'J O/IQUl 

ZUjO.Jll.gZ4j ZOolO/ 


17 

r 


M00055349A:F07 


UC2-ColonMetLiver 


51 18 

J X X o 


560605 


9056 1^11 rr-^41 
ZUJO.JVll.gZ4j ZOolOo 


T7 

r 


Ik/fAAACCTCATI T^rv/ , v 

MU0055350B:D09 


UC2-ColonMetLiver 


5119 


471712 


2056.N1 l.gz43_268171 


F 


M00055'?54A*rr1 1 

iTlvV/uJJJJtA.U X X 


T Tf^9-i^/^1rml\j1WT it/or 

ij\^z-i^oioiuvieiJLivcr 


5120 


448285 


2056.F12.gz43_268179 


F 


M00055344A:G11 


UC2-ColonMetLiver 


5121 


561068 


2056.N12.gz43_268187 


F 


M00055354A:H08 


UC2-CoIonMetLiver 


5122 


562883 


2056,B13.gz43_268191 


F 


M00055339B:D10 


UC2-ColonMetLiver 


5123 


560717 


2056.C13.gz43 268192 


F 


M00055340D:F08 


UC2-ColonMetLiver 


5124 


560199 


2056.E13.gz43 268194 


F 


M00055343A:C09 


UC2-ColonMetLiver 
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# 



Table 2 



^L<^ 
ID 
lSVJ 


t t TCTCD 

LLUS ILK 


CCA XT A TV /ft? 

aLvJ iNAMfc, 


yj ivii_< 
NT 


LLUJNL ID 


T TDD A DV 

LIBRARY 


5125 


618311 


205 6 .F 1 3 .gz43_268 1 95 


F 


M00055344B:A12 


UC2-ColonMetLiver 


5126 


455379 


2056.M13.gz43_268202 


F 


M00055353B:B09 


UC2-ColonMetLiver 


5127 


471272 


2056.0 1 3 .gz43_268204 


F 


M00055356A:B06 


UC2-ColonMetLiver 


5128 


562576 


2056.D14.gz43_268209 


F 


M00055342A:E08 


UC2-ColonMetLiver 


5129 


626061 


2056.O14.gz43_268220 


F 


M00055356A:D04 


UC2-ColonMetLiver 


5130 


558867 


2056.A15.gz43_268222 


F 


M00055337D:B10 


UC2-ColonMetLiver 


5131 


553875 


2056.K15.gz43_268232 


F 


M00055350C:G11 


UC2-ColonMetLiver 


5132 


556365 


2056.O15.gz43_268236 


F 


M00055356A:D09 


UC2-ColonMetLiver 


5133 


453004 


2056.A16.gz43_268238 


F 


M00055338A:A02 


UC2-ColonMetLiver 


5134 


558653 


2056.C16.gz43_268240 


F 


M00055341B:A01 


UC2-ColonMetLiver 


5135 


560868 


2056.E16.gz43_268242 


F 


M00055343A:G03 


UC2 -ColonMetLiver 


5136 


551305 


2056.G16.gz43_268244 


F 


M00055345D:A04 


UC2 -ColonMetLiver 


5137 


559004 


2056.Hl6.gz43_268245 


F 


M00055347A:C03 


UC2 -ColonMetLiver 


5138 


238146 


2056.N16.gz43_268251 


F 


M00055354C:C 12 


UC2 -ColonMetLiver 


5139 


559971 


2056.A17.gz43_268254 


F 


M00055338A:A03 


UC2 -ColonMetLiver 


5140 


559963 


2056.1 17.gz43_268262 


F 


H gf\f\f\ C C *^ A O T"» A /~V^ 

M00055348B:A02 


UC2 -ColonMetLiver 


5141 


549781 


2056.O17.gz43268268 


F 


M00055356B:B04 


UC2 -ColonMetLiver 


5142 


491240 


2056.F18.gz43_268275 


F 


M00055344C:C08 


UC2-ColonMetLiver 


5143 


559087 


2056.G18.gz43_268276 


F 


M00055345D:D01 


UC2-ColonMetLiver 


5144 


103123 


2056.118.gz43_268278 


F 


M00055348B:B05 


UC2-CoIonMetLiver 


5145 


426698 


2056.M18.gz43_268282 


F 


M00055353C:A05 


UC2 -ColonMetLiver 


5146 


560254 


2056.C19.gz43_268288 


F 


M00055341B:C07 


UC2-ColonMetLiver 


5147 


403419 


2056.D19.gz43_268289 


F 


M00055342B:C08 


UC2-ColonMetLiver 


5148 


400047 


2056.E19.gz43_268290 


F 


M00055343C:B11 


UC2 -ColonMetLiver 


5149 


559093 


2056.G19.gz43_268292 


F 


M00055345D:D12 


UC2-ColonMetLiver 


5150 


458618 


2056.J19.gz43_268295 


F 


M00055349C:G07 


UC2-ColonMetLiver 


5151 


558890 


2056.F20.gz43_268307 


F 


M00055344C:F09 


UC2-ColonMetLiver 


5152 


559246 


2056.G20.gz43_268308 


F 


M00055345D:E02 


UC2-ColonMetLiver 


5153 


562849 


2056. J20.gz43_2683 1 1 


F 


M00055349C:H12 


UC2-ColonMetLiver 


5154 


561116 


2056. A2 1 .gz43_2683 1 8 


F 


M00055338A:H10 


UC2-ColonMetLiver 


5155 


558334 


2056.G2 1 .gz43_268324 


F 


M00055345D:E05 


UC2-ColonMetLiver 


5156 


552258 


2056.122.gz43_268342 


F 


M00055348B:F05 


UC2-ColonMetLiver 


5157 


560311 


2056.K22.gz43_268344 


F 


M00055351A:C09 


UC2 -ColonMetLiver 


5158 


560714 


2056.A23.gz43_268350 


F 


M00055338B:F06 


UC2-ColonMetLiver 


5159 


496460 


2056.D23.gz43_268353 


F 


M00055342C:E03 


UC2-ColonMetLiver 


5160 


487623 


2056.N23.gz43_268363 


F 


M00055355A:A10 


UCz-ColonMetLiver 


^ 1 tX 1 

J lol 




ZU.JO.rZ*t •gZ'f J ZOoJ / I 


r 


iv>rnnn ^ ^ ^ j. j.r* • 1-toq 

1V1UUU D D J HH- \s . nu^ 




5162 


560088 


2056.G24.gz43_268372 


F 


M00055346A:B01 


UC2-ColonMetLiver 


5163 


556011 


2056.J24.gz43_268375 


F 


M00055349D:F02 


UC2-ColonMetLiver 


5164 


555095 


2056.L24.gz43_268377 


F 


M00055352B:E01 


UC2-ColonMetLiver 


5165 


558959 


2056.M24.gz43_268378 


F 


M00055353D:C05 


UC2-ColonMetLiver 


5166 


466887 


2065.M01.gz43_268394 


F 


M00055373C:F05 


UC2-ColonMetLiver 
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Table 2 



TD 

NO 


L/LUo 1 HIS 


oilvj IN/Vlvlr!/ 


NT 


ClAJINi} ID 


T TDD A DV 

LIBRARY 


5167 


495799 


2065.C02.gz43_268400 


F 


M00055360C:C05 


UC2-ColonMetLiver 


5168 


550562 


2065 . E02 .gz43 26 8402 


F 


M00055363A:F07 


UC2-ColonMetLiver 


5169 


523606 


2065.H02.gz43_268405 


F 


M00055367D:A05 


UC2-ColonMetLiver 


5170 


549163 


2065 J02.gz43_268407 


F 


M00055370B:F07 


UC2-ColonMetLiver 


5171 


562844 


2065 .P02 .gz43_2684 1 3 


F 


M00055377B:E10 


UC2-ColonMetLiver 


5172 


636651 


2065 .F03 ,gz43_2684 1 9 


F 


M00055364D:E09 


UC2-ColonMetLiver 


5173 


543323 


2065 .M03 .gz43_268426 


F 


M00055373C:H10 


UC2-ColonMetLiver 


5174 


556416 


2065.L04.gz43_268441 


F 


M00055372D;C11 


UC2-ColonMetLiver 


5175 


498509 


2065 . M04 .gz43268442 


F 


M00055373D:B08 


UC2-ColonMetLiver 


5176 


376726 


2065.N04.gz43_268443 


F 


M00055374C:F01 


UC2-ColonMetLiver 


5177 


559696 


2065.D05.gz43_268449 


F 


M00055361D:H12 


UC2-ColonMetLiver 


5178 


484748 


2065 .K05 .gz43_268456 


F 


M00055371D:B08 


UC2-ColonMetLiver 


5179 


561229 


2065 . 005 ,gz43_268460 


F 


M00055376A:A08 


UC2-ColonMetLiver 


5180 


627297 


/"% /\ y ^ Arty O i 

2065 . A06.gz43_268462 


F 


M00055358A:F09 


UC2-ColonMetLiver 


5181 


456469 


2065 .C06.gz43_268464 


F 


M00055360C:G11 


UC2-ColonMetLiver 


5182 


552031 


2065J06.gz43_268471 


F 


M00055370C:D02 


UC2-ColonMetLiver 


5183 


527410 


2065.G07.gz43_268484 


F 


M00055366D:G10 


UC2-ColonMetLiver 


5184 


556959 


2065 .I07.gz43_268486 


F 


M00055369A:H08 


UC2-ColonMetLiver 


5185 


501534 


2065.B08.gz43_268495 


F 


M00055359C:H09 


UC2-ColonMetLiver 


5186 


91178 


2065.E08.gz43 268498 


F 


M00055363C:E02 


UC2-ColonMetLiver 


5187 


461325 


2065.P08.gz43_268509 


F 


M00055377C:G01 


UC2-ColonMetLiver 


5188 


561485 


2065 . A09.gz43_2685 1 0 


F 


M00055358B:C01 


UC2-ColonMetLiver 


5189 


557783 


2065 .109.gz43_2685 1 8 


F 


M00055369C:D04 


UC2-ColonMetLiver 


5190 


474580 


2065.O09.gz43_268524 


F 


M00055376B:A03 


UC2-ColonMetLiver 


5191 


450637 


2065.P09.gz43_268525 


F 


M00055377C:H08 


UC2-ColonMetLiver 


5192 


504167 


2065.N10.gz43_268539 


F 


M00055374D:C09 


UC2-ColonMetLiver 


5193 


561279 


2065 .0 1 0.gz43_268540 


F 


M00055376B:A11 


UC2-ColonMetLiver 


5194 


451124 


2065.B1 l.gz43_268543 


F 


M00055359D:C12 


UC2-ColonMetLiver 


5195 


558412 


2065.D1 l.gz43_268545 


F 


M00055362C:B06 


UC2-ColonMetLiver 


5196 


558334 


2065.F1 l.gz43_268547 


F 


M00055365C:D12 


UC2-ColonMetLiver 


5197 


552265 


2065. Kl l.gz43 268552 


F 


M00055372A:H02 


UC2-ColonMetLiver 


5198 


555796 


2065.Mll.gz43_268554 


F 


M00055374A:B11 


UC2-ColonMetLiver 


5199 


558507 


2065.N1 l.gz43_268555 


F 


M00055374D:F10 


UC2-ColonMetLiver 


5200 


453846 


2065.01 l.gz43_268556 


F 


M00055376B:B01 


UC2-ColonMetLiver 


5201 


556430 


2065.G12.gz43268564 


F 


M00055367A:B11 


UC2-ColonMetLiver 


5202 


549984 


2065.J12.gz43_268567 


F 


M00055370D:F06 


T T/~* ^ y~» 1 ~K A i.T 

UC2-ColonMetLiver 


0ZU3 




ZUoD.r 1 J.gz43__zooD /y 


r 


MUUU35jWL.r 1 1 


UL/Z-coionrvieiLiver 


5204 


560959 


2065J13.gz43_268583 


F 


M00055370D:H07 


UC2-ColonMetLiver 


5205 


636876 


2065.M13.gz43_268586 


F 


M00055374A:E01 


UC2-ColonMetLiver 


5206 


562801 


2065.P13.gz43_268589 


F 


M00055377D:F12 


UC2-ColonMetLiver 


5207 


559675 


2065.D14.gz43_268593 


F 


M00055362C:G08 


UC2-ColonMetLiver 


5208 


632260 


2065.K14.gz43_268600 


F 


M00055372B:E01 


UC2-ColonMetLiver 
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Table 2 



iu 

IN \J 


LLrUa ILK 


oLy JNAML 


>JT 


CLONE ID 


LIBRARY 


5209 


a y\. f\ 

490308 


2065 .L14.gz43_26860 1 


F 


M00055373B:A09 


UC2-ColonMetLiver 


5210 


559828 


2065 .B 1 5 .gz43_268607 


F 


M00055359D:H02 


UC2-ColonMetLiver 


5211 


559022 


2065 .C 1 5 .gz43J268608 


F 


M00055361A:C01 


UC2-ColonMetLiver 


5212 


556850 


2065.F15.gz43_26861 1 


F 


M00055366A:B04 


UC2-ColonMetLiver 


5213 


508515 


2065 J 1 5 .gz43_2686 1 5 


F 


M00055371A:B05 


UC2-ColonMetLiver 


5214 


465610 


2065 .A 1 7 .gz43_26863 8 


F 


M00055358D:G04 


UC2-ColonMetLiver 


5215 


463304 


2065.E17.gz43_268642 


F 


M00055364B:D01 


UC2-ColonMetLiver 


5216 


551551 


2065 .F 1 7.gz43_268643 


F 


M00055366A:H08 


UC2-ColonMetLiver 


5217 


559963 


2065 .E 1 8 .gz43_268658 


F 


M00055364B:E10 


UC2-ColonMetLiver 


5218 


562027 


2065 . J 1 8 .gz43_268663 


F 


M00055371A:H10 


T T A*~*S 1 TL. AT a T" * 

UC2-ColonMetLiver 


5219 


561868 


2065 .K 1 8 .gz43_268664 


F 


M00055372B:F11 


UC2-ColonMetLiver 


522a 


561610 


2065 J 1 9.gz43_268679 


F 


M00055371B:D01 


UC2-ColonMetLiver 


5221 


562565 


2065 .M 1 9.gz43_268682 


F 


M00055374B:D05 


UC2-ColonMetLiver 


5222 


235456 


2065.P19.gz43_268685 


F 


M00055378A:B12 


UC2-ColonMetLiver 


5223 


553615 


2065.C20.gz43_268688 


F 


M00055361B:F12 


UC2-ColonMetLiver 


5224 


534054 


2065.E20.gz43_268690 


F 


M00055364C:B08 


UC2-ColonMetLiver 


5225 


511746 


2065 .F20.gz43_26869 1 


F 


M00055366B:C04 


UC2-ColonMetLiver 


5226 


559495 


2065 . A2 1 .gz43_268702 


F 


M00055359B:F03 


UC2-ColonMetLiver 


5227 


559675 


2065 .C2 1 .gz43_268704 


F 


M00055361B:G08 


t T A**\ A*~"\ 1 "fc W . T * 

UC2-ColonMetLiver 


5228 


553705 


2065 . J2 1 .gz43_2687 1 1 


F 


M00055371B:F01 


UC2-ColonMetLiver 


5229 


559764 


2065 .M2 1 .gz43_2687 1 4 


F 


M00055374B:F06 


UC2-ColonMetLiver 


5230 


482090 


2065 .N2 1 .gz43_2687 1 5 


F 


M00055375B:H02 


UC2-ColonMetLiver 


5231 


562459 


2065.C22.gz43_268720 


F 


M00055361C:E05 


UC2-ColonMetLiver 


5232 


553904 


2065.K22.gz43_268728 


F 


M00055372C:E03 


UC2-ColonMetLiver 


5233 


555878 


2065.N22.gz4326873 1 


F 


M00055375B:H07 


UC2-ColonMetLiver 


5234 


559676 


2065 . A23 .gz43_268734 


F 


M00055359B:G09 


UC2-ColonMetLiver 


5235 


505971 


2065 .F23 .gz43_268739 


F 


M00055366C:B11 


UC2-ColonMetLiver 


5236 


62458 


2065 .H23 .gz43_26874 1 


F 


M00055368D:E03 


UC2-ColonMetLiver 


5237 


207099 


2065 .023 .gz43__268748 


F 


M00055377A:Bll 


UC2-ColonMetLiver 


5238 


491635 


2065.A24.gz43268750 


F 


M00055359B:H07 


UC2-ColonMetLiver 


5239 


32021 


2065 .D24.gz43_268753 


F 


M00055363A:D02 


UC2-ColonMetLiver 


5240 


448741 


2065.N24.gz43_268763 


F 


M00055375C:C08 


UC2-ColonMetLiver 


5241 


504560 


2078. HOI .gz43_26903 1 


F 


M00042953B:D02 


UC3-NormColon 


5242 


516415 


2078. MO 1 .gz43_269036 


F 


M00042965A:G02 


UC3-NormColon 


5243 


452981 


2078.J02.gz43_269049 


F 


M00042958B:H04 


UC3-NormCoIon 


5244 


514594 


2078.L02.gz43_26905 1 


F 


M00042963B:E12 


UC3 -NormColon 


5245 


451923 


2U/o.r>UJ.gz4i_zoyU!) / 


r 


MUUU4z-)ooL/.iiUl 


u c j -in ormL,oioii 


5246 


512721 


2078.J03.gz43_269065 


F 


M00042958C:D04 


UC3-NormColon 


5247 


512051 


2078 .K03 .gz43_269066 


F 


M00042960D:C11 


UC3-NormColon 


5248 


506920 


2078.P03.gz43_269071 


F 


M00042976B:F05 


UC3-NormColon 


5249 


451401 


2078.F04.gz43_269077 


F 


M00042628B:G11 


UC3-NormColon 


5250 


448594 


2078.104.gz43_269080 


F 


M00042955D:H03 


UC3-NormColon 



197 



JTable 2 

!SEQ 
ID 

NO IcLUSTEI 



5254 
5255 
(5256 



5257 
[5258 
15259 
I 5260 
15261 

5262 1 

5263 

5264 

5265 

5266 

5267 

5268 



452898 
505679 
451618 



451841 
448090 
449000 



373239 

451802 

511746 

516522 

423578 

513888 

452801 

452775 

415825 

447904 

514142 

92639 



SEQ NAME 
2078.M04.gz43 269084 
2078.N04.gz43_269085 
2078.O04.gz43_269086 
2078.G05.gz43 269094 
2078.105 .gz43 269096 
2078.N05.gz43_269101 



IorieI 

NT 

T 

_F 
_F 
_F 
_F 
F 



452182 

447210 

452500 

452204 

447025 

452324 

506901 

507349 

452052 

500853 

514160 

418340 

452142 



2078.O05.gz43 
2078.D06.gz43 
2078.E06.gz43 
2078.F06.gz43 
2078.J06.gz43 
2078.E07.gz43 
2078.L07.gz43 
2078.M07.gz43 
2078.C08.gz43 
2078.F08.gz43 
2078.J08.gz43 
2078.B09.gz43 



CLONE ID 

M00042965B:G08 
M00042969B:E01 
M00042972C:F07 
M00042951D:C05 
M00042956A.H02 
M00042969B:G10 



451994 
452615 



2078C09gz43_ 269154 
2078.D09.gz43 269155 
2078.O09.gz43 269166 
2078.A10.gz43 269168 
2078.E10.gz43 269172 
2078.G10.gz43 269174 
2078.All.gz43 269184 
2078Cll.gz43_269186 
2078.Fll.gz43 269189 
2078.Gll.gz43 269190 
2078Jll.gz43_269193 
2078.C12.gz43 269202 
2078.H12.gz43 269207 



.269102 I F I M00042972C:F09 

269107 F M00042623B:B09 

269108 I F M00042625D:C07 1 

269109 F M00042628C:G10 
,269113 1 F M00042959A:B07 
.269124 F M00042625D:E08 
,269131 F M00042963D:F1 1 
269132 F M00042965C:E06 
269138 F M00042589D:D08 
.269141 F M00042628D:F12 
269145 I F M00042959A.E08 
.269153 | F I M00042587B:G07 



LIBRARY 
UC3-NormColon 
UC3-NormColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NonnColon 



UC3-NonnColon 

UC3-NormColon 

UC3-NormColon 

UC3-NormColon 

UC3-NormColop 

UC3-NormColon 

UC3-NormColon 

UC3-NormColon 

UC3-NormColon 

UC3-NormColon 



_F 

F 



F 



452899 

446438 

452462 

510169 

508088 

452031 

513155 



2078.112.gz43_269208 I F 
2078.J12.gz43 269209 I ~ 



M00042589D:F02 

M00042623D:C02 

M00042973B:B10 

M00042583D:F03 

M00042626B:A12 

M00042952A:H12 

M00042583D:F1 1 

M00042590B:G02 

M00042629A:E11 

M00042952B:A08 

M00042959B:E11 

M00042590C:C09 

M00042954A:F04 



UC3-NormColon 
UC3-NormColon 



UC3-NonnColon 
UC3-NonnColon 
UC3-NonnColon 
UC3-NormColon 



513156 
452695 



2078.K12.gz43 
2078.K13.gz43 
2078.D14.gz43 
2078.J14.gz43 
2078.B16.gz43 
2078.E16.gz43 
2078.K16.gz43 



M00042957A:D06 
M00042959C:C06 



UC3-NormColon 
UC3-NormColon 
UC3-NonnColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 



2078.L16.gz43 
2078.N16.gz43 



.269210 1 F I M00042962A:G04 

.269226 F M00042962B:A03 

.269235 I F M00042624D:B05 

269241 F M00042959D:A05 

.269265 F M00042588A.G10 

.269268 F M00042626D.D12 

269274 | F I M00042962C:D05 



UC3-NormColon 
UC3-NormColon 



.269275 I F I M00042964C:D06 
269277 | F I M00042970D.D02 



UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 



UC3-NormColon 
UC3-NormColon 



198 



Table 2 

|SEQ 
ID 

NO ICLUSTE] 
152931 448332 

5294 513168 

5295 510717 

5296 456545 

5297 4244 

5298 452376 

5299 451993 

5300 451929 

5301 511357 

5302 505858 

5303 501401 
5304- 452662 
5305 512059 

| 5306 1 450805 



ORIE 

SEQ NAME I NT 

2078.J17.gz43 269289 I ~F~ 
2078.L17.gz43 269291 ~F 

2078.N17.gz43 269293 ~F 
2078.O17.gz43 269294 ~F 
2078.P17.gz43 269295 F 
2078.E18.gz43 269300 ~F 
2078.H18.gz43_ 269303 F 
2078I18.gz43_269304 ~F 
2078.K18.gz43_269306 ~F 
2078.B19.gz43 269313 ~F 

2078.C19.gz43_269314 ~F 

2078.N19.gz43_269325 ~F 
2078.L21.gz43 269355 F 
2078.O21.gz43 269358 I F 



CLONE ID 

M00042960A:B10 
M00Q42964C:D10 
M00042971A:B09 
M00042974D;B07 
M00042978B:F03 
M00042627B:A10 
M00042954D:D04 
M00042957D;C09 
M00042962D:B09 
M00042588C:E07 
M00042621D.A03 
M00042971A:D12 
M00042964D :C02 
M00042975C: 



A08 



5307 5040U/ ZU78.B22.gz43 269361 p M Q004258Q^iI 



LIBRARY 
UC3-NormColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 



5308 45u^ 2078.A23.gz43 269376 F M000425i5S?Ano 



UC3-NortnColon 



|5309 452066 

15310 452459 

[5311 452506 

15312 448453 



7022 
523182 
2930 
454701 
454226 
524624 
453220 
453893 



2078.B23.gz43 269377 
2078.K23.gz43_269386 
2078.L23.gz43_ 269387 
2078.A24.gz43 269392 



2089.A01.gz43_269708 
2089.E01.gz43 269712 
2089.L01.gz43 269719 
2089.H02.gz43_269731 
2089.I02.gz43. 269732 
2089.O02.gz43. 269738 
2Q89.B03.gz43_269741 
2089.C03.gz43 269742 



F M00042589B:E03 

F M00042963B:A02 

F M00042965A:B03 

F | M00042586B:A10 



UC3-NoimColon 



453470 2089.F03.gz43 269745 | F 
524478 | 2089.G03.gz4 3 269746 I F 



F M000426UA:A01 

F M00043079D:G10 

F M00043108B:D12 

F M00043093D:F12' 

F M00043097D.B12 

F M00043139D:A06 

F M00042614D:A12~ 

F | M00042618B:G04 



UC3-NonnColon 
UC3-NormColon 
UC3-NormColon 
UC3-NormColon 



529219 2089.H03.gz43 269747 
454050 2089.P03.gz43 269755 
519988 | 2089.E04.gz43 269760" 



M00043086A:C02 
M00043090D:H07 I 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



523590 

452936 

453761 

405102 

454720 

454438 

448985 

530656 

454134 



2089J04.gz43_ 269765 
2089.N04.gz43_269769 
2089.J05.gz43 269781 
2089.A06.gz43 269788 
2089.L06.gz43_269799 
2089.N06.gz43_269801 
2089.D07.gz43_269807 
2089.G07.gz43_269810 
2089.L07.gz43 269815 



F M00043094A:F01 
F M00043144D.H01 
F I M00043080B:C11 



UC3-PrimColon 
UC3-PrimColon 



F I M00043102A:G12 

F M00043137C:D02 

F M00043102B.F05" 

F M00042612A:B06 

F M00043109C:F04^ 

F M00043137D;D1Q 

F M00043077B:H0i 

F M00043091C.H05 

F I M0004 3131B.A11 

199 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



Table 2 



1 orr/"\ 








1 TT\ 






ORIE 




CLUSTER 


SEQ NAME 


NT 


1 5335 1 


454910 1 


2089.G08.gz43 269826 1 


F 


5336 


417549 


2089.D09.gz43 269839 


F 1 


5337 


454374 


2089G09.gz43_269842 


F 


5338 


454518 


2089.M09.gz43_269848 


F 


5339 


523753 


2089.N09.gz43 269849 


F 


5340 


522648 


2089.D10.gz43_ 269855 


F 


5341 


407275 


2089.N10.gz43_269865 


F 


5342 


415326 1 


2089.O10.gz43 269866 1 


F 



453756 2089.P10.gz43 269867 I F 



453766 2089.Jll.gz43 269877 f' 



I 5346] 
[5347 
5348 
5349 



23649 
449430 
528134 



2089.Kll.gz43 
2089.Lll.gz43 
2089.J12.gz43 
2089.K12.gz43 
| 2089.M12.gz43 



269878 
269879 
269893 
,269894 
269896 I 



CLONE ID 

M00043091C:H08 

M00043077D:E12 

M00043091D:C01 

M00043134C:D06 

M00043138A:H03 

M00043077D:F04 

M00043138B.B08 

M00043141C.C12 



M00043146C:F10 



LIBRARY 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-Pri m Colon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



UC3-PrimColon 



M00043102D:F11 IJC3-Primr»i™ 



451391 
454126 
437064 
454129 
519109 
519641 
526575 
453783 
453533 
449394 
523732 
395536 
454509 
454562 
451972 
453726 
527361 
453783 
453385 
45417 7 

801 
453494 
453202 
520616 



2089.N12.gz43 

2089.F13.gz43 

2089.H13.gz43 

2089.L13.gz43 

2089.M13.gz43 

2089.N13.gz43 

2089.01 3. gz43 

2089.C14.gz43 

2089.D14.gz43 

2089.M14.gz43 



M00043106B:F07 
M00043131B:G10 
M00043103A:G05 
M00043106C:D05 
M00043135C:E07 



269897 1 F I M00043138B:FoT 
269905 F M00043087B:D10 
269907 | F | M00043096A:E01 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



, 269911 

,269912 | 

.269913 

269914 

269918 

,269919 

269928 



M00043131C:A11 
M00043135D:A11 
M00043138B:G11 
M00043141D.A12 
M00043075A:B12 
M00043078C:C04 
M00043135D:C07 



UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 



2089.A15.gz43 
2089.115 .gz43 
2089.N15.gz43 
2089.D16.gz43 
2089.F16.gz43 
2089.H16.gz43 
2089.E17.gz43 
2089.H17.gz43 
2089.K17.gz43 
2089.M17.gz43 
2089.A18.gz43 
2089.B20.gz43 
2089.G20.gz43 
2089.I20.gz43 
2089.J20.gz43 
2089.P20.gz43 
2089.F21.gz43 



.269932 1 F I M00042613A:H01 

269940 F M00043100C:D08 

269945 I F M00043138C.D09 

.269951 F M00043078C:H05 

,269953 F M00043088B.D07 

,269955 I F M00043096C:D02 

,269968 I F M00043082D:B05 

.269971 F M00043096C:H04 

269974 F M00043107A:E07 

.2699761 F M00043136A:D03 

269980 F M00042613C.F06 

.270013 F M00042617C:B07 

.270018 F M00043093A:A06 

,270020 F M00043101A:F12 

.270021 F M00043104B.C09 

270027 F M00043149B:A01 

.270033 | F I M00043 089A:D06 

200 



UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 

UC3-PrimColon 



Table 2 

SEQ 
ID 

NO ICLUSTEI 



5377 

5378 

5379 

5380 

5381 

5382 

5383 

5384 



527873 
450566 
520074 
518566 
387530 
453846 
454483 
524157 



SEQ NAME 



2089.H21.gz43 270035 
2089.K21.gz43_270038 
2089.A22.gz43 270044 
2089.C22.gz43_270046 
2089.H22.gz43_270051 



ORIE 
NT 



2089.122.gz43_270052 
2089.L22.gz43_270055 
2089.A23.gz43 270060 



5385 453132 2089.K23.gz43 270070 



53861 524622 
5387 447446 
15388] 524470 



5389 
5390 
5391 
5392 
5393 
5394 



2089.N23.gz43 270073 
2089.A24.gz43_270076 
2089.F24.gz43_270081 



5395 


562216 


5396 


560948 


5397 


446531 


5398 


556793 


5399 


499690 


5400 


558297 


5401 


453508 


5402 1 


551640 


5403 


562085 


5404 


446399 


5405 
5406 


562498 
562524 


5407 


465836 


5408 


562701 


5409 


561457 


5410 


549082 


5411 


418622 


5412 


238196 


5413 


560868 


5414 


459764 


5415 


555639 


5416 


556011 


5417 


468565 


5418 


448949 



446242 I 2066.N03.gz43 270439 
554908 2066.P07.gz43 270505 
551995 2066.F08.gz43 270511 
561325 2066.H08.gz43_270513 
553244 2066.J08.gz43 270515 
562876 | 2066.D09.gz43 270525 



2066.F09.gz43 
2066.J09.gz43 
2066.P09.gz43 
2066.K10.gz43 
2066.M10.gz43 
2066.11 lgz43 
2066.Mll.gz43 
2066.Nll.gz43 
2066.C12.gz43 
2066.E12.gz43 
2066.G12.gz43 
2066.A13.gz43 
2066.J13.gz43 
2066.N13.gz43 
2066.01 3 .gz43 
2066.C14.gz43 
2066.H14.gz43 
2066.K14.gz43 
2066.D15.gz43 
2066.E15.gz43 
2066.M15.gz43 
2066.O15.gz43 
2066.A17.gz43 
2066.B17.gz43 



.270527 
,270531 
270537 
270548 
270550 
. 270562 
.270566 
.270567 
.270572 
270574 



CLONE ID 



_F 
_F 
_F 
_F 
F 



_F 
_F 
_F 
F 



_F 
_F 
_F 
F 



270576 
,270586 
270595 
,270599 
270600 
270604 
270609 
270612 
,270621 
270622 
.270630 
.270632 
270650 
270651 



F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 



M00043097A.D11 
M00043107D:H04 
M00042614A:C04 
M00043076A:A09 
M00043097A.F06 
M00043101C:F12 
M00043132C:D02 
M00042614A:H02 



LIBRARY 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



M00043108B:A01 | UC3-PrimC^ 



M00043139C:A05 
M00042614C:B09 
M00043090B:H06 



UC3-PrimColon 
UC3-PrimColon 
UC3-PrimColon 



M00055392A:H06 I JC2-rn1onM.ti 



M00055394D:F03 
M00055383B:H09 
M00055385C:F06 
M00055388A.A09 
M00055381C:G03 



F 
_F 
¥_ 
F_ 
F_ 
_F 
F_ 
F_ 
F_ 
F_ 
F_ 
F_ 
F_ 
_F 

201 



M00055383C.A08 

M00055388A.B06 

M00055395A:C02 

M00055388D:F11 

M00055391B.D05 

M00055386D:G02 

M00055391B:D07 

M00055392C.G07 

M00055380D:H02 

M00055382C:H06 



UC2-ColonMetLiver 
UC2-ColonMetLiver 
UC2-ColonMetLiver 
UC2-ColonMetLiver 
UC2-ColonMetLiver 



M00055384B.D10 

M00055378D:D04 

M00055388B:B02 

M00055392D:A06 

M00055394B:C06 

M00055381A:F02 

M00055385D:D03 

M00055389A:D08 

M00055381D:DQ8 

V100055382D;D04 

M00055391B:H08 

M00055394B:D08 

M00055379A:D07 

M00055380A:C06 



UC2-ColonMetLivcr 

UC2-ColonMetLiver 

UC2-ColonMetLiyer 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 



UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLiver 

UC2-ColonMetLivcr 




Table 2 



SEQ 
ID 
NO 


CLUSTEI 


3 SFO NAMF 


NT 


CLfUJNrj ID 


LIBRARY 


^41 Q 


OjOJjZ 


zUoo.J 1 /.gz43 27U659 


F 


M00055388B:E01 


UC2-ColonMetLiver 


J4ZU 


3014o / 


zUoo.Fl /.gz43 270665 


F 


M00055395B:C04 


UC2-ColonMetLiver 


^47 1 


3UOU4 


zUoo.Lrlo.gz43 270677 


F 


M00055390C:E06 


UC2-ColonMetLiver 


^477 
34ZZ 


jj /Z4y 


zOoo.JzO.gz43 270707 


F 


M00055388B:H04 


UC2-CoIonMetLiver 


3*fZ3 


3ozo4y 


zUoo.Hzl.gz43 270721 


F 


M00055386A:G05 


UC2-ColonMetLiver 


^AOA 


jOZJoO 


zUoo.Cz2.gz43 270732 


F 


M00055381B:C10 


UC2-ColonMetLiver 


^47^ 


OjjVOj 


zUoo.Jz2.gz43 270739 


F 


M00055388C:D01 


UC2-CoIonMetLiver 


^47£ 


4^345 / 


zUoo.Dz3 .gz43_270749 


F 


M00055382B:E02 


UC2-ColonMetLiver 


S477 




zUoo.iNz3.gz4 J 270759 


F 


M00055393B:F04 


UC2-ColonMetLiver 


S47R 




zUoo.rz4.gz4J Z/U767 


F 


M00055383D:H11 


UC2-ColonMetLiver 


S47Q 

J*tZ7 


ZoZUlJ 


ZUOO.lVz4.gZ4j Z/U//Z 


F 


M00055389C:F12 


UC2-ColonMetLiver 




JO 1ZU0 


zUo /.rUz.gz43 270799 


F 


M00055402A:A05 


UC2-ColonMetLiver 


S43 1 


4ouo 34 


o T7AOAA 

ZUo /.LrUz.gz43 z/UoUU 


F 


M00055402D:A11 


UC2-ColonMetLiver 


^4^7 




zUo/.lvUz.gz4J z7Uo04 


F 


M00055407B:G07 


UC2«ColonMetLiver 


3*fr33 


JOUojo 


zUo/.rUi .gz43_270o25 


F 


M00055412D:A12 


UC2-ColonMetLiver 


^4^4 
3*f34 


jjOJoZ 


zUo/.liU4.gz43 270830 


F 


M00055400D:G01 


UC2-ColonMetLiver 


S4^ 
3*f33 


jOIZju 


7A/C7 TA^ AP ,/10 17AOCA 

zUo /.1U3 .gz43_270850 


F 


M00055405A:A02 


UC2-ColonMetLiver 


3*t 30 


30Z354 


1A/:7 TA< 11AOC1 

zUo/JU3.gz43 270851 


F 


M00055406B:E07 


UC2-ColonMetLiver 


^4^7 

3*0 / 




zUo /.JNU3.gz43_270855 


F 


M00055410C.G08 


UC2-ColonMetLiver 


3435 


jjOjoU 


O A/C*7 nn/ „ /i o ttaoca 

zUo/.BUo.gz43 270859 


F 


M00055397A:H07 


UC2-ColonMetLiver 




cii 701 

j i i /yz 


zUo /.CUo.gz43 270860 


F 


M00055398C:F07 


UC2-ColonMetLiver 


344U 


43Z /Ol 


OA/IT TA/C /I "> ttao^/' 

zUo7.1Uo.gz43_270866 


F 


M00055405A:C01 


UC2-ColonMetLiver 


344 1 


403Z54 


zUo7.RUo.gz43_270868 


F 


M00055407C:E04 


UC2-ColonMetLiver 


344Z 


/I 1 QA QCi 

4iy4<>y 


OA/C7 TAT ^_ /I O OTAOO^ 

z0o7.107.gz43 270882 


F 


M00055405A:G11 


UC2-ColonMetLiver 


^AAI 

344 D 


34UUUU 


2067 .F07 .gz43 2708 89 


F 


M00055413A:A02 


UC2-ColonMetLiver 


^444 
3444 


«A 1 ac 

3301U3 


zUo/.KU8.gz43 270900 


F 


M00055407C:G04 


UC2-ColonMetLiver 


<AA^ 
3443 


3ozUUl 


2067. N 08 .gz43 270903 


F 


M00055410C:H11 


UC2-ColonMetLiver 


3440 


7QQ/£0/£ 
ZOOOZO 


OA^I XTAA —__/!'*> nAAl A 

z067.JN09.gz43_270919 


F 


M00055410D:B02 


UC2-ColonMetLiver 


S447 
344 1 


<A 1 77< 
3013Z3 


zU67.rllO.gz43 270929 


F 


M00055404A:B10 


UC2-ColonMetLiver 


^448 
344o 


334 /4z 


zUo/.110.gz43 270930 


F 


M00055405B:H05 


UC2-ColonMetLiver 


*\AAQ 


30UUU3 


zUo/.Al l.gz43_270938 


F 


M00055396B:C06 


UC2-ColonMetLiver 


S4Sft 

J*TJU 


DDjoDO 


ZUo/.Crl 1 .gz4 J_z /U944 


F 


M0005 5403 A: C07 


UC2-ColonMetLiver 


^4^ 1 
3*t3 1 


1 QQ 

Ojj 1 0 7 


OA/:7 XT 1 1 nr^A'J 17AAC1 

zUo / .IN 1 1 ,gz43 27095 1 


F 


M00055410D:G01 


UC2-ColonMetLiver 


S4S7 


^A7 1 47 
30Z 14 / 


ZUO /.rJlz.gz43 Z/Uy33 


F 


M00055397D:A01 


UC2-ColonMetLiver 




333 /DO 


7HA7T-J17 7TAO/11 

ZUO / .rllz.gz4J_z /uyol 


r 


M00055404A:D08 


UC2-ColonMetLiver 


S4S4 


o30oy 


zUo / .1 1 3 .gz43_z / Uy /o 


r 


M0005 5 405 C : C04 


UC2-CoIonMetLiver 


5455 


491260 


2067 J13 ez43 270979 


F 




u -^oionivieti-fiver 


5456 


559776 


2067.L13.gz43 270981 


F 


M00055408C:E04 


UC2-ColonMetLiver 


5457 


561922 


2067.A14.gz43_270986 


F 


M00055396B:G02 


UC2-ColonMetLiver 


5458 


559102 


2067.114.gz43_270994 


F 


M00055405C:H07 


UC2-ColonMetLiver 


5459 


440833 


2067.O14.gz43 271000 


F 


M00055412A:C05 


UC2-ColonMetLiver 


5460 


561212 


2067.K15.gz43_271012 


F 


M00055408A:A05 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTEF 


SEQ NAME 


ORIE 
NT 






5461 


556734 


OH rT7n 771 HI A 
ZUO / .1-/ 1 J.gZ4 J_Z / lUlO 


r 


M0005 54 1 2A:F09 


UC2-ColonMetLiver 


^46? 


HO / 07J 


70^7 T1£ rr-y/H 771 H77 
ZUO /.J 10.gZ4J Z/1UZ/ 


c 
r 


M00055406D:C10 


UC2-ColonMetLiver 


JHOj 


<<q 1 on 

JJO 1 oU 


7HA7 /^1 ft 'J 171 n<«£ 
ZUO /.Urlo.gZ4j Z/IUjo 


F 


M0005 5403B : A 1 0 


UC2-ColonMetLiver 


S4£4 




7fi/£7 Tift rrrr/l 1 7710.^0. 

ZUO /.J lo.gZ4j z/iujy 


r 


M0005 5406D :G 1 2 


UC2-ColonMetLiver 


S4£S 


A<1£ 171 
440 1/1 


ZUO / .r iy.gZ4J Z/lUol 


T? 

r 


M00055413C:B09 


UC2-ColonMetLiver 


*i4f^ 

J*tOO 


jOHjo 


ZU0/.15ZU.gz4j_z /lUoJ 


F 


M00055398A:C 1 1 


UC2-ColonMetLiver 


J*tO / 


J J lOl / 


7H^7 T7A *v-w17 771 non 

zuo /.izu.gz4j_z / iuyu 


■ 

r 


M00055405D:G05 


UC2-ColonMetLiver 


J^OO 


40y©JZ 


7HA7 "D7 1 ^vrryl "2 ntflfift 

zuo /.ozi .gz4j z/ioyy 


F 


M0005 5 3 9 8B : A05 


UC2-ColonMetLiver 


S4^Q 


JJOlZO 


7AA7 T>71 r*-»/17 771 in 

ZUo / .rz 1 .gZ4 j_Z / 1 1 1 j 


r 


M0005 54 1 3 C : G09 


UC2-ColonMetLiver 


S470 




7HA7 1777 r*-»/l7 771 1 IO 
ZUO /.rZZ.gZ4j z/iiiy 


r 


M00055402C:C12 


UC2-ColonMetLiver 


S471 

J*T / 1 


JOZZJO 


7fir^7 A77 r*-w17 771 T3n 

ZUO / ./YZ J .gZ4 J Z/113U 


T7 

r 


M00055397A:B 10 


UC2-ColonMetLiver 


S477 

J*T 1 aL 


fJDOo / 


7i*ir^7 T571 rrrr/17 771 17 1 

ZUO / . dZ3 .gZ4 j_Z / 1 1 3 1 


T7 

r 


M00055398B:C05 


UC2-ColonMetLiver 


S47^ 


jjU / 1 j 


ZUo/.riZJ.gz4j_Z/ 1 137 


F 


M00055404C:C 1 1 


UC2-ColonMetLiver 


S474 

J*+ 




7HA7 1^77 ft^yn 771 1 Af\ 

ZUO / .JvZ j ,gz4 j_Z / 1 14U 


r 


X If f\f\f\ EC A AO A T""' I ^ 

M00055408A:F12 


UC2-ColoiiMetLiver 


S47^ 
Jt / J 


i nn»7 i 

lUUoZ 1 


7H£7 T 77 o-»/t7 771 1/11 
ZUO /.LZJ.gZ4j Z/1141 


r 


x a r\r\r\ c c a r\c\ a i- < r\ f 

M00055409A:E06 


UC2-ColonMetLiver 


J4 /O 


1 J4V5U 


ZUo / ,AZ4.gz4 3_Z / 1 14o 


r 


M00055397A:C06 


UC2-ColonMetLiver 


S477 

J4 / / 


1 71A 


?n^7 m/l 7711^0 
ZUO / .OZ4.gZ4 J_Z / 1 1 JZ 


r 


n A f\r\r\ c c a /\o t~* t ^ 

M00055403B:G12 


UC2-ColonMetLiver 


S47R 


J 1 OU4 J 


7H/i7 T 7/1 r*T/17 T71 l C7 
ZUO/.i^Z4.gZ4j Z/lIj/ 


r 


X lAAACr /I AA A T™ 1 1 r\ 

M00055409A:E10 


UC2-ColonMetLiver 


S47Q 

J*T /Z7 


JJ7J01/ 


OH^fi rrr^/17 771 1 TO 

ZUOo.lvUl .gZ4 J Z/ll/Z 


T? 

r 


M00055424B:D04 


UC2-ColonMetLiver 


JHOU 


44Q7Q^ 
44^ /yj 


7HAft 771 7AO 
ZU05.UUj.gZ4j Z/lZUo 


T7 

r 


MU0055474A:G06 


UC2-ColoriMetLiver 


S4X1 


JJU 1 OO 


7HA51 nO/1 rrT/17 771717 
ZUOo.UU4.gZ4J_Z / 1Z1 J 


T7 

r 


M0U0554 1 7C: A05 


UC2-ColonMetLiver 




JO 1040 


7HA5i rr-rA7 77171^ 
ZUOo.rU4.gZ4J_Z / 1Z1 J 


T7 

r 


M0005 54 1 9D : D04 


UC2-ColonMetLiver 




S£1 117 

Jul 1 1Z 


7HAft ct-tA 7 771 77Q 
ZUOo.LiUJ .gZ4 J_Z / 1ZZV 


T? 

r 


\lAAAccyi 1 Tr< r\ , ~> 

M0005 54 1 7C : G03 


UC2-ColonMetLiver 


S4K4 


41 R^£7 
4 1 0 jOZ 


7H^ft rn« nr^/17 771 T2n 
ZUOo.liUj.gZ4j_Z / 1Z3U 


T? 

r 


M00055418C:D08 


UC2-ColonMetLiver 




JJ7JOJ 


7HAft TH^ rt-zll otitic 

ZUo5JUj.gZ43_Z / IZjj 


F 


% JTAAAf C /inA* rkAf 

M00055423C:D05 


T T A*~^ df~*S -t ~m. aw m t • 

UC2-ColonMetLiver 




J0141 1 


ZUoo.MUj.gz4j_Z / Izio 


r 


M00055472A:B12 


UC2-ColonMetLiver 


^4R7 


^£ 1 77H 
JO 1 / /U 


ZUOo.CU /.gZ43_Z / 1Z6U 


F 


M00055416B:F1 1 


UC2-ColonMetLiver 




JJ / /ou 


OH^Q p\r\Q r»-rr/17 771 7T7 

ZUOo.JLJUo.gZ4j Z/lZ// 


r 


\ A f\ f\f\ c c a 1 tfv nno 

M000554 1 7D:D08 


UC2-ColonMetLiver 




40oO /Z 


ZUoo.MUo.gz4 J Z/lzoo 


F 


"fc A'f\f\f\C C A A "¥"1/^/^ 

M00055472A:F02 


UC2-ColonMetLiver 




jjo^yu 


ZU0o.INUo.gZ4j_Z / IZo / 


r 


M00055473B:D01 


UC2-ColonMetLiver 




J JUU 1 0 


7H/CC TTHO ^*-»/l7 T717fl< 

ZUoo.rUy.gz4j__Z / i/yj 


F 


X A f\f\f\ C C A 1 AT\ T TrtT 

M00055419D:H07 


UC2-ColonMetLiver 


S4Q7 


**£1 817 
JO 1 Oj / 


7HAft HJnQ t*-*A1 771 70T 

ZUoo.MUy.gz4j_Z / IZy 1 


F 


M0005 5 42 1 C : F06 


UC2-ColonMetLiver 


J*T7J 


J j4UZo 


OrtAQ THQ f*rr/17 OTlOftO 

zuoo.iuy.gz4J z/izy© 


r? 

r 


M00055422C :E 1 0 


UC2-ColonMetLiver 


S4Q4 


*+ / uoo / 


7n^ft nin 771 7no 
ZUOo.UlU.gZ4J z/i3uy 


F 


M000 5 5 4 1 7D :H08 


UC2-ColonMetLiver 


JH7J 


J jzjo 1 


on^Q ct n rvr»>i7 771 1 in 
ZUOo . E, 1 U .gz4 j_Z / 1 j 1U 


F 


M000554 1 8D:A03 


UC2-ColonMetLiver 




JJO / 7J 


70^9 HIM r»T/17 771717 
ZUOo.01U.gZ4j_Z / 1 j 1Z 


F 


MU005542 1 A:A09 


UC2-ColonMetLiver 


5497 


561718 


2068N10ez43 271319 




M000SS47^Pl*F1 n 

IV1V/V/V/J J*r / JDX 1 U 


u i^z -i^oioniviexijiver 


5498 


630348 


2068.Ell.gz43_271326 


F 


M00055418D:B05 


UC2-ColonMetLiver 


5499 


491728 


2068.Hll.gz43_271329 


F 


M00055421C:G07 


UC2-ColonMetLiver 


5500 


562477 


2068.C12.gz43_271340 


F 


M00055416D:B03 


UC2-ColonMetLiver 


5501 


402836 


2068D12.gz43_271341 


F 


M00055418A:C12 


UC2-ColonMetLiver 


5502 


556511 


2068J12.gz43_271347 


F 


M00055424A:A08 


UC2-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


CLUSTE1 


* SEQ NAME 


ORIE 
NT 


CT ONF TT> 


T TDD A DV 


5503 


559076 


2068 N 1 1 f*7d1 771 


p 
r 


JV1UUU3 04 / JC:A05 


UC2-ColoiiMetLiver 


5504 


417426 


2O68P19 0741 9711S1 


p 
r 


MUUU35475D:A05 


UC2-ColonMetLiver 


5505 


456520 


2068 All 074 1 9711S4 


p 


MUUU5 54 1 4C : C02 


UC2-ColonMetLiver 


5506 


492982 


2068 J 11 0741 971 161 


p 


MUUID 3424 A: C 1 0 


UC2-ColonMetLiver 


5507 


558679 

\J\J 1 S 


9068 Pi 4 0741 971379 


p 
r 


MUUU3341oD:Dl 1 


UC2-ColoriMetLiver 


5508 


561707 


2068 F14 cr741 97117S 


p 


JMUUU3 3 4z UA: xl 1 z 


UC2-ColonMetLiver 


5509 


561180 


2068 T1 4 0741 971178 


p 

r 


JV1UUU334Z3 A: A 1 0 


UC2-ColonMetLiver 


5510 


554953 


2068TC14az41 971180 


p 
r 


JVlUUU334z3A:riU3 


UC2-ColonMetLiver 


5511 


492876 


2068 LI 4 ^r41 971181 


p 
r 


JV1UUU334 / 1 C:x3U 1 


UC2-ColonMetLiver 


5512 


555933 


2068 Ol 4 0741 971184 


P 
r 


JV1UUU334 /4C:H12 


UC2-ColonMetLiver 


5513 


562137 


2068 PI 4 0741 97118S 


p 


IVIUUIO 3 4 /3 1J : A 1 U 


UC2-ColonMetLiver 


5514- 


561500 


9068 R 1 S o741 971387 


p 
r 


JYlUUU334l3U:Clz 


UC2-ColonMetLiver 


5515 


561682 


9068 D1 S 0^41 971780 


p 
r 


MUUU334 1 o A: Jb07 


UC2-ColonMetLiver 


5516 


560118 

~J \J\J M. X U 


9068 T1S 0^41 971 3Q4 


p 
r 


MUUU334z3A:D02 


UC2-ColonMetLiver 


5517 


512392 


9068 T1S 0^41 971 3CK 


p 
r 


JVIUUU3 3 4z4 A: r 04 


UC2-ColonMetLiver 


5518 


449356 


9068 Tl 6 o-7-d.l 971410 

ZUUO.liO.gZH'J Z/i'+lU 


p 
r 


MUU0334z3A:Cj08 


UC2-ColonMetLiver 


5519 


447098 


9068 T 16 cx-743 971417 


p 
r 


MUU033471C:D04 


UC2-ColonMetLiver 


5520 


522220 


9068N16&741 97141^ 

LUUO.H lu.^Z.tJ Z/X*+1.J 


p 


MUUU334/JC:ri03 


UC2-ColonMetLiver 


5521 


453715 


2068 P16 a741 971417 


p 
r 


iVlUUU334 / 3 JJ.xJO/ 


UC2-ColonMetLiver 


5522 


554774 


9068 017^^41 971471 

ZUDO.J71 / .gZfj Z/l'fZl 


p 
r 


\/TnAACCyl 1 O A .T TA A 

MUUU334 1 o A:ri04 


UC2-ColonMetLiver 


5523 


553987 


9068 Tl 7 0741 971496 


p 
r 


JVlUUU334z3ri:A0o 


UC2-ColonMetLiver 


5524 


561645 


2068 K 1 7 0741 97 1 498 


p 


J\lUUU334z3i3:LI Iz 


UC2-ColonMetLiver 


5525 


5495 1 1 


2068 Ml 8 0741 971446 


p 
r 


MUUU334/zC:(j1 1 


UC2-ColonMetLiver 


5526 


560700 


9068 P18 0741 97144Q 
Auuo.rio.gztj Z f 1*+*+" 


p 
r 


JV1UUU334 /31J:Cjr0o 


UC2-ColonMetLiver 


5527 


493410 


2068 K90 0741 971476 


p 
r 


JV1UUU3 3 4z 3 Jd : ,b Uo 


UC2-ColonMetLiver 


5528 


482425 


9068 A91 0741 971489 


p 


MUUU3 3 4 1 4D : Lr 1 0 


UC2-ColonMetLiver 


5529 


561497 


2068 R91 0741 971481 


P 


1V1UUU334ioA.CU4 


UC2-ColonMetLiver 


5530 


131130 


2068 P21 0741 971484 

iVUO.V/^l.gZtJ Z, / 1 Hot 


p 
r 


JMUUU3341 / A:CjI 1 


UC2-ColonMetLiver 


5531 


524736 


2068 F99 0741 971 SOI 


p 
r 


l\lUUU334zUl^;liUo 


UCz-ColonMetLiver 


5532 


58680 


2068 G99 0741 971 S04 


p 
r 


MUUU334zIrj:xiU3 


UC2-ColonMetLiver 


5533 


562302 


2068 N22 0741 971 S1 1 


p 


1V1UUU334 /4A.oU3 


UC2 -ColonMetLiver 


5534 


514824 


2079 M0 1 0741 971 SS8 


P 

r 


Ajfnnn/iino 1 t^.t? i a 
MUUU43Uzl D:r 10 


UC3 -NormColon 


5535 


452523 


2079 P02 0741 971 S77 


r 


A^nnnyfon'2 1 A .Dm 


T T/*^ O XT 1 

UC3-NormColon 


5536 


453572 


2079 K04 0741 971604 

Z.V/ / ^.IW/t.gAtJ Zr / 1 OV/*T 


p 


\AnCif\A'2f\l 1 A -Cnz; 
1YIUUU4jU 1 zA.rUo 


UC3 -NormColon 


5537 


406092 


9079 T0S 0741 971618 


P 
r 


JV1UUU4 j UU / JJ:r U7 


UC3-NormColon 


5538 


452759 


907Q OOS 0741 971694 


p 


Jviuuu4JUzyA. J bl l 


T Trf^*} XT * 

UC3 -NormColon 


5539 


456756 


2079.B06.gz43 271627 


F 


M00042983D:A03 


T TP 1 -Nnrm Pnl rm 

\J KJl IIIV^OILJII 


5540 


505226 


2079.C06.gz43 271628 


F 


M00042987C:E04 


UC 3 -NormColon 


5541 


238196 


2079.D06.gz43J271629 


F 


M00042991C:C02 


UC3-NormColon 


5542 


235874 


2079.H06.gz43_271633 


F 


M00043005A:B06 


UC3-NormColon 


5543 


512432 


2079.O07.gz43_271656 


F 


M00043029B:C09 


UC3-NormColon 


5544 


269927 


2079.O09.gz43_271688 


F 


M00043029B:F09 


UC3-NormColon 
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Table 2 

SEQ 
ID 

NO CLUSTER! 



SEQ NAME 



|ORIE| 

NT 



CLONE ID 



5545 449510 



LIBRARY 



2079.K10.gz43_271700 1 F I M00043012D:A06 



UC3-NormColon 



516799 



2079.O10.gz43_271704 | F I M00043029B:G10 



UC3-NormColon 



455716 



2079Ellgz43_271710 | F I M00042995CE04 



UC3-NormColon 



5548 452976 



2079.Gll.gz43 271712 | F I M00043003A:H07 



UC3 -NormColon 



5549 452031 



2079.Nll.gz43_271719| F I M00043026D:D06 



5550 452957 



2079.Pll.gz43 271721 F I M00043032B:H01 



UC3-NormColon 



UC3-NormColon 



5551 452830 



2Q79.D12 gz43_271725 | F I M00042991D:F06 



UC3-NormColon 



5552 452220 



2079.C13.gz43_271740 | F I M00042988D:G10 



UC3-NormColon 



5553 500896 

5554 455855 



2079.K13.gz43_271748 | F I M00043013B:A06 



2Q79.L13 gz43_271749 | F I M00043017D:B06 



UC3-NoimColon 



UC3-NonnColon 



5555 513381 



2079.M13.gz43_271750| F M00043023D:D06 



UC3-NonnColon 



5556 448200 



2Q79.B14gz43_271755 1 F M00042985B:C05 



5557 509027 



2079.F14.gz43 271759 I F I M00043000C:H06 



UC3-NormColon 



UC3-NormColon 



5558 452618 



2079.N14.gz43 271767 1 F I M00043027B:C04 



5559 510254 



2079.C15.gz43_271772 | F M00042989C:A06 



UC3-NormColon 
UC3-NormColon 



5560 451812 



2079.D15.gz43 271773 I F I M00042992B:B03 



5561 452392 



5562 504812 



2Q79.G15 gz43_271776 | F I M00043003C:A02 



UC3-NormColon 



2079 I15 gz43_271778 | F I M00043008D:E12 



UC3-NormColon 



5563 504431 



UC3-NormColon 



2079.J15.gz43_271779 | F M0004301 1B:D03 



5564 452440 



2079.L15.gz43_271781 | F I M00043020A:A03 



UC3-NormColon 



UC3-NormColon 



5565 450225 



2079.B16.gz43 271787 F M00042985C:B03 



UC3-NonnColon 



5566 511348 



2Q79.C16 gz43_271788 | F | M00042989C:B07 



UC3-NormColon 



5567 502343 



2079.D16.gz43 271789 | F I M00042992B:BI 1 



UC3-NormColon 



5568 451850 



2079.F16.gz43 271791 | F I M00043001A:C10 



UC3-NormColon 



5569 455117 



2079 K16 gz43_271796 | F M00043013C:A01 



UC3-NormColon 



5570 452922 



2079.L16.gz43_271797 | F I M00043020A:G12 



UC3-NormColon 



5571 504501 



2079.B17.gz43_271803 | F | M00042985C:D02 



UC3-NormColon 



5572 503122 



2079.F17.gz43_ 271807 | F | M00043001B:C01 



UC3-NormColon 



5573 508126 



2079 I17.gz43_271810 | F I M00043008D:H09 



UC3-NormColon 



5574 448687 



2079.A18.gz43_271818 | F I M00042982C:G02 



UC3-NormColon 



5575 451491 



2079.B18.gz43 271819 F M00042985C:D04 



UC3-NormColon 



5576 452969 



2079 F18 gz43_271823 | F I M00043001B:E02 



UC3-NormColon 



5577 452866 



2079.G18.gz43_271824 I F I M00043003C:G10 



UC3-NormColon 



5578 446936 



2079 K18 gz43_271828 | F I M00043013D:F03 



5579 452445 



UC3-NormColon 



2079.L18.gz43 271829 F M00043021A:A09 



UC3-NormColon 



5580 451092 



2079.N18.gz43 271831 | F M00043027D:G12 



UC3-NormColon 



5581 75212 



2079.D19.gz43 271837 | F I M00042992D:H02 



UC3-NormColon 



5582 452325 



2Q79.F19 gz43_271839 | F I M00043001B:H01 



UC3-NormColon 



5583 502984 



2079.1 19.gz43_27 1842 | F M00043009B:C09 



UC3-NormColon 



5584 508125 



2079.J19.gz43 27 1 843 



M00043011C.H09 



UC3-NormColon 



5585 452528 



2079.L19.gz43_271845 | F M00043021A:B06 



UC3-NormColon 



5586 509505 



2079 C20 gz43_271852 | F I M00042989D: A07 



UC3-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEI 


? SEO NAMF 


ORIE 
NT 




LIBRARY 


JJO / 


4SS 1 7.9 
4J J UZ 


9H7Q Oft rt-yi-j T71 0€/c 

zu/y.ozu.gz4j z/1oj6 


r 


M00043003D:D10 


UC3-NormCoIon 


JJOO 




ZU/y.lZU.gz43 Z/loJO 


r 


M00043009B:F1 1 


UC3-NormColon 


SS8Q 

JJ07 


4j ju /y 


zu /y.MZU.gz43 271 ©62 


F 


M00043024B:H11 


UC3-NormColon 


JJ7U 


4485 1 j 


ZU/y.L,Zl.gz4j 271868 


F 


M00042990A:E05 


UC3-NormColon 


JJ7 1 


JlJl /o 


ZU /y.JJZI.gz43 2/Ioo9 


F 


M00042993A:D12 


UC3-NormColon 




4 J joU^f 


ZU/y.HZl.gz43 Z/lo/U 


F 


M00042996D:D12 


UC3-NormColon 


JJ7J 


4 J jU /o 


ZU /y.L,Z 1 .gz4 J_2 / 1 5/7 


F 


M0004302 1 A:H06 


UC3-NormColon 


SSQ4 




ZU/y.rZl.gz4J Z/lool 


F 


M00043 03 3 C : C05 


UC3-NormColon 


JJ7J 


Sft4»»n 


90.70 A 99 rr-y/17 071 OQ1 

ZU /y./\ZZ.gZ4j z/looz 


r 


M00042983A:E09 


UC3-NormColon 


SSQ6 


44744 S 

44 / TTJ 


ZU /y.I5ZZ.gZ4J Z/looJ 


T? 

r 


M00042986B:E06 


UC3-NormColon 




JUjDZj 


9H7Q fT79 rrrr/1 ^ 071000 

ZU /y.LrZZ.gZ4j Z/looo 


T7 

r 


M00043 004A; C08 


UC3-NormColon 


JJ7<) 


Sft961 4 


9H7Q T99 rr-rAQ 971 QG1 

zu /y.jzz.gz4j z / loyi 


r 


M0004301 1D:C12 


UC3-NormColon 


JJ77 


4^4996 
4 jHZZO 


9H7Q VII e***A1 771 OQ1 

ZU /y.rvZZ.gZ4j Z/loyZ 


r 


M00043 0 1 4B : G 1 2 


UC3-NormColon 


S60ft 


4^907*3 

4 JZ7 / J 


7ft7Q T 99 rr-y/19 771000 

zu/y.ijzz.gz4j z/i©yj 


r 


M00043 02 1 B:H03 


U C 3 -NormColon 


JOU 1 


4S67SS 

HJO / Jj 


zu /y.ivizz.gz4j z/ioy4 


r 


M00043025A:CO 1 


UC3-NormColon 


S607 

JUV/Z, 


S 1 SI 97 
JlJlZ/ 


9ft7Q 099 rt*rA1 771 QO^ 

zu /y.*jzz.gz4o z/ioyo 


T7 

r 


HJAAA^O ATA/*< t^/-\o 

M00043030C:F03 


UC3-NormCoIon 




Sft74Sft 
JU / H JU 


9ft7Q P97 cttAI 971000 

zu /y.rzj.gz4j_z / lyuj 


r 


XjfAAAilO AA1 A f~*f\f\ 

M00043002A:G09 


UC3 -NormColon 


JDUt- 


44&'3& i 


9ft7Q m 1. n-*A1 771QMy1 

zu /y.vrzo.gz4j_z / iyu4 


T? 

r 


n if r\r\r\ a o /i a yi i /\ 

M00043 004A : C 1 0 


UC3-NormColon 




sftSQ77 

JUJ7J J 


9H7Q 1^71 rrrr/IO 771 OHO 

zu /y.ivzj.gz'fj_z / iyuo 


T7 

r 


M000430 14C:F07 


UC3-NormColon 


S606 


4S9648 


9H7Q \A^1 rr^A 1 77101ft 

zu /y.ivizj.gz43 z/iyiu 


r 


K f\f\C\ A ^ /\1 ^ A T~\.t\f\ 

M00043025A:D09 


UC3-NormColon 


S6H7 
JOU l 


4S7,ft7ft 


9H7Q XT77 f*tA1 771 Ol 1 

zu/y.rNZJ.gz43_z/iyi I 


r 


M00043028C:H09 


UC3 -NormColon 


JOWO 




7H70 DT2 rr^/17 771 Ol 1 

zu /y. J rZJ.gz4J_z / iy i j 


T7 

r 


■* k f\r\f\ juaoi y* r\ c 

M00043 03 3 C : D05 


UC3-NormColon 


JOU7 


sft i ^74 


ZU /y.DZ4.gZ4j_Z / ly 13 


r 


M00042986C:B07 


UC3-NormColon 


ft 


4S4&4Q 
4 J4 o4y 


7fl7Q T77/1 r*^/!7 771010 

zu /y.rZ4.gz4J z/iyiy 


r 


M00043 002B : E06 


UC3-NormColon 


Jul 1 


JOOOU / 


9H70 U9/I *»rr/f1 771011 

zu /y.riz4.gz43 z / iyz i 


r 


M00043007B:D12 


UC3 -NormColon 


JOIZ 


jU /uoo 


zu/y.jvZ4.gz4J z/iyz4 




M00043014C:G06 


UC3 -NormColon 


Jul J 


j l y j i o 


zuyu.t>ui.gz4J z /jyuo 


F 


M00043 153B:B09 


UC3-PrimColon 




4j jy4o 


ZUyu.rlUl .gz4J Z/jyl4 


r 


M00043 172C:G06 


UC3-PrimColon 


JO 1J 


S9767G 

jz /o /y 


zuyu.L,ui.gz4J z/jyio 


r 


M00043 185B:D02 


UC3-PnmColon 


JOIO 


4S4467, 


zuyu.ru 1 .gz43_z / JyZz 


T7 

r 


M0004320 1 D:D03 


UC3-PrimColon 


JU l / 


S9 1 X4ft 
JZ 1 o4U 


9nGn Fin 9 rr-»/!7 77'2O0/I 

zuyu.i/uz.gz4j z/jyzo 


T7 

r 


X if f\r\r\ a o i /"/An . r\c 

MU0043 160B:E05 


UC3-PnmColon 


JOI o 


JZ070 1 


9HOn A/TM7 ^v-»/17 7T3O0? 

zuyu. muz. gz4 3_z /jyj3 


T? 

r 


x ifr\r\f\ a o i oAn t^/\o 

M00043 189C:F08 


UC3-PrimColon 


JOl7 


S984ft4 


zuyu.L/Uo.gz4j z/oy4i 


r 


MU0043 156C:E05 


UC3-PnmColon 


JUZ-U 


S9^674 
JZjD / 4 


zuyu.iNU4.gz4J z/jyoo 


T7 

r 


X if AAA /€ -} 1 ./^» 1 1 

MU0043 193C:G1 1 


UC3-PnmColon 


JUZ, 1 


44.6768 

HHD / UO 


9000 T7fK cttA'Z 971Q7A 

zuyu.ruj.gz4j_z / jy /o 


r 


MUUU43 168C:F03 


UC3-PnmColon 


S677 


1 9R74Q 
izo / H-y 


zuyu.xiuo.gz4j z/jyy4 


T7 

r 


MUUU43 1 72D:H03 


UC3-PrunColon 


5623 


525781 


2090.K06.gz43_273997 


F 




u -1 nmLOion 


5624 


452775 


2090.N06.gz43_274000 


F 


M00043193C:H09 


UC3-PrimColon 


5625 


524721 


2090.L07.gz43_274014 


F 


M00043186C:A11 


UC3-PrimColon 


5626 


454202 


2090.M07.gz43_2740 1 5 


F 


M00043190A:B10 


UC3-PrimColon 


5627 


452325 


2090J08.gz43_274028 


F 


M00043180C:B02 


UC3-PrimColon 


5628 


453804 2090.F09.gz43_274040 


F 


M00043169A:F05 


UC3-PrimColon 



206 



Table 2 



SEQ 
ID 
NO 


CLUSTEI 


3 SEQ NAME 


0RIE 
NT 




LIBRARY 


5629 


454007 


9000 OOQ vtvA'X 77/1H/IQ 
ZUi7U.V^U;/.gZ4,j Z /hU^V 


r 


M00043197D:H12 


UC3-PrimColon 


5630 


425455 


90Q0 T 10 cr^zl^ 77/10X7 
ZUf U.I_,lU.gZ4.j Z /4UOZ 


c 
r 


M00043 1 86D:B09 


UC3-PrimColon 


56"? 1 


453354 


90Q0 HI 1 crvAl IIACilA 
ZU^U. rll 1 .gZ4.5 Z /4U 


r 


M00043173B:F03 


UC3-PrimColon 


5632 


525315 

JLJJ X J 


9000 T1 1 c*t43 77/lfi7^ 
Z.\jy\J.lL l.gZ4.5 Z /^U /j 


r 


M00043 1 76C: AO 1 


UC3-PrimGolon 


5633 


4538Q0 

^ J J07U 


9000 "Mil avA'X 7740ftH 
ZUi*U. rN 1 l.gZ4 j Z /4U5U 


17 

r 


JV100043 194A:G06 


UC3-PnmColon 


5634 


454460 


90Q0 A 19 cfrAI HACIQI 
ZU;/U./\lZ.gZ4 3 Z /4Uo.5 


T? 

r 


M00043 15 1B:D02 


UC3-PrimColon 


5635 


592860 

J^OU7 


90Q0 r*19 cr'wA'X 77/lO.ft^ 

zuyu.L iz.gz4 j z /4Uoj 


T7 

r 


M00043 158A:F03 


UC3-PrimColon 


5636 


454664 


9000 113 (y>r4'3 77/1 1H7 

ZUyU.ll0.gZ4 ,3 Z /41U / 


r 


M00043 1 76C :E 1 2 


UC3-PrimColon 


5637 


528360 


9000 1 H rr-7/13 77/11 in 
ZU7U.L1 J.gZ4.5 Z /41 iU 


¥7 

r 


M00043 1 87C:E06 


UC3-PrimColon 


5638 


526530 


90Q0 T14 rY-r/13 77/117/1 
ZUVU.J 14.gZ4 j Z /41Z4 


r 


M00043181C:C10 


UC3-PrimColon 


5639 


4591 89 


90Q0 Ol 4 fY-y/11 77/11 7G 
ZUi/U.U 14.gZ4 D z/4izy 


T7 

r 


A A r\c\f\ /< o 1 c\r\ a t^/\-^ 

M00043 199A:F03 


UC3-PrimColon 


.JU*TU 


590SQQ 

JZUJ77 


HH rt-ril 17/1 110 
ZUVU.rll j.gz4J Z /41 io 


r 


M00043 1 73 C : D 1 2 


UC3-PrimColon 


5641 


530071 

JJU7 / 1 


7000 T 1 rr-y/11 77/f 1 <o 

ZUVU.I-flO.gZ4j Z/413o 


r 


A A f\f\r\ /to i n ?7r\ x t/\ >t 

M00043 187D:H04 


UC3-PrimCoIon 


5649 


9803 16 


70Q0 "MIA crrrAI 77/1 1 AH 

ZU^U.rN IO.gZ4J Z /41oU 


T7 

r 


M00043 194D:E10 


UC3-PrimColon 


5643 


454731 


90Q0 A 1 7 cttAI HA 1 ^"2 

zuyu.Ai /.gz4.5 z /4ioj 


r 


M00043 152A:F10 


UC3-PrimColon 


5644 


453730 


90Q0 V\ 1 7 cr-7/13 77/1 1 
ZU7U.JU1 /.gZ4.5 z/4loo 


T7 

r 


A X f\/\i\ A "*> 1 /"ll^ ■■— < 

M00043 162D:E06 


UC3-PrimColon 


5645 


450784 

^JU / 0*T 


90Q0 017 ct-rA'X 77/11^0 

ZU7U.UI /.gz*tj z/4ioy 


rr 
r 


M00043 1 7 1 D:G08 


UC3-PrimColon 


5646 


404081 


9000 Ml 7 a^A'X 77417^ 

Z.U;*U.1V1 1 / .gZHO Z /HI / J 


rr 
r 


MUU043 1 9 1 C:G05 


UC3-PnmColon 


5647 


456793 


9000 AIR c*tA*X 77A170 
ZU:/U./\lo.gZ4.5 z/Hi/y 


T7 

r 


MU0043 152A:G08 


UC3-PnmColon 


5648 


596440 


90Q0 PIS rt-74^ 7 74 1 ft 1 

ZU7U.^l0.gZM'J Z/41ol 


17 

r 


MUOU43159A:C01 


UC3-PnmColon 


5649 


413603 

*T 1 JU7J 


9H00 HI SI otA'? 77/11 £A 


17 

r 


A Jff\f\f\A *J 1 *T A T"> .T>r*Z* 


UC3-PnmColon 


5650 


45431 1 


9000 T18 ot41 77/11 fifi 
ZU7U.J lo.gZHJ Z /41oo 


T7 

r 


MUU043 182B:C02 


UC3-PnmColon 


5651 


596733 

/ jj 


7000 W90 ct'rA'X 77/10 1 

zuvu.nzu.gz4 .5 z/4Zio 


17 

r 


M00043 174C:C06 


UC3-PrimColon 


5652 


4391 50 


90Q0 K"70 cttA'X 77/177 1 
ZUfU.IVZU.gZ4 .3 Z/4ZZ1 


17 

r 


MU0043 1 84D:G06 


UC3-PnmColon 


5653 


453083 


9000 M70 cttA'X 77/1773 
ZUfU.lVlZU.gZ4.3 Z /HZZ-) 


17 

r 


MUU043 192B.H06 


UC3-PnmColon 


5654 


51 8179 

J 1 O 1 /z 


90Q0 "M70 rr-y/13 77/177/1 
ZUyU.lNZU.gZHJ Z /4ZZ4 


17 

r 


M00043 1 95D: A05 


UC3-PrimColon 


5655 


454690 


70Q0 P77 rr-*/13 77/l7^fi 
ZU:?U..rZZ.gZ4j_Z /4Zjo 


17 

r 


M00043210C:E08 


UC3-PrimColon 


5656 


530030 


9000 A73 rr*y/13 77/17^G 
ZU?U./\Zj.gZHj Z /HZjV 


17 

r 


M00043 152B:H04 


UC3-PnmColon 


5657 


934970 


90Q0 f^73 rr-y/13 77/17/^ 
ZU"U.\JZ J .gZ4.5 Z / 4ZO0 


r? 

r 


MU0043 172C:B05 


X T^^t ^ x"fc * t 

UC3-PnmColon 


5658 


590749 
JZ" / *+z 


70Q0 T74 nvAl 11A1Q1 
ZU?U.lZ4.gZ4.j Z / 4zoj 


17 

r 


M00043 1 78C:G06 


UC3-PrimColon 


5659 


40446 1 


90Q9 ^104 citA'X HAAAA 
ZUyz.UU4.gZ4 .5 Z /4444 


17 

b 


M00043438B:E10 


UC3 -ColonMetLiver 


5660 


J.S 1073 


70Q7 TO/1 cttAI 11AAA£L 
ZUyZ.lUH.gZHJ __Z /444u 


F 


A A f\f\f\ A*y A A O T Tf\f\ 

M00043448D:H09 


UC3-ColonMetLiver 


5661 


S3Szl3£ 
Jjj'tjD 


7HG7 /^n^ rr-^/1'2 77/l/i/^n 
ZUyZ.VJU3.gZ4 J Z /44oU 


F 


M00043438C:D12 


UC3-ColonMetLiver 


5662 


456001 


9009 "MO^ a-rdl 11AA6J1 
ZU7Z.1NUJ .gZ4j Z/440/ 


17 

r 


A jaaa a ^ a nrn/~i a ai 

M00043470C:A0l 


UC3 -ColonMetLiver 


5663 


450794 

*tJU / Z*T 


70Q7 HHA rr-y/1'3 HAAQA 

ZUyZ. UUO.gZ4j Z /44o4 


T7 

r 


MOOO43478B:D0 1 


UC 3 -ColonMetLiver 


5664 


530149 

JJ7 Iti 


9009 A 07 crT4^ 774/19A 
Z.uyz,.JW 1 .gZ4j Z /44oD 


T7 

r 


A/f AAAHTC A1 A .TTA^ 

MUU042592A:H02 


UC3 -ColonMetLiver 


5665 


450914 


2092.K07.gz43_274496 


F 


M00043455CG07 


ij v--? -^uiomvieiJLjiver 


5666 


455855 


2092.P07.gz43_274501 


F 


M00043484D:H08 


UC 3 -ColonMetLiver 


5667 


537451 


2092.F08.gz43_274507 


F 


M00043433A:F02 


UC3-ColonMetLiver 


5668 


535123 


2092.G08.gz43_274508 


F 


M00043439C:D05 


UC3 -ColonMetLiver 


5669 


452830 


2092.J08.gz43_274511 


F 


M00043452D:D05 


UC3-ColonMetLiver 


5670 


534054 


2092.L08.gz43_274513 


F 


M00043460D:C03 


UC3-ColonMetLiver 
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Table 2 



SEQ 
ID 
NO 


PT T TCTFT 


2 QFH AT AlVyfP 


Wrvlr! 


CLONE ID 


LIBRARY 


56/1 


162851 


2092. M08.gz43_2 745 14 


F 


M00043465B:G08 


UC3-ColonMetLiver 


JO/2 


/i <c don 
455289 


2092.D09.gz43 274521 


F 


M00043420C:C02 


UC3-ColonMetLiver 


5673 


456020 


2092.H09.gz43_274525 


F 


M00043446B:H01 


UC3 -ColonMetLiver 


5674 


448946 


2092.109.gz43_274526 


F 


M00043449C:H09 


UC3-ColonMetLiver 


JO / J 


539353 


2092.L09.gz43 274529 


F 


M00043460D:H01 


UC3-ColonMetLiver 


jo/o 


450658 


2092.P09.gz43 274533 


F 


M00043485A:C04 


UC3-ColonMetLiver 


JO / / 


CA A 

544355 


2092.bl0.gz43 274538 


F 


M00043427C:E11 


UC3-ColonMetLiver 


JO/5 


452128 


2092.F10.gz43_274539 


F 


M00043434A:H02 


UC3-ColonMetLiver 


jo /y 


A A "7 C"5 

447534 


2092.1 10.gz43_274542 


F 


M00043449D:A06 


UC3-ColonMetLiver 


JOoO 


455256 


2092.P10.gz43 274549 


F 


M00043485B:C09 


UC3-ColonMetLiver 


jool 


455201 


2092. C 1 1 .gz43274552 


F 


M00042640C:C10 


UC3-ColonMetLiver 


568Z 


535129 


2092.Fll.gz43 274555 


F 


M00043434B:D06 


UC3 -ColonMetLiver 


5683 


455808 


2092 .HI l.gz43_274557 


F 


M00043446C:E04 


UC3 -ColonMetLiver 


5684 


T1 C5 ^ZT 

215366 


2092.11 1 .gz43_274558 


F 


M00043449D:E09 


UC3-ColonMetLiver 


5685 


456254 


2092. J 1 Lgz43_2 74559 


F 


M00043453B:C06 


UC3-ColonMetLiver 


5686 


536415 


2092 .Nl 1 .gz43_274563 


F 


M00043472A:E06 


UC3-ColonMetLiver 


568 / 


452801 


2092 .F 1 2 .gz432745 7 1 


F 


M00043434B:E04 


UC3-CoIonMetLiver 


5688 


400258 


2092.H12.gz43_274573 


F 


M00043446C:E08 


UC3-ColonMetLiver 


5689 


546121 


2092 J 1 2 .gz43_2745 75 


F 


M00043453B:F02 


UC3-ColonMetLiver 


5690 


A C A tZ 1 ^ 1 

454621 


2092.K12.gz43_274576 


F 


M00043457C:B12 


UC3-ColonMetLiver 


5691 


455162 


2092 . D 1 3 .gz43 2745 85 


F 


M00043422B:C03 


UC 3 -ColonMetLiver 


5692 


543429 


2092 .P 1 3 .gz43_274597 


F 


M00043486B:D02 


UC3-ColonMetLiver 


5693 


456249 


2092 . 1 1 4 .gz43_274606 


F 


M00043450B:C11 


UC3-ColonMetLiver 


5694 


531461 


2092 .E 1 5 .gz432746 1 8 


F 


M00043428C:A08 


UC3-ColonMetLiver 


J 695 


455572 


2092 .HI 5 .gz4327462 1 


F 


M00043447A:F02 


UC 3 -ColonMetLiver 


5696 


456783 


2092.11 5. gz43 274622 


F 


M00043450B:H05 


UC3-ColonMetLiver 


569/ 


A A 

2544 


2092 .N 1 5 .gz43_274627 


F 


M00043473C:B11 


UC 3 -ColonMetLiver 


joy© 


543431 


2092.P 1 5 .gz43_274629 


F 


M00043486C:D02 


UC3-ColonMetLiver 


jo99 


402534 


2092 .016 .gz43_274644 


F 


M00043481A:G02 


UC3«ColonMetLiver 


J /00 


454563 


2092 .1 1 7.gz43_274654 


F 


M00043450C:F11 


UC3-ColonMetLiver 


5701 


541901 


2092 . K 1 7 .gz43_27465 6 


F 


M00043459A.B08 


UC3-ColonMetLiver 


5702 


536225 


2092.L17.gz43274657 


F 


M00043462C:E12 


UC3-ColonMetLiver 


5703 


A Cf OTA 

455379 


2092.M17.gz43274658 


F 


M00043468C:D08 


UC3-ColonMetLiver 


5 /04 


456567 


2092. P 1 7.gz43_27466 1 


F 


M00043486C:F07 


UC3-ColonMetLiver 


5705 


539804 


^ /^V^ 1 O A ^> ^ ^» A a*' a*" A 

2092 . C 1 8 .gz43 274664 


F 


M00043417A:H12 


UC3-ColonMetLiver 


5706 


452618 


2092 .N 1 8 ,gz43 274675 


F 


M00043473D:D11 


UC3-ColonMetLiver 


S707 


JtJOJJ 


ZUyZ.J Izf.gZH J Z/*fOo/ 


r: 
r 


JV100043453D:D02 


UC3 -ColonMetLiver 


5708 


415326 


2092.L19.gz43_274689 


F 


M00043462D:C09 


UC3 -ColonMetLiver 


5709 


455117 


2092.B20.gz43_274695 


F 


M00042600D:B08 


UC3-ColonMetLiver 


5710 


455601 


2092.C20.gz43_274696 


F 


M00043417B:F12 


UC3-ColonMetLiver 


5711 


455254 


2092.E20.gz43_274698 


F 


M00043430D:C01 


UC3-ColonMetLiver 


5712 


402916 


2092.120.gz43_274702 


F 


M00043451A:B02 


UC3-ColonMetLiver 
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Table 2 



SEO 
ID 
NO 


CLUSTEF 


SEO NAME 


ORIE 
NT 


PT OMR TO 


T TOD A T>V 

L1BKAKY 


J/Ij 


433004 


7AQ7 TVT7A n-rr/i"2 OO/fOAO 
zUyz.INzU.gz43 _Z /4 /U / 


t 


M00043474C;H05 


UC3 -ColonMetLiver 


^71 A 
3 1 14 


Z /3oO 


zUyz.FzU.gz43_z /470y 


F 


M00043489C.H06 


UC3-ColonMetLiver 


^71 ^ 

3 1 13 


/1^77G7 
43Z3yZ 


7AG7 DTI r+~A 1 OOM*71 1 
zUyz.oz 1 .gZ43_Z /47 1 1 


F 


M00042600D:E11 


UC3-ColonMetLiver 


^7 1 £ 
J / io 


44 / joU 


zUyz.l_,zl .gz4J_z /47 12 


F 


M00043417C:F12 


UC3 -ColonMetLiver 


<171 7 
3 / 1 / 


445Z3U 


ZUyz.lzl.gz43 z/4718 


F 


M0004345 1B:D08 


UC3 -ColonMetLiver 


J/lo 


^7A1 71 
3301 / 1 


zUyz.ozz.gz43_z /4 /z7 


F 


M0004263 7C : E 1 2 


UC3-ColonMetLiver 


^71 O 

-) / iy 


4333U1 


ZUyz .rizz .gz43_z7473 3 


F 


M00043448B:E08 


UC3-ColonMetLiver 


^77H 
J /ZU 


433oUo 


O AGO ATO rt-yj? T7/n>in 

zUyz.Uzz.gz43_Z /474U 


F 


H Mf\f\f\ A *> A O ^ T~* /^ai 

M00043483B:G01 


UC3-ColonMetLiver 


S77 1 


/177777 


7flQ7 r'TJ ft-yJl "IHAHAA 

ZxJyz . »Oz3 .gz43_z /4 /44 


r 


M000434 1 8 A: A06 


U C3 -ColonMetLiver 


^777 
J /ZZ 


4335 14 


OAGO ATI r^^AI OT/n^iC 

ZUyz.LlZ3.gz43_z /4 /jo 


T? 

r 


A Mf\ f\f\ A O A O *y Ti 1 1 

M00043483B;G1 1 


UC3-ColonMetLiver 


^771 


^7Q^oo 
33o3oZ 


7HQ7 til A n ~,A1 "inAntLZ. 

ZUyz.riz4.gz43_z /4 /o3 


r 


M00043448C:G07 


UC3-ColonMetLiver 


^77zl^ 
3 /Z4 


AA777ft 


ZUyz.lz4.gz43_z /4 /oo 


F 


M0004345 1 C :H03 


UC3 -ColonMetLiver 


^77^ 
J /Z3 


4303 / / 


zuyz.Fz4.gz43 Z/4//3 


F 


M00043490A:F03 


UC3 -ColonMetLiver 


^77£ 
J /ZD 


1 1 ^7£ 
113/0 


z 1 /o.r U 1 .gz43_zo 1 003 


F 


M00039503B:G05 


FIMEC-Unt 


^777 
3 /Z / 


4U14ZO 


zl /o.rUl.gz43 Z51693 


F 


M0003 9605 D : E02 


HMEC-Unt 


^778 
J /Zo 


AfiG77n 

4uyj3U 


z 1 /o.r>Uz.gz43_zo 1 oy / 


F 


M00039363B:H10 


HMEC-Unt 


^77G 

j /zy 


/I A71 

4U / iy3 


zl /o.CUz.gz43_Z5l698 


F 


M00039369D:D1 1 


HMEC-Unt 




/111 QQ< 

41 iy<S3 


zl /o.UUz.gz43_z81699 


F 


M00039379A:D03 


HMEC-Unt 


^77 1 


A 1 7/^7 1 

41zozl 


zl /o.r Uz.gz43_zo 1701 


F 


M00039503D:E08 


HMEC-Unt 


^777 
J /3Z 


4U //ll 


O 1 T TAP r~~wA 1 0 0 1 TAO 

zl /o.rlUz.gz43 zol /Ui 


F 


M000395 14B:E07 


FIMEC-Unt 


<777 
3 / 33 


A 1 1 1 1 1 
411113 


zl /o.OU3.gz43_281718 


F 


M00039508D:C03 


HMEC-Unt 


3 /34 


/1A7/1 1 1 

4UZ41 1 


TA/I O0 1T5/T 

zl76.1U4.gz43_z81736 


F 


M00039523A:D09 


FDVIEC-Unt 


3 /33 


14 /z /y 


zl /o.rUo.gz43_zol7o3 


F 


M00039504B:F05 


FIMEC-Unt 


3 / JO 


4UUoZo 


OTJ/f T A/I rt-^/n O01T71 

zl /o.LUo.gz43_zo 1771 


F 


Tk if AAA1 AT /" /"TV t~i r\ c\ 

M00039566D:E08 


HMEC-Unt 


^777 
3 /3 / 


4U /443 


017/; /^ATf 00 1T70 

zl /o.CU/.gz43_z5l778 


F 


M0003937 1 A:E0 1 


HMEC-Unt 


3 / Jo 


41334b 


O 1 T/C XTAT O OOI "7 on 

zl /o.IN]U7.gz43_zol7d9 


F 


M00039592C:F09 


HMEC-Unt 


3 / J7 


1Z3343 


zi /o.uu/.gz43_zoi /yu 


F 


Ti^AAAIACAA A ~WT\Ci 

M00039599A:E08 


HMEC-Unt 


3 /4U 


7fi77na 

ZU/jUo 


Zl /0.rU/.gz43 zol/yi 


c 
r 


M00039606C:B07 


T X» WW* XT 

HMEC-Unt 


^741 
3/41 


/inn/178 

4UU4Z5 


7 1 7/i FAQ r***A 1 OO 1 OAO 

zl /o.lvUo.gz43_zo loUz 


F 


M000395 62 A: D 1 0 


HMEC-Unt 


^747 
3 /4Z 


JAA/IOA 

4UU4ZO 


7 1 7A UAQ r»rr/i ^ OO 1 O K 

zl /o.rluy.gz43_zolol3 


F 


A /f AAA*) t\C 1 f T T/\ f 

M0003 95 1 6C :H06 


X X"» — r T t j 

HMEC-Unt 


3 /4 3 


A(\0 1 /17 
4UZ14 / 


717/i T>AO rv^/f 1 00 100*2 
Zl /O.FUy.gZ43_ZO 15Z3 


T? 

r 


HJAAATft^A^ A .mc 

M00039607A:F05 


X X» XX^ XT . 

HMEC-Unt 


^74 A 
3 /44 


/1 Afi777 
4UUZ3 J 


717£ T/in ^—/i Q OOIO^/I 

Zl /O.KlU.gz43 zolo34 


F 


M0003 95 62D : B02 


x x* m xt 1 

FIMEC-Unt 


3 /43 


4Uoy5o 


7 1 7£ O OOIO^A 
Zl /O.lVl 1 .gZ43_Zolo3U 


T? 

r 


Tl/fAAA7AC/ , rr\ PAI 

M00039562D:G01 


T TTk IT 1 T T j_ 

FIMEC-Unt 


^7 J. (s 
3 /40 


4UU1 / 1 


zl /o.r>lz.gz43_zo l03 / 


r 


l/TAAAOAT^^r'.TTI A 

M0003 93 66C : B 1 0 


x x"» ir</i xt ^ 

HMEC-Unt 


^7A7 
3 /4 / 


4U30 / 1 


O 1 7/C ri7 OOIO^O 

zl /o.dz.gz43_zolo3o 


F 


A /TAAAIA'lTin.TTAA 

M00039373B:B09 


X XTk M"W^ XT j 

HMEC-Unt 


3 /4o 


4UUU4 / 


7 17^ A 17 rrrr/17 70 1077 
Zl /O.Al3.gZ43_Zol5 /Z 


T7 

r 


A jffAAAOAl OA A . A AO 

MUUU39 1 84A: A08 


T TTk jfT"' X T j. 

HMEC-Unt 


5749 


404736 


2176 Rll c»74^ 981 87^ 

Zil / v.JJ 1 J.g^^tJ ^OlO / J 


1 




TTA/fr^ TT-+ 

riiviii V- - u nx 


5750 


413627 


2176.F13.gz43_281877 


F 


M00039505B:G07 


HMEC-Unt 


5751 


407260 


2176K13.gz43_281882 


F 


M00039563C:D01 


HMEC-Unt 


5752 


155374 


2176.M13.gz43_281884 


F 


M00039586A:C03 


HMEC-Unt 


5753 


409612 


2176.114.gz43_281896 


F 


M00039525B:H03 


HMEC-Unt 


5754 


402411 


2176A16.gz43_281920 


F 


M00039184C:C05 


HMEC-Unt 
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Table 2 



oily 

TO 

NO 


ULfUo 1 UK 


oriV£ JNAJVUi 


NT 


CJLUINi} ID 


LIBRARY 


5755 


411124 


2176.C17.gz43_281938 


F 


M00039375A:C09 


HMEC-Unt 


5756 


412416 


2 1 76 . J 1 7 .gz43_2 8 1 945 


F 


M00039534D:E07 


HMEC-Unt 


5757 


406734 


2176.M17.gz43_281948 


F 


M00039586D:D05 


HMEC-Unt 


5758 


408306 


2176.K18.gz43_281962 


F 


M00039564B:F08 


HMEC-Unt 


5759 


1054 


2176.E19.gz43 281972 


F 


M00039502D:H03 


HMEC-Unt 


5760 


406855 


2176.119.gz43_281976 


F 


M00039527B.D06 


HMEC-Unt 


5761 


400454 


2176.H20.gz43_281991 


F 


M00039521A:D02 


HMEC-Unt 


5762 


402147 


2 1 76 J20.gz43_28 1 993 


F 


M00039536B:E10 


HMEC-Unt 


5763 


202308 


2 1 76.M20.gz43_28 1 996 


F 


M00039588A:H12 


HMEC-Unt 


5764 


413997 


2 1 76. J2 1 .gz43_282009 


F 


M00039536B:H03 


HMEC-Unt 


5765 


409589 


2 1 76.K2 1 .gz43_2820 1 0 


F 


M00039564C:H05 


HMEC-Unt 


5766 


409262 


2 1 76.M2 1 .gz43_2820 12 


F 


M00039588B:H02 


HMEC-Unt 


5767 


405932 


2176.H22.gz43_282023 


F 


M00039521C:B01 


HMEC-Unt 


5768 


20517 


2 1 76.M22 .gz43_282028 


F 


M00039588D:F10 


HMEC-Unt 


5769 


413915 


2l76.P23.gz43_282047 


F 


M00039610A:G11 


HMEC-Unt 


5770 


412483 


2l76.F24.gz43_282053 


F 


M00039507D:E03 


HMEC-Unt 


5771 


402392 


2176.G24.gz43_282054 


F 


M00039514A:G03 


HMEC-Unt 


5772 


401510 


2176J24.gz43_282057 


F 


M00039537B:F06 


HMEC-Unt 


5773 


408649 


2176.L24.gz43_282059 


F 


M00039583A:G09 


HMEC-Unt 


5774 


418682 


2178.K02.gz43_282160 


F 


M00040183B:C06 


HMEC-bFGF 


5775 


417259 


2178.E04.gz43_282186 


F 


M00040141A:G10 


HMEC-bFGF 


5776 


20517 


2178.P04.gz43_282197 


F 


M00040235C:D02 


HMEC-bFGF 


5777 


414821 


2 1 78 .G05 .gz43_282204 


F 


M00040160C:A04 


HMEC-bFGF 


5778 


402534 


2178.K05.gz43_282208 


F 


M00040183D:C08 


HMEC-bFGF 


5779 


402411 


2 1 78.D07.gz43_282233 


F 


M00040134A:A07 


HMEC-bFGF 


5780 


419255 


2178.L07.gz43_282241 


F 


M00040188B:E05 


HMEC-bFGF 


5781 


417426 


2 1 78 .G 1 1 .gz43_2 82300 


F 


M00040161C:H06 


HMEC-bFGF 


5782 


415527 


2178.G12.gz43_282316 


F 


M00040161D:C03 


HMEC-bFGF 


5783 


418340 


2178J12.gz43_282319 


F 


M00040181B:B06 


HMEC-bFGF 


5784 


403154 


2178.L12.gz43_282321 


F 


M00040189D:A06 


HMEC-bFGF 


5785 


403837 


2178J13.gz43282335 


F 


M00040181B:C05 


HMEC-bFGF 


5786 


418482 


2178.L14.gz43_282353 


F 


M00040190B:C02 


HMEC-bFGF 


5787 


419751 


2 1 78 .C 1 5 .gz43_282360 


F 


M00039760B:F12 


HMEC-bFGF 


5788 


402070 


2 1 78 .B 1 6.gz43_282375 


F 


M00039752D:D07 


HMEC-bFGF 


5789 


402070 


2178.N16.gz43_282387 


F 


M00040227A:E07 


HMEC-bFGF 


5790 


420504 


2178.C18.gz43_282408 


F 


M00039760C:H07 


HMEC-bFGF 


5791 


147279 


21 7o.F19.gz4J_zoz4J7 


r 


M0U04U24oA:OU9 


HMtC-brtrr 


5792 


402353 


2178.H20.gz43_282445 


F 


M00040171B:H03 


HMEC-bFGF 


5793 


416914 


2178.L20.gz43_282449 


F 


M00040196B:F10 


HMEC-bFGF 


5794 


163970 


2178.E2Lgz43_282458 


F 


M00040145B:C12 


HMEC-bFGF 


5795 


416762 


2178.021.gz43_282468 


F 


M00040233C.F09 


HMEC-bFGF 


5796 


57183 


2178.M22.gz43_282482 


F 


M00040222A:E06 


HMEC-bFGF 
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Table 2 



Tn 

LLJ 

NO 


PT TTCT1713 




NT 


C*1 CXWT2 1T\ 


LIBRARY 


5797 


a r\ a a s\ y 

403306 


2 1 84.D0 1 .gz43_282670 


F 


M00039950C:C05 


HMEC-VEGF 


5798 


402298 


2 1 84X01 .gz43_282678 


F 


M00040328D:A03 


HMEC-VEGF 


5799 


420958 


2 1 84. K02.gz43_2 82693 


F 


M00040322B:A08 


HMEC-VEGF 


5800 


423884 


2 1 84. A03.gz43_282699 


F 


M00039747A:H06 


HMEC-VEGF 


5801 


268336 


2 1 84 J04 .gz43_2 82724 


F 


M00040318C:B09 


HMEC-VEGF 


5802 


422590 


2 1 84.O07.gz43_282777 


F 


M00040364B:E11 


HMEC-VEGF 


5803 


402298 


2184.A08.gz43_282779 


F 


M00039749B:G05 


HMEC-VEGF 


5804 


423534 


2 1 84. J 1 1 .gz43_282836 


F 


M00040319D:G10 


HMEC-VEGF 


5805 


423008 


2184.K13.gz43 282869 


F 


M00040323D:F04 


HMEC-VEGF 


5806 


424723 


2184.N13.gz43_282872 


F 


M00040347B:B11 


HMEC-VEGF 


5807 


20517 


2 1 84.D 1 6.gz43_2829 1 0 


F 


M00039958D:D05 


HMEC-VEGF 


5808 


403306 


2 1 84 . A 1 7.gz43_282923 


F 


M00039750C:F08 


HMEC-VEGF 


5809 


140224 


2 1 84 . A22 .gz43_2 83 003 


F 


M00039762C:D11 


HMEC-VEGF 


5810 


402298 


2 1 84.G24.gz43_28304 1 


F 


M00040294B:E09 


HMEC-VEGF 


5811 


468930 


1564.A01.gz43_296552 


F 


M00042709B:G05 


UC2-NormColon 


5812 


463143 


1564.B01.gz43_296553 


F 


M00042711B:H04 


UC2-NormColon 


5813 


465785 


1 5 64 . CO 1 .gz43__296554 


F 


M00042713B:C08 


UC2-NormColon 


5814 


453766 


1564.101. gz43_296560 


F 


M00042724A:G02 


UC2-NormColon 


5815 


462398 


1 564. JO 1 .gz43_29656 1 


F 


M00042726D:G11 


UC2-NormColon 


5816 


390563 


1 5 64 . 00 1 .gz43_2965 66 


F 


M00042737B:C07 


UC2-NormColon 


5817 


447189 


1 5 64 . C03 ,gz43_2965 86 


F 


M00042713B:F03 


UC2-NormColon 


5818 


462348 


1564.H03 .gz43_29659 1 


F 


M00042722A:G08 


UC2-NormColon 


5819 


446740 


1 5 64 . 004 .gz43_2966 1 4 


F 


M00042737C:C07 


UC2-NormColon 


5820 


447268 


1564. COS .gz43_296618 


F 


M00042713C:B08 


UC2-NormColon 


5821 


461316 


1 5 64 . K05 .gz43_296626 


F 


M00042729A:F11 


UC2-NormColon 


5822 


420686 


1 5 64 .N05 .gz43_296629 


F 


M00042735C:G02 


UC2-NormColon 


5823 


447579 


1 5 64 .005 .gz43_29663 0 


F 


M00042737C:E03 


UC2-NormColon 


5824 


447006 


1564.B06.gz43_296633 


F 


M00042711C:H10 


UC2-NormColon 


5825 


463824 


1 5 64 .N06 .gz43_296645 


F 


M00042735D:A07 


UC2-NormColon 


5826 


461135 


1564.G07.gz43_296654 


F 


M00042720C:E03 


UC2-NormColon 


5827 


388688 


1564.A08.gz43_296664 


F 


M00042710A:G10 


UC2-NormColon 


5828 


462986 


1564.K08.gz43_296674 


F 


^ AT *f\ *f\ *f\ Jt j*^ /"\ A T TAA 

M00042729A:H08 


UC2-NormColon 


5829 


454825 


1564.M08.gz43_296676 


F 


M00042733C:C05 


UC2-NormColon 


5830 


466614 


1564.O08.gz43_296678 


F 


M00042737D:D03 


UC2-NormColon 


5831 


457975 


1564.A09.gz43_296680 


F 


M00042710B:B09 


UC2-NormColon 


5832 


446345 


1564.109.gz43__296688 


F 


M00042724D:B04 


UC2-NormColon 


3833 


456330 


1 364.C 1 1 .gz43_2yt> / 14 


r 


IYlU(JU4z / I4A.CrU4 


UL2 -in onn^oion 


5834 


462337 


1564.Ell.gz43_296716 


F 


M00042716D:G04 


UC2-NormColon 


5835 


447387 


1564.Nll.gz43_296725 


F 


M00042736A:F03 


UC2-NormColon 


5836 


467780 


1564.B12.gz43_296729 


F 


M00042712A:E08 


UC2-NormColon 


5837 


447561 


1564.C12.gz43_296730 


F 


M00042714A:H05 


UC2-NonnColon 


5838 


461313 


1564.K12.gz43_296738 


F 


M00042729B:F10 


UC2-NormColon 
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Table 2 





SEQ 

TT"\ 

ID 






UKlb 








JNU 


CLUSTER 


SEQ NAME 


IN 1 


CLONE ID 


LIBRARY 




5839 


453679 


1564.L12.gz43_296739 


F 


M00042732A:A11 


UC2-NormColon 




5840 


446964 


1564.G13.gz43_296750 


F 


M00042721A:G07 


UC2-NormColon 




5841 


459158 


1564.H13.gz43_296751 


F 


M00042722D:C12 


UC2-NormColon 




5842 


450723 


1564.M13.gz43_296756 


F 


M00042733D:G08 


UC2-NormColon 




5843 


447826 


1564.C15.gz43_296778 


F 


M00042714B:E01 


UC2-NormColon 




5844 


446741 


1564.D15.gz43_296779 


F 


M00042715D:E02 


UC2-NormColon 




5845 


452687 


1564F15.gz43_296781 


F 


M00042719A:H06 


UC2-NormColon 




5846 


446601 


1564.H15.gz43_296783 


F 


M00042723A:D09 


UC2-NormColon 




5847 


447649 


1564.B19.gz43_296841 


F 


M00042712C:E06 


UC2-NormColon 




5848 


465528 


1564C19.gz43_296842 


F 


M00042714C:C10 


UC2-NormColon 




5849 


462865 


1564.E19.gz43_296844 


F 


M00042717B:G11 


UC2-NormColon 




5850 


405932 


1564.F19.gz43_296845 


F 


M00042719C:H05 


UC2-NormColon 


Q 


5851 


457922 


1564.G19.gz43_296846 


F 


M00042721D:B03 


UC2-NormColon 


%y 


5852 


463821 


1564N19.gz43_296853 


F 


M00042737A:A07 


UC2-NormColon 


^y 


5853 


463217 


1564.019.gz43_296854 


F 


M00042738D:H12 


UC2-NormColon 




5854 


389425 


1564.A20.gz43_296856 


F 


M00042711A:F01 


UC2-NormColon 


? n 


5855 


446242 


1564O20.gz43_296870 


F 


M00042739A:C06 


UC2-NormColon 


^- 3 


5856 


449171 


1564C21.gz43_296874 


F 


M00042714C:F12 


UC2-NormColon 




5857 


450723 


1564.G21.gz43_296878 


F 


M00042721D:D01 


UC2-NormColon 




5858 


469766 


1564.M21.gz43_296884 


F 


M00042735A:G12 


UC2-NormColon 


C3 


5859 


446866 


1564.D22.gz43_296891 


F 


M00042716A:F09 


UC2-NormColon 




5860 


403949 


1564.M23.gz43_296916 


F 


M00042735B:A06 


UC2-NormColon 




5861 


446922 


1564.023 .gz43_296918 


F 


M00042739A:G07 


UC2-NormColon 


S.3 s 


5862 


235874 


1564.C24.gz43_296922 


F 


M00042714D:E11 


UC2-NormColon 




5863 


558301 


1681A01.gz43_296936 


F 


M00055739A:C09 


UC2-NormColon 




5864 


426366 


1681I01.gz43_296944 


F 


M00055749D:B07 


UC2-NormColon 




5865 


638943 


1681.J01.gz43_296945 


F 


M00055750C:H10 


UC2-NormColon 




5866 


645603 


1681O01.gz43_296950 


F 


M00055758C:H10 


UC2-NormColon 




5867 


642644 


1681 E02.gz43_296956 


F 


M00055744C:D02 


UC2-NormColon 




5868 


644105 


1681.102.gz43_296960 


F 


M00055749D:C01 


UC2-NormColon 




5869 


468613 


1681.A03.gz43_296968 


F 


M00055739B:B06 


UC2-NormColon 




5870 


648905 


1681B03.gz43_296969 


F 


M00055740C:A07 


UC2-NormColon 




5871 


446925 


1681.H03.gz43_296975 


F 


M00055748C:C07 


UC2-NormColon 




5872 


649356 


1681. 103 .gz43_296976 


F 


M00055749D:D06 


UC2-NormColon 




5873 


510724 


1 68 l.K03.gz43 296978 


F 


M00055752C:C06 


UC2-NormColon 




5874 


446676 


1681.L03.gz43_296979 


F 


M00055753D:C06 


UC2-NormColon 




5875 


644937 


1681.B04.gz43 296985 


F 


M00055740C:E06 


UC2-NormColon 




5876 


644342 


1681I04.gz43_296992 


F 


M00055749D:F12 


UC2-NormColon 




5877 


640264 


1681.O04.gz43_296998 


F 


M00055758D:E02 


UC2-NormColon 




5878 


463951 


1681D05.gz43_297003 


F 


M00055743C:D12 


UC2-NormColon 




5879 


644345 


1681.K05.gz43_297010 


F 


M00055752C:F06 


UC2-NormColon 




5880 


646552 


1681.N06.gz43_297029 


F 


M00055757B:B07 


UC2-NormColon 
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Table 2 



TFI 
IN \J 




TvT A TV/f XT 
&r,\l JNAJVlty 


NT 




T TDD A TJX/" 


5881 


611927 


168 1 .K07.gz43_297042 


F 


M00055752C:H07 


UC2-NormColon 


5882 


639420 


168 1 .N07.gz43_297045 


F 


M00055757B:C04 


UC2-NormColon 


5883 


568467 


168 1 . A08.gz43_297048 


F 


M00055739B:H08 


UC2-NormColon 


5884 


645147 


1681.C08.gz43_297050 


F 


M00055742B:H06 


UC2-NormColon 


5885 


447936 


1681.E08.gz43_297052 


F 


M00055744D:A11 


UC2-NormColon 


5886 


492917 


1681.O08.gz43_297062 


F 


M00055759A:B02 


UC2-NormColon 


5887 


649427 


1681 .D09.gz43_297067 


F 


Tk M f\f\f\ C CI A T~\ T% f\ A 

M00055743D:D04 


UC2-NormColon 


5888 


556654 


168LF09.gz43_297069 


F 


M00055745D:F11 


UC2-NormColon 


5889 


594040 


168 1 .H09.gz43_29707 1 


F 


X K f\f\f\ C CIA OR "AO 

M00055748D:C03 


UC2-NormColon 


5890 


488432 


1681J09.gz43_297073 


F 


H ATf\f\f\C CJC 1 A T* , /\^' 

M0005575 1 A:F06 


UC2-NormColon 


5891 


641144 


1 68 1 .K09.gz43_297074 


F 


M00055752D:C01 


U C 2 -N ormColon 


5892 


550223 


168 1 .L09.gz43_297075 


F 


M00055754A:E07 


UC2-NormColon 


5893 


491544 


168 1 .M09.gz43_297076 


F 


M00055755D:C09 


UC2-NormColon 


5894 


648481 


1681.A10.gz43_297080 


F 


M0005 5 73 9C :D 1 1 


UC2-NonnColon 


5895 


150839 


1 68 LB 1 0.gz43_29708 1 


F 


M00055740D:G12 


UC2-NormColon 


5896 


644053 


168 1 .F10.gz43_297085 


F 


M00055746A:C09 


UC2-NormColon 


5897 


644342 


168 1 .K 1 0.gz43_297090 


F 


M00055752D:F01 


UC2-NormColon 


5898 


419479 


168 1.C1 1 .gz43_297098 


F 


M00055742C:C01 


UC2-NormColon 


5899 


645000 


168 1 .El 1 .gz43_297100 


F 


M00055744D:F03 


UC2-NormColon 


5900 


478833 


1 68 1 .L 1 1 .gz43_297 1 07 


F 


M00055754A:H06 


x T/*t XT 1 

UC2-NormColon 


5901 


644684 


1 68 1 .A 1 3 .gz43_297 128 


F 


M00055739D:Bl2 


UC2-NormColon 


5902 


640504 


1 68 1 .B 1 3 .gz43_297 1 29 


F 


M0005 5 74 1 A: D09 


TT/1/» XT- _ 1 

UC2-NormColon 


5903 


463815 


1 68 1 .A 1 4 .gz43_297 1 44 


F 


M00055739D:C03 


T T/^ ^ XT 1 

UC2-NormColon 


5904 


640997 


1681.B15.gz43_297161 


F 


M00055741B:B12 


UC2-NormColon 


5905 


642263 


1681.D15.gz43_297163 


F 


M00055 744 A: B04 


UC2-NormColon 


5906 


451508 


1 68 1 .P 1 5 .gz43_297 1 75 


F 


M00055761A:E06 


T T/^1 X T _ ^ 1 

UC2-NormColon 


5907 


644919 


1681.A16.gz43_297176 


F 


M00055739D:E04 


T ^ xt y~^ t 

UC2-NormColon 


5908 


639703 


1681 .K16.gz43_297 1 86 


F 


M00055753A:D04 


x Ty^t ^ x t y^> 

UC2-NormColon 


5909 


639750 


1681 .Ml 6.gz43_297 1 88 


F 


M00055756C:C03 


T T/™*1 XT _ _ 1 

UC2-NormColon 


5910 


587696 


1681.F17.gz43_297197 


F 


M00055746C:F06 


TT/*1^ XT _ f~\ _ 1 

UC2-NormColon 


5911 


461917 


1 68 1 H 1 7 .gz43_297 1 99 


F 


M00055749B:C10 


U C 2 -N ormColon 


5912 


644510 


168 1 .I17.gz43_297200 


F 


M00055750B:H0 1 


UCz-NormColon 


5913 


526459 


1681.B18.gz43_297209 


F 


M00055741L:A09 


UC2-NormCoIon 


5914 


471277 


1 68 1 . F 1 8 .gz43_2972 1 3 


F 


M00055746C:r 10 


U -iN orm Colon 


5915 


483042 


1 68 1 .K 1 8 .gz43_2972 1 8 


F 


M0005 5 75 3 B : A02 


UCz-NormColon 


5916 


514838 


1681.L18.gz43_297219 


F 


M00055755A:B1 1 


U C2-JN ormColon 


con 


643984 




r 


IVIUUUj D ID I LJ.D\JJ 




5918 


462245 


1681.A19.gz43_297224 


F 


M00055740A:B03 


UC2-NormColon 


5919 


640017 


1681.D19.gz43_297227 


F 


M00055744B:B02 


UC2-NormColon 


5920 


453726 


1681.E19.gz43_297228 


F 


M00055745A:H02 


UC2-NormColon 


5921 


644468 


1681.F19.gz43_297229 


F 


M00055746C:G06 


UC2-NormColon 


5922 


644105 


1681.H19.gz43_297231 


F 


M00055749C:B03 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTEE 


SEQ NAME 


ORIE 
NT 






5993 


44748 S 


1681 KAQcttA'X 9077^4 


r 


JVIUUUj J /jjd.AUo 


XT /"I t 

UCz-NormColon 


SQ94 


9 1 84 1 6 

Z, 1 OH 1 U 


1681 n?fl 074^ 707947 
lOo 1 .L/ZU.gZ't j Zy / Z*t-5 


r 


MUUUjj /44r>:CU5 


UCz-NormColon 


S09S 
jyz j 


48Q040 


1681 T-I70 rtT/l'? 9077/17 

ioo i .rizu.gz^f o zy/Z'f/ 


17 
r 


jyluuUj j /4yC:Cu4 


UC2 -NormColon 


SQ96 

J7ZO 


64Q709 

o*+y /uz 


1681 T70 rr-y47 7Q774Q 

10© i.jzu.gz*to_zy /z^y 


T? 

r 


MUUujj/jzAlxllO 


UCz-NormColon 


SQ97 
ji^z / 


64^ 1 4? 

0*tO l*tZ 


1681 A71 rr-7/1'3 7Q77^£ 

ioo i ./\z i .gz*tj_zy /zjo 


17 

r 


MUUUjj /4Ur>.CUo 


U Cz -N ormColon 


J 


448? 5 1 


1681 H71 citAI 7Q7967 
lOo l.\JZl .gZH-J Z7/ZOz 


17 

r 


MUUUjj /4oA.UU/ 


UCz-JNormLolon 


S09Q 


6444 6 1 


1681 XT71 ctTA'X ?079/ : ;Q 

ioo i .inzi .gz*tj zy/zoy 


17 

r 


MUUUjj /joA.CjUz 


U Cz -N ormColon 




64 1 068 

OH 1 700 


1681 P99 <»'7/l'* 7Q7787 
lOo 1 ..rZZ.gZ'f j_Z7 /Zo / 


r 


MUUUjj /olrS.rUy 


UCz-NormColon 


j ~ j i 


470667 


1681 F91 cr-wAI 9Q79Q9 

ioo i .czj .gz*+j Z7 /zyz 


rr 
r 


MUUUjj /^JO.riUz 


U Cz -JN ormColon 


50^9 


644470 


1681 T91 o*z4^ 9079Q7 

lOOl.JZJ.gZtJ Z7/Z7/ 


17 


^4000^^7^7 A ni H 
lVlUUUjj / jZA.LrlU 


U Cz-JN ormColon 




46^894 


1681 A 94 cr-7-4^ 907704 

IOO 1 ./YZt.gZHJ Z7/ jU*I 


rr 
r 


MUUUjj /H\JtS.r\)y 


U Cz-JN ormColon 


SQ14- 

J ~ J*T 


^80^77 

JO~J / / 


1681 P94 avA'X 907706 

10ol.^Z4 , .gZ4J Z7/ JUO 


r 


MUUUjj /^JO.liUl 


U Cz-IN ormColon 




468050 


1681 F94 rr<747 707708 
lOo 1 .nz £ +.gz £ io_zy / jUo 


17 

r 


MUUUjj /4jL.AUO 


TT/^7 Mrt««r«1ft« 

u Cz-In ormColon 




55^675 

JJjU / J 


1681 1V494 ct-rA'X 9Q77 1 6 
lOo 1 .lVlZ £ t.gZ £ t J Z.Z7 / j ID 


rr 
r 


MUUUjj / j /A.AU / 


U Cz -in ormColon 




44QQ78 


1561 r*Hl cr-rA'X 7147Q5 
1 Jul .UUI .gZfJ jHJ7J 


17 
r 


ivyf n o ri4 7 q a a rs • pn 1 
MUUU4Z DH^U.r U j 


U Cz-IN ormColon 


J J O 


46^090 


1561 f^Ol o"t47 7147G0 


17 


MUUU^fZ JZuU.AU j 


UCz-JNormcoion. 


j ~ j ~ 


456967 

*+JU.Z.U / 


1S61 T01 cr741 ^14409 


r 


1^00047^*5 1 R/Jl 7 
IVIUUUt-Z J j 115.VJ1Z 


ucz-iNormcoion 


5940 


459097 


1S61 Mill eyT-4^ ^ 1 zlZlOS 


p 


^400047^^ 7 A 'T\1 7 
IVlUUU'tzJj / J\.LJ IZ 


u Cz -jn ormColon 


SQ41 


447^86 

*+*+ / J OU 


1S61 T09 orrAI ^14417 


p 


1V400047 ^ 7 QF* *r>H7 
MUUU^fZ jZVLI.I-IU / 


u Cz-iNormcoion 


5Q49 


468957 


1^61 \C(Y> nvA'X 11441Q 


P 


TVvf00047^'3'2T3-F1 1 
MUUU*fZ J J JlJ.r 1 1 


UCz-iN ormColon 


594^ 

J 7*tJ 


1 87X6 


1S61 F0^ cr^4^ ^14490 


p 


1V/100047 ^ 7 ^ r* -FfH A 
iviuuu^ z jz j .riuo 


u Cz-IN ormcoion 


5044 


447404 


1^61 A 04 rr-yZf? ^ 1 44 A 1 
1 JO 1 ./\U*+.gZ t t J Jl £ + £ t £ fl 


F 
r 


A>rnnft4 7*^41 \-t\c\q 
MUUU4Z J41 A.UUo 


U Cz-JNormColon 


5045 


45Q8Q^ 


1^61 P04 fY-y4^ ^14447 
1 JO 1 .JDU £ t.gZ £ T J Jl'f £ f £ fZ 


p 
r 


MUUU4z34jr>.LlUj 


TTp^ \T J T..,L i r J Ll.L. t 

U C z -N ormColon 


5046 


467558 


1^61 rr-747 ^144^ 
1 JO 1 . IYlU*f .gZ*t J J 1 H-H-J J 


p 
r 


\Af\f\r\A 7 < *2 7 a .rno 
MUUU4ZJ 5 /A.LrUy 


U Cz -N ormColon 


5047 


458495 


1^61 H04 rr-r4^ ^144^^ 
1 JOl .^UH-.gZ'+J Jl't'f jj 


p 
r 


A400047^41 A *T3fl7 
MUUU4ZJ41 A.rSU/ 


UCz-NormColon 


5048 


94Q1 78 
ZH-;/ I/o 


1^61 CC\^ nvA'X "5144^0 

i joi .v^uj.gz^fj ji*f £ fjy 


P 
r 


MUUU4Z J4 j r> . AU j 


U C z -N ormColon 


504Q 


4678 1 ^ 
*tOZo 1 J 


1 Jul .JUJ.gZ^j_^ l^f'fuu 


p 

r 


MUUU4ZJ j lU.OUo 


U Cz - N ormColon 


5050 


41 79Q1 
H-l / Z7 1 


1^61 l<TO^ cttAI* *X\AAfS1 
1 JOl.JSAJj.gZ** J__J l*f*fO/ 


p 
r 


MUUU4ZJJ jl^.rU4 


U Cz-N ormColon 


5051 

j~j i 


46^41 


1^61 1V40^ ci-jA'X ^1446Q 

i jo i .lviuj .gzHj j i*+M-oy 


F 


lV40nn47^'27 A *XJtnA 
MUUU4Z J J / A.OU4 


U Cz-IN ormColon 


5059 


447868 


1^61 A06 n^A'X 'WAATX 
1 Jul ./VUu.gZH-j JlH-'f/J 


p 
r 


MUUU4Z J4 1 A.HU4 


U Cz-N ormColon 


5Q51 


461 65^ 
*t010JJ 


1^61 RflA crrrAI "514474 

1 Jul .DUO.gZ^J jm-H/H- 


p 


MUUU4zj43r>.r Iz 


U Cz-N ormColon 


5054 


4651 97 

HOJ 1Z / 


1^61 1^06 rr-7-4^ ^1448^ 
1 JO 1 .JVUO.gZ't J jIH-HOJ 


p 
r 


MUUU4Z j j jU.oUj 


lT/^7 XT. -- mr*. - 1. - .l 

u Cz-in ormColon 


5955 

J7JJ 


460946 


1 S61 >J06 o-rAI ^14486 


p 


iV4noo47 ^ ^ or nno 
iviuuu^f z j j y d .uuy 


ucz-iNorm^oion 


5956 

J 7JU 


^1 10 


1S61 O06 a^4^ ^14487 

1 JOl .VlUU.gZHJ Jl*t*TO/ 


p 
r 


1V400047^41 A-F06 
IVlUUU^fZ J*f 1 /VHUO 


u cz -in ormcoion 


5957 

j -7 j / 


457508 


1S61K07a74^ ^14S09 

1 JO 1 ,1/NU / .gZ*t J J ItJUZ 


F 
r 


1^400047^^0^^04 


u v^z -in ormL/Oion 


5958 

J 7J O 


461 01 7 

HU 1^1/ 


1S61 RflQ o7d^ ^14S99 

1 JO 1 .X5U".gZ t T J JltJZZ 


F 
r 


1V40 0 04 7 ^ 4 "5 n ■ FO 8 

iviuuu'tZj'tjU.rUo 


u i»/Z-in ormi^oion 


5959 


447568 


1561H09ez43 314528 


F 


M00042528DD09 


1 jr , 2-NofmColon 


5960 


467094 


156 l.J09.gz43_3 14530 


F 


M00042532A:D08 


UC2-NormColon 


5961 


447087 


1561.N09.gz43_3 14534 


F 


M00042539C:H08 


UC2-NormColon 


5962 


458419 


1561. 009.gz43_3 14535 


F 


M00042541B:B05 


UC2-NormColon 


5963 


446952 


1561.D10.gz43_314540 


F 


M00042522C:H10 


UC2-NormColon 


5964 


470513 


1561.H10.gz43_314544 


F 


M00042528D:H03 


UC2-NormColon 
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Table 2 



SEQ 
ID 
NO 


CLUSTET 




OR IF 

NT 


l^lAJINl} ID 


LIBRARY 


core 


A CQQ7y1 

4599 /4 


1561.M10.gz43 314549 


F 


M00042537CD04 


UC2-NormColon 


5900 


A A Z A 

1454 


156 l.N10.gz43_3 14550 


F 


M00042539D:A04 


UC2-NormColon 


59o/ 


A £LA C 1 A 

464510 


1561.D1 Lgz43_314556 


F 


M00042522D:A08 


UC2 -NormColon 


5968 


A CTO /I Z* 

457846 


1561.Gll.gz43_3 14559 


F 


M00042527B:B01 


UC2-NormColon 


5969 


1 A/T 

15296 


1561.111 ,gz43 3 1456 1 


F 


M00042530C:H02 


UC2-NonnColon 


5970 


A A tZC TO 

446578 


1561.C12.gz43 314571 


F 


M00042345D:D04 


UC2-NormColon 


5971 


y| Z7 "5 AO/ 

463086 


1561.F12.gz43_3 14574 


F 


M00042525C:H07 


UC2-NormColon 


COT! 

5972 


/I /I TOT 

464287 


1 C / 1 11 ^ A 1 if 

156 1.112.gz43_3 14577 


F 


M00042530D:A02 


UC2-NormColon 


5y 15 


yf /I 70AC 

447805 


1 5 6 1 .A 1 4 .gz43 3 14601 


F 


M00042341D:G11 


UC2-NormColon 


5974 


446188 


"1^/1 /"* 1/1 /II 1 1 jl A*"T 

156 l.G14.gz43_3 14607 


F 


M00042527C:A10 


UC2-NortnColon 


5975 


468565 


156 1.114.gz43_3 14609 


F 


M00042530D:F09 


UC2-NormColon 


5976 


A TA/A A 

470600 


t C t T 1 jt yl 1 1 1 ji V "1 /"I 

1 56 1 J 1 4 .gz43_3 14610 


F 


M00042532A:H03 


UC2-NormColon 


5977 


457772 


1561.A15.gz43 314617 


F 


M00042342A:B04 


UC2-NormColon 


5978 


446383 


1 5 6 1 .E 1 5 .gz43 3 1 462 1 


F 


M00042524C:C12 


UC2-NormColon 


5979 


446673 


1 56 1 .F 1 5 .gz43_3 14622 


F 


M00042525D:E01 


UC2-NormColon 


c a o a 

5980 


vl /n7A 

462779 


1561.L15.gz43 314628 


F 


M00042536B:G08 


UC2-NormColon 


f AO I 

5981 


446289 


1561. C16.gz43_3 14635 


F 


M00042521A:E10 


UC2-NormColon 


5982 


458146 


1561. K16.gz43_3 14643 


F 


M00042534B:B08 


UC2-NormColon 


5983 


/i /i T/nr\ 

447690 


1CZ1T1Z" /IT 11 A /~ A A 

1561. L16.gz43_3 14644 


F 


M00042536C:A06 


UC2-NormColon 


c no a 
5984 


452888 


1 r / 1 "VT 1 ZT _ T 1 A S" A S~ 

1 5 6 1 . N 1 6 .gz43_3 1 4646 


F 


M00042540B:B03 


UC2 -NormColon 


5985 


461327 


1 5 6 1 . B 1 7 .gz43 3 1 465 0 


F 


M00042344B:F02 


UC2-NormColon 


f AO/ 

5986 


/I c /T AC O 

456958 


1 5 6 1 .E 1 7.gz43_3 1 4653 


F 


M00042524D:A10 


UC2-NormColon 


5987 


/I T A A 1 

467901 


1^/1 T~* it j *\ ^ ■* a y e a 

1561. F17.gz43_3 14654 


F 


M00042526A:E10 


UC2-NormColon 


cnoo 
5988 


459981 


1561 .K17.gz43 3 14659 


F 


M00042534B:D06 


UC2-NormColon 


CAOA 

5989 


/I CTl 1 O 

457218 


1 56 1 .N 1 7.gz43_3 14662 


F 


M00042540C:A12 


UC2-NormColon 


5990 


y| /T "> A A O 

463008 


1f/1 AIO A 11 

1 5 6 1 . A 1 8 .gz43_3 1 4665 


F 


M00042342C:H03 


UC2-NormColon 


rAA i 

5991 


44634 1 


1 5 6 1 .L 1 8 .gz433 1 4676 


F 


M00042536C:B03 


UC2-NormColon 


5992 


446389 


1 ^ 1 /~\ 1 O A*S *> i j / T A 

1 56 1 .0 1 8 .gz43_3 14679 


F 


M00042542A:C11 


UC2-NormColon 


5993 


/| /< T A A /I 

447096 


1 f X" 1 IO jl 1 11jI/OA\ 

156LP18.gz43 314680 


F 


M00042543B:H12 


UC2-NormColon 


5994 


/I yl /T A O vi 

446984 


1561 .B 19.gz43 3 14682 


F 


M00042344B:H12 


UC2-NormColon 


5995 


468767 


1 £ £L 1 1A A 1 ^> i A s~ C% S~ 

1561. F19.gz43_3 14686 


F 


M00042526A:F11 


UC2-NormColon 


5996 


/I /TOO /I 

467884 


1561 .H19.gz43_3 14688 


F 


M00042529B:E03 


UC2-NormColon 


c a at 

5997 


A C AO O 1 

459881 


1561 .A20.gz43_3 14697 


F 


M00042342D:D03 


UC2-NormColon 


5998 


yf A TAT T 

447027 


1 C £. 1 TXT f\ — J 1 TIjITAA 

1561 ,D20.gz43_3 1 4700 


F 


M00042523B:H01 


UC2-NormColon 


C AAA 

5999 


447380 


1561 .F20.gz43_3 1 4702 


F 


M00042526B:C12 


UC2-NormColon 


6000 


446304 


156 1 .02 1 .gz43_3 14727 


F 


M00042542B:B01 


UC2-NormColon 


6001 


447079 




Tj 

r 


JYlU0U4z5z 1 JJ.O09 


T T/"*T TkT i /"V. 1 i ■■ 

UC2-NormColon 


6002 


379335 


1561. 122.gz43_3 14737 


F 


M00042531B:D12 


UC2-NormColon 


6003 


228873 


1561. L22.gz43_3 14740 


F 


M00042537A:A07 


UC2-NormColon 


6004 


446663 


1561.D23.gz43_314748 


F 


M00042523C:E08 


UC2-NormColon 


6005 


464791 


1561. F23.gz43_3 14750 


F 


M00042526C:B12 


UC2-NormColon 


6006 


446595 


1561.K23.gz43_314755 


F 


M00042534D:D10 


UC2-NormColon 
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SEQ 
ID 
NO 


CLUSTER 


SEQ NAME 


ORIE 
NT 


CLONE ID 


LIBRARY 


6007 


446439 


1561.023.gz43_314759 


F 


M00042542B:C11 


UC2-NormColon 


6008 


458979 


1561.A24.gz43_3 14761 


F 


M00042343A:C05 


UC2-NormColon 


6009 


446620 


1561.C24.gz43_314763 


F 


M00042522A:A05 


UC2-NormColon 


6010 


452507 


1561.F24.gz43_3 14766 


F 


M00042526D:A02 


UC2-NormColon 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1 


AJ251957 


Ciona intestinalis mRNA for nuclear lamin (lamin L2 gene) 


2.2 


2 


AF133812 


Stronsvlocentrotus nallidus isolate tjR31 hindin &ene nartial cds 


0.88 


3 


AF051651 


Homo sapiens sqnalene synthase gene, intron 3, 5* end 


0.0001 


4 


NMO 16701 


Mus musculus nest in fNes i mRNA 


0.24 


5 


Y09000 


R norvefficiis mRNA for dendrin 


0.11 


6 


U67519 


Methanococcus iannaschii section 6 1 of 150 of the corrmlete penome 


6.1 


7 


AK005889 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700012B08, full insert sequence 


e-139 


8 


AB029343 


Homo saniens H(TR fa-helix cniled-coil rod homnlnpiie^ jrene 
complete cds 


0.023 


9 


ABO 17505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


10 


AY024246 


Orvza sativa microsatellite MRG6571 containing fTPTA^XfS 
genomic sequence 


6.3 


11 


U32309 


Acirienser fiilvescens Afl4 cvtriphrnme H uene nnrti;*1 r*Hc tR^TA-Tlir 

tRNA-Pro and partial D-loop, mitochondrial genes encoding 
mitochondrial products 


6.3 


12 


XM 046070 


Homo sapiens KIAA0853 protein (KIAA0853), mRNA 


3E-48 


13 


AK001945 


Homo sapiens cDNA FLJ1 1083 fis, clone PLACE1005232 


2.2 


14 


AF1 11457 


Meleagris gallopavo clone TUCA1 141 microsatellite sequence 


0.076 


15 


AK018499 


Mus musculus adult male colon cDNA, RIKEN full-length enriched 
library, clone:9030409O13, full insert sequence 


0 


16 


AE002648 


Drosonhila melanoeaster eenomic scaffold 142000013385436 
complete sequence 


2.2 


17 


AC001011 


Homo saniens (subclone 1 e5 from PI H43^ DNA senuence comnlete 

sequence 


0.003 


18 


XM 039942 


Homo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.75 


19 


AF099003 


Caenorhabditis elegans cosmid Y59C2A 


0.75 


20 


U92972 


Mus miisciihis nTOt ease-activated recentor 3 fPAR3 1 mPNA comnlete 

cds 


0.08 


21 


AE003935 


Xylella fasti diosa 9a5c, section 81 of 229 of the complete genome 


0.067 


22 


AB031009 


Homo sapiens DNA, MHC class I CL region, 7.1 ancestral haplotype 


0.019 


23 


X94207 


Rsapiens TPR gene (1 365bp) 


4.8 


24 


AK025678 


Homo sapiens cDNA: FLJ22025 fis, clone HEP08518 


2.2 


25 


AK014552 


Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
library, clone:463 2404006, full insert sequence 


e-176 


26 


XM_044013 


Homo sapiens kinesin-like protein 2 (hklp2), mRNA 


0 


27 


NM_032993 


Homo sapiens nucleolar protein family A, member 1 (H/ACA small 
nucleolar RNPs) (NOLA1), transcript variant 2, mRNA 


4E-50 


28 


NM_032315 


Homo sapiens hypothetical protein MGC4399 (MGC4399), mRNA 


0 


29 


AK020701 


Mus musculus 6 days neonate skin cDNA, RIKEN full-length 
enriched library, clone: A030009B 12, full insert sequence 


0.0000002 


30 


AK021717 


Homo sapiens cDNA FLJ11655 fis, clone HEMBA1004554 


0 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


31 


AL5 12453 


Human DNA <;eniience from clone OTO-15161 i C22 on chrnmnsotne 6 

X XlXXXlvXXE JU^ X ^ X X. tfvuUVlAW 11 vltl VlvllV X X J. vXVai *f vli vlil vllluuvlltv V/ ^ 

complete sequence [Homo sapiens] 


0.00004 


32 


XM 005034 

^»*XTA W^v--/ ■ 


Homo sapiens hypothetical protein FLJ 105 11 (FLJ10511), mRNA 


2E-22 


33 


AF3 18896 


Oricetiilii*; pri^eus heta-1 4-Pfl1acto<»vltran^feT*a<ie 1 rtieta40alT-l^ 
mRNA comtvlete ccte 


0.78 


34 


D29757 


Human gene for cytochrome P-450 aromatase, exon 1 (brain specific) 


3E-24 


35 


AJ3 11520 


Ostreopsis ovata 5.8S rRNA gene, internal transcribed spacer 1 (ITS1) 
and internal transcribed spacer 2 (ITS2), strain Al 


0.79 


36 


XM_038521 


Homo sapiens phosphatidylserine decarboxylase (PISD), mRNA 


0 


37 


AK023043 


Homo sapiens cDNA FLJ 12981 fis, clone NT2RP2006454 


3E-96 


38 - 


XM_045520 


Homo sapiens KIAA1595 protein (KIAA1595), mRNA 


4E-16 


39 


ABO 17505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


40 


AK002682 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610027B03, full insert sequence 


e-138 


41 


NC 001712 


Locusta migratoria mitochondrion, complete genome 


0.009 


42 


AL5 12453 


Human DNA sequence from clone CTC-1516K22 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00001 


43 


AK002682 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610027B03, full insert sequence 


e-125 


44 


XM 039088 


Homo sapiens KIAA0914 gene product (KIAA0914), mRNA 


0.76 


45 


U30291 


Schistosoma mansoni synaptobrevin-like protein gene, exon 2 and 

partial cds 


0.69 


46 


XM_052543 


Homo sapiens mitochondrial ribosomal protein L42 (MRPL42), 

mRNA 


1E-26 


47 


AK026225 


Homo sapiens cDNA: FLJ22572 fis, clone HSI02313 


0.0008 


48 


XM 043007 


Homo sapiens hypothetical gene supported by AK021969 
(LOC92132), mRNA 


1E-99 


49 


Ml 8824 


P.faciparum S antigen gene, complete cds 


0.003 


50 


XM 009311 


Homo sapiens zinc finger protein 304 (ZNF304), mRNA 


0.003 


51 


AK026697 


Homo sapiens cDNA: FLJ23044 fis, clone LNG02454 


0 


52 


XM_035597 


Homo sapiens hypothetical protein AF140225 (AF140225), mRNA 


1E-98 


53 


U81366 


Plasmid Rtsl killer protein and antidote protein genes, complete cds 


0.26 


54 


U28921 


Phaseolus vulgaris ATPase gamma subunit mRNA, nuclear gene 
encoding mitochondrial protein, partial cds 


0.27 


55 


AF305083 


Homo sapiens alpha(l,3)-fucosyltransferase IV (FXJTIV) gene, 3' UTR 


0.031 


56 


D29757 


Human gene for cytochrome P-450 aromatase, exon 1 (brain specific) 


7E-20 


57 


XM_038048 


Homo sapiens similar to APICAL ENDOSOMAL GLYCOPROTEIN 
PRECURSOR (R. norvegicus) (LOC91372), mRNA 


0.0004 


58 


AK026860 


Homo sapiens cDNA: FLJ23207 fis, clone ADSE00968 


8E-12 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


59 


BCO 10849 


rioino sapiens, oimiiar to iiypoineiicai proiem ri^jzujol, clone 
MGC:8851 IMAGE: 3 878776, mRNA, complete cds 


0 


60 




xioino sapiens signal recognition panicle ookjj (pKrooj, mKJNA 


V 


61 




norno sapiens zinc linger protein AJNC_zriUi (Laji^di iyj ) y itikina 


U 


63 


Z49806 


B.taurus mRNA for adenylyl cyclase type VII 


6.8 


64 


XM_048858 


Homo sapiens hypothetical protein FLJ23384 (FLJ23384), mRNA 


0 


65 




numu octpicus iiypuuicucai pruiciii ivivjv^'toy^ ^jyivji^tJ yy), iillvTN/\ 




66 


ARftS7SQJ. 

ADUJ / 


Homo sapiens GSTT1 gene for glutathione S-transferase TT1, 

CUUipiClC CUa 




67 




T-Tomn cnTiipnc. fivnntViPti^al nmtpin A T71J.no 7^ f AFIJ.077^ ml? XT A 
rxuiiiu sapiens iiypuiiicucai piuiciu at itu^^j ^/\r ifuzz j ULtvL^rV 




68 




Unmn cani(=»nc hvr»ntliptir , al niwtf»in AFIJ.077^ /'AF1d.077^\ mPMA 


id- / 1 


69 


ABO 17505 


Homo sapiens mRNA for Nori-2p, complete cds 


0 


70 


AK010143 


ivius niuswuius auuii male luiiguc cl/ina, xviTviJriN ruii-iengtn enncneu 
library, clone:2310074B19, full insert sequence 


1.7 


71 


AK008666 


ivius museums auuii maie siuiuacii ci-jina, isJjvcriN luii-iciigtii 
enriched library, clone:22 10008 A03, full insert sequence 


e-173 


72 


T 

JLrXO / OJ 


x laSUlUUl 11111 IclIUipdl U.111 L^fi^i r\ pUiyillCiaSC aipiUl gCIIC, IrUllipiCIC CUS 


n no7 


73 


XM 003327 


Homo sapiens hypothetical protein FLJ10858 (FLJ10858), mRNA 


0 


74 




jvius muscuius aaun maie lung cujna, k i is hjn iuii-iengtn enncneu 
iiuidiy, Liuiie . i zuuu i jr z j , luii niscii sequence 


7F-1 1 


75 


Z29969 


cmsioiyuca piasmici genes ior noosomai kjn a ana nemoiysins 
HLY1, HLY5mcl HLY5mc2 HLY4 


0.028 


76 




Unmn cnr»ip»nc ^OXTA- KT T719Q9 fie rlnnp POT OIO^O 


n 


77 


Y15931 


Homo sapiens CTNS gene, exon 10 and flanking intronic regions 


0.78 


78 


BC000610 


riomo sapiens, nypotneucai protein rLjiuyoo, cione jvhjcvuo 
IMAGE:3347108, mRNA, complete cds 


0 


79 


YVf 0S04.74 


Wnmn cani^nc "K"T A A 1 944 nrntf»in fJCl A A 1 7dJ.^ mUNA 

FLUIIIU oo.pi Clio JVlArtlZHt pi U LCI 11 l^IVl/^r\ IZttJ, llLIVLN.rV 


0 7S 


80 




T-TumsiTi tnitiv*lirtTiHi*irtTi r'ntririlf^tf* of*Yif\vn& 
nuiiuui iiiiiuviiuiiuiivJii, cuiiipicic genuine 


1D"JU 


81 




Arabidopsis thaliana unknovm protein (F28C11.8) mRNA, complete 

f He 


\J. o 


82 


BC000225 


riomo sapiens, onmiar 10 liypuuieucai protein ri^jiuojo, cione 
IMAGE:3350951, mRNA 


0 


83 


XM_034107 


Homo sapiens hypothetical protein FLJ20758 (FLJ20758), mRNA 


7E-32 


84 


AF305816 


Homo sapiens PRO0633 mRNA, complete cds 


0.67 


85 


AF196678 


HIV-1 isolate 96RW34 from Rwanda, gag protein (gag) gene, partial 
cds 


2.1 


86 


AK024927 


Homo sapiens cDNA: FLJ21274 fis, clone COLO 1781 


6E-75 


87 


X77394 


P.philodendra mitochondrial DNA repeat unit 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




t> \rAT TTE7 


88 


U41203 


Human short tandem reneat nolvmnrnhicm ITKAU r>i*7Qi^ii 


U.UUUl 


89 


AC015937 


Homo ^aTvipn*; rhrnirnvinmp 17 plnnp f* J i "i ^ 1 9 mmnioto 

sequence 


ft A* 


90 


AF245117 


Mus musculus leukocyte cell-surface molecule (Ly9) gene, exon 1 


0.74 


91 


XM 001412 


Homo saoiens metal-retnilatorv tran^crintion fartnr 1 /A/i i Kl ^ mRM A 




92 


AE006690 


Sulfolobus solfataricus section 49 of 272 of the complete genome 


0.26 


93 


AJ292466 


Homo sapiens mRNA for WDR9 protein (WDR9 gene), form B 


9E-44 


94 


NM_013983 


Homo sapiens neuregulin 2 (NRG2), transcript variant 4, mRNA 


0.27 


95 


JO 1390 


Emericella nidulans mtDNA between h2/h5 and bh2/b2 junctions, 
frenes for ATPase subunit 6 cvtochrome nxirfaQf* ciihiinit ^ cpvpti 
unidentified proteins, twentyfour tRNA's and L-rRNA 


2.2 


96 


AP000384 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MCE21 


0.22 


97 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


9 4 


98 


AE000192 


Escherichia coli K12 MG1655 section 82 of 400 of the complete 
genome 


0.003 


99 


AF3 16828 


Schistosoma mansoni zinc finger protein SmZFl gene, complete cds 


1.7 


100 


BC007942 


XxUlllU aaJJlCila, liUL>lCUldl ctULUdilllgCIl \DDKLJ) MIEUldr lO la. 1 

<*vnantnnemal comnlpv nrntpin plrvnp K/ffrf** 14967 TKA AfirF'd.1 '\C\10f\ 
mRNA, complete cds 


2E-59 


101 


L35301 


Homn <winienQ tiiirlf*n<;idf* Hinhcnnhatp Irinsicf* A ( nm9^-T-T1 ^ opnp 

i.i\jLLi\j ocifjii'iio iiu\^iw\jdiu& uijuiiidU^iiiaLC j\j.iiaoC av ^iliilz,j n.1 J HCllC 

fragment 


0.003 


102 


AF1 14927 


Saccharomyces pastorianus CBS1538 small subunit ribosomal RNA 
eene mitochondrial cene for mitochondrial RNA comnlete sennence 


0.27 


103 


AF299248 


Drosophila melanogaster talin mRNA, complete cds 


1.8 


104 


BC003191 


Homo saoiens GL004 nrotein clone MGC-89S TMAGF*'*^09q99 
mRNA, complete cds 


2E-10 


105 


U29926 


Human AMP deaminase f AlVfPDl^ eene exon 14 and 1 5 and 
complete cds 


0.06 


106 


BC003191 


Homo sapiens, GL004 protein, clone MGC:895 IMAGE:3502929, 
mRNA, complete cds 


o 


107 


AK022127 


Homo sapiens cDNA FLJ12065 fis, clone HEMBB 1002249 


0.0008 


108 


D16360 


Human DNA for plasma glutathione peroxidase, exon 1 


0.091 


109 


AK014138 


Mus musculus 13 days embryo head cDNA, RUCEN full-length 
enriched library, clone:3110038All, full insert sequence 


6 


110 


XM_006914 


Homo sapiens cold shock domain protein A (CSDA), mRNA 


0.079 


111 


U31283 


Mesotaenium caldariorum clone mesphyla phytochrome gene, partial 
cds, exons 4 through 10 


0.56 


112 


XMJ)49935 


Homo sapiens hypothetical protein FLJ14950 (FLJ 14950), mRNA 


0 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


113 


AE001329 


Chlamydia trachomatis section 56 of 87 of the complete genome 


0.75 


114 


XM 037830 


TT/vmrt csir*if*Tic fiT F-1 /FRF accociaffvl y\ nr ftncFPT crf*np fTCTA A07^0^ 

XXUXXXU SclJJXWlXo V-/X-/X X / X-sXJX aSSUWlol&U £tXHW IXllgWX ^Wllb ^IVlfV/\v 'Uv^j 

mRNA 


0.25 


115 


AE006368 


T actncoccus lactic ^lih^n lactis TT 1401 section 110 of 218 of* the 

complete genome 


0.0003 


116 




Homo aniens hvoothetical nrotein FLJ 14950 rFLJ 14950} mRNA 


0 


117 


ym 032533 


Homo sapiens hypothetical protein FLJ 12787 (FLJ 12787), mRNA 


0 


118 


AJ309861 


Homo saoiens oartial mRNA for nutative nrotein kinase WNK.4 
fPRKWNK4 eene) 


9E-76 


119 


BC005173 


Homo sapiens, KIAA0255 gene product, clone IMAGE: 3 5079 18, 
mRNA 


2E-38 


120 


AF068758 


Drosophila melanogaster SIR2 (Sir2) mRNA, complete cds 


0.083 


121 


AKO 17307 


Mi is miiscnliis 6 davs neonate head cDNA. RIKEN fnll-lenffth 
enriched library, clone:5430414B19, full insert sequence 


0.003 


122 


yu 036208 


Homo sar>iens snhineosine kinase 1 fSPHKl) mRNA 


2.1 


123 


XM_039960 


Homo sapiens PC3-96 protein (PC3-96), mRNA 


0 


124 


yviVA UJy J \J\J 


Homo aniens hvDothetical Drotein FLJ20400 fFLJ20400) mRNA 


0.08 


125 


yvivx w+ y y -j ~j 


Homo saniens hvoothetical Drotein FLJ14950 (FLJ14950) mRNA 

X lvlllU OCI|Jivilu J Uv Ulv U vul Ulvlvlli X u*r X ~ s *y v yA J-»*l * ■ ■'* , 'vyj ^a ^ 


0 


126 


AE006080 


Pasteurella multocida PM70 section 47 of 204 of the complete genome 


2.1 


127 


XM_009518 


Unmn canipnc WWT1 inducible sipnalinp nathwav nrotein 2 fWISP2^ 
mRNA 


0.16 


128 




Homo saniens Heta-nrime-adantin f APIBI^ f?ene exon 15 


0.022 


129 


AF267996 


Homo sapiens muscle-specific protein (C4orf5) gene, intron 3 


0.34 


130 


AF111RS9 


Homo saoiens clone 24923 mRNA seauence 


1.8 


131 


T 48479 


Homo sapiens (subclone 6 hi from PI H21) DNA sequence 


0.01 


132 


YU 018117 


Hnmo canipnc iihiniiifinatinn factor F4B fhomoloffoiis to veast UFD2^ 

XXUlIlw oaUldlO UXyXUlXXLXXlalXVXX X CI w l\J I J— <"TXJ ^liuiliuivgvlw ivr J \sciij\, x / 

rTTRF4B^ mRNA 

^\^XJXw~X^y, 1 1 U. Vi. ^ 


1.6 


133 


yu 008267 


Hnmn saniens solute carrier familv 16 fmonocarboxvlic acid 
transporters), member 6 (SLC16A6), mRNA 


0.23 


134 


AF1 62923 


Beet soil-borne virus RNA3 isolate 6b 22K protein gene, partial cds 


6.6 


135 


NM 010053 


Mus musculus distal-less homeobox 1 (Dlxl), mRNA 


2.5 


136 


AF257303 


Mus musculus synaptotagmin II (Syt2) gene, complete cds 


0.85 


137 


Z92837 


Caenorhabditis elegans cosmid R03E1, complete sequence 


0.027 


138 


AF258528 


Loligo pealei phospholipase C mRNA, complete cds 


0.74 


139 


M65252 


Bacillus thuringiensis alesti delta endotoxin gene, complete cds 


0.075 


140 


M14954 


D.melanogaster (W-IR1 mutation) I factor DNA, complete cds 


0.053 


141 


Z48636 


C.novyi gene for alpha-toxin 


0.026 


142 


XM 036491 


Homo sapiens hypothetical protein MGC3771 (MGC3771), mRNA 


0.002 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


143 


AK020198 


Mus musculus 15 davs embrvo male testis cDNA RTKFN fnll-lpnath 
enriched library, clone:8030442B05, full insert sequence 


0.53 


144 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 




145 


AJ297708 


Rattus norvefriciis RT6 jyene "for T rHl HifTf*«*ntijitinTi tnarVpr RTK 9 

ivfll-LUo ilvl VV^lvUu lv X \J Ivl ± Wvli vl 1,1,1 ^1 villi QUmi lluULJYvM -TV 1 U.L, 

exons 1-8 


0.024 


146 


AK026697 


Homo saniens cDNA* FLJ23044 fis clone LNG02454 


1.9 


147 


AE001370 


Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 


0.017 


148 


AF325208 


Acetabularia acetabulum nicotinamide nucleotide transhydrogenase 
(Nnt) gene, exons 6, 7, 8, and complete cds 


5.2 


149 


L78770 


Homo sapiens (subclone 1 f7 from PI H43) DNA sequence 


0.068 


150 - 


AF182214 


Glycine max glutamine synthetase gamma 2 subunit gene, 5'-flanking 
region and partial cds 


0.2 


151 


AF359513 


Lolium rigidum clone LR1 acetyl-CoA carboxylase mRNA, partial 
cds; nuclear gene for plastid product 


0.2 


152 


AE004798 


P<*endnmonfl^ flemsrinooa PA01 section nf ^29 of the cnmnlpte 
genome 


2.6 


153 


NM 021699 


Rattus norvegicus serine/threonine kinase (LOC60328), mRNA 


1.8 


154 


XM_040539 


Homo sapiens interphotoreceptor matrix proteoglycan 200 
(SPACRCAN), mRNA 


2.3 


155 


AE006894 


Sulfolobus solfataricus section 253 of 272 of the complete genome 


0.027 


156 


Y14603 


Erwinia amylovora srlA, srlE, srlB, srlD, srlM and srlR genes 


0.8 


157 


AF181720 


Homo sapiens RU2AS (RU2) gene, complete cds; and RU2S (RU2) 
gene, partial cds 


0.051 


158 


XM 043815 


Homo sapiens hypothetical protein FLJ21736 (FLJ21736), mRNA 


1.1 


159 


AE006059 


Pasteurella multocida PM70 section 26 of 204 of the complete genome 


1.6 


160 


AF151097 


Homo sapiens voltage-dependent anion channel (VDAC1) gene, 
exons 8 and 9 and complete cds 


0.074 


161 


AF165136 


N/fvr*nn1 ucmn mvfniHpc mvrni H pq T f" 1 livrvitliptifjil ciirfiirv* 1r»r , ntf*H 

IVxjrlrU^Jlaollla. llljrvUlU&is llljr tA/lUbd lljf^HJLlltLlwCll oul v<\\j\u lULaLCU 

membrane protein and lipoprotein B precursor (lppB) genes, partial 
cds 


0.23 


162 


XM 016392 


Homo sapiens hypothetical protein FLJ 12768 (FLJ 12768), mRNA 


2.1 


163 


AF170544 


Magnaporthe grisea vacuolar- ATPase (VATP) mRNA, complete cds 


0.15 


164 


AY045680 


Arabidopsis thaliana Atlg08350/T27G7_4 mRNA, complete cds 


0.009 


165 


AF285100 


Bos taurus lysophosphatidic acid acyltransfesrase gene, complete cds 


1.6 


166 


AF194170 


Dictyostelium discoideum cudA protein gene, promotor sequence and 
partial cds 


0.039 


167 


AF242446 


Homo sapiens LINE-1 insertion dimorphism LID-3 empty site 
sequence 


0.043 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT TJE 


168 


AF084196 


Pan troglodytes garnrna-aminobutyric acid receptor A5 subunit 
duplicated gene, 5'UTR region 


0.88 


169 


AK021754 


Homo sapiens cDNA FLJ1 1692 fis, clone HEMBA1004983 


0 


170 


AE006002 


Caulobacter crescentus section 328 of 359 of the complete genome 


1.6 


171 


AL096734 


Homo sapiens mRNA; cDNA DKFZp434M01 1 (from clone 
DKFZp434M011) 


0.015 


172 


AF251351 


Homo saoiens 2 , -5 , olieoadenvlate svnthetase 3 CO A S3 eene nromoter 
region 


2.7 


173 


AF061244 


AffToevbe aefferita B tvne ON A nolvmpraop fMtnnn **f*fif* r/imnlpfp 
cds; tRNA-Asn gene, complete sequence; and unknown genes, 
mitochondrial genes for mitochondrial products 


0.0006 


174- 


X95275 


P.falciparum complete gene map of plastid-like DNA (ER-A) 


0.1 


176 


M88599 


Entamoeba histolytica P-glycoprotein-1 (pgpl) gene, complete cds 


0.12 


111 


AYO 19624 


Orvza <witiva micro<iatel1ite MR01949 cnntaininp fATY5C40 Hnopot to 
marker Rl 167, genomic sequence 


0.017 


178 


NM_026040 


Mus musculus RDCEN cDNA 2810036K01 eene f2810036K01Rik > > 
mRNA 


e-101 


179 


AL162040 


Homo sapiens mRNA; cDNA DKFZp434N199 (from clone 
DKFZp434N199) 


e-152 


180 


AJ295190 


Human coxsackievirus All ffenomic RNA for nartial nolvnrntein 
gene, isolate VR-169, ATCC 


1.5 


181 


AJ298684 


Phylloxera sp. MBLM2 mitochondrial ATP6 gene for ATP synthase A 
chain subunit 6 


0.51 


183 


AF268064 


C^andidatu*; C^arsonella niddii nafural-host f^alnnhva srhini RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.012 


184 


U27374 


Human Menkes disease gene (ATP7A), exon 16 


0.51 


185 


D 10767 


Epizootic haemorrhagic disease virus serotype 1 (EHDV-1) gene 
encoding VP2 protein 


0.049 


186 


AJ002256 


Mus musculus minisatellite tandem repeat (MMS73) 


0.058 


187 


AB007546 


Homo sapiens gene for LECT2, complete cds 


2E-11 


188 


AF254573 


Cricetulus griseus origin recognition complex subunit 2 mRNA, 
complete cds 


0.57 


189 


AK024903 


Homo samens cDNA* FLJ21250 fis clone COLO 1253 hiehlv similar 

X XX/XX&Vr tJClUlVllu WX/X^xX* X XJJm lAfb/V llj^ viwllv V/ V/U v 1 « ^ 1L1 will V ^IXXXJXCXX 

to AB020527 Homo sapiens mRNA for Na/P04 cotransporter 
homolog 


e-135 


190 


AF359513 


Lolium rigidum clone LR1 acetyl-CoA carboxylase mRNA, partial 
cds; nuclear gene for plastid product 


0.15 


191 


AJ237585 


Mus musculus mRNA for hypothetical protein expressed in 
thymocytes (clone MFT.M05.13/MTA.B10.066), partial 


4.4 


192 


AF233591 


Mus musculus son-of-sevenless 1 (Sosl) gene, partial cds 


0.52 


193 


X81190 


M. musculus DNA flanking site of transgene insertion 


4.8 


194 


AF232246 


Tetrahymena thermophila clone Tlrl Int integrase-like protein gene, 
complete cds 


0.53 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCFSSN 


DESCRTP 


P VAT I IF 

A V iVL/UAj 


195 


AR014.Q69 


' 1 h a r m rt^ a/*^i i p ft*t u 1 fill 1 /t4"TJ y-fc A f+rrt-t^ f^frwt* ^^y+T* f^AfiAr* 

lnermococcus sp. oiuui cgio, cgiA, cgiw, cgiL/, cgui genes tor 

UyulOIIldllOUCAU lllcloC, CyLlWllldllWUGALIill giuuaiiuiraiiaierase, 
UyLAUlIiaxLWUCAUlIl L/lllUlllg jJlULClil, ^ywiWliUliLUUCAUlxl LlilllbpUI L 

tn^mhrjirip TYrntpin f^nmr>1ptp pHc 

llldllLrl ClllV JL/1 VJlVlll, WVFillLVlt'lfc MIS 


1.8 


196 


LJ 7JOOJ 


Ixodes granulatus 16S ribosomal RNA gene, mitochondrial gene for 
mitnrlinriHrial RNA nartial ^pmiPTice 

llillUwllWUUX 1CLX AVI ^1 pdlUCU idVljUvllvv 


0.048 


198 


XMO 16002 


Homo sapiens LOC88174 (LOC88174), mRNA 


4 


199 


AC091698 


Homo sapiens clone RP11-596J6, complete sequence 


0.000003 


200 


299763 


Flaveria nrinf?lei ffdcsH *?ene 


0.02 


201 


AB048873 


Macaca fascicularis brain cDNA, clone:QnpA-10763 


0.053 


202 


AF286167 

/TLA z#ovj j vj / 


TTfimn Qjmipns HN/fCiTY ffene nromoter 


0.007 


203 - 


AR029071 


Mus musculus Kkm gene for Ser/Thr kinase KKIAMRE, exon 13, 14, 


3.9 


204 




T^r\1ir , Tir»rTii tiritprmpc cr» MV^nnjmc' m ■trw > lir\nH ri q! f*ihncr\mu1 RNA 
.L/WlldlUI lAlllWLGliilCa o|i. ivitUia.u.a iuo iiiiiwviiWiiLiiicii l iL/uavJiiidi rvi^^v 

ctp>T\f* riartijil cpnnpiK^p* mitochondrial pptip for mitochondrial nrodnct 

gCllv, L/fll LlO.1 Q&llUvllvv^ 1111 LV/v ALU 1 1UA Aal A^fAiS^ 1U1 11ULV/W1AVJAAUA IcAA L7A V/vllAwl 


0.045 


205 


at ^nnns 


Unman HMA cf»niif»nr"f» fVnm r*innp T?P 1 1 _4.*?^TT17 c\n rhrnmncntTiP 

XxUlIIall J-V1N /A. ■sCUliCllL'C 11 Will L>1W11G AVA 1 I'rJUIli / VJll will UIIIW0WII1& 

Yn9^i 1-77 1 r^omr^lpff* cpnn^nrp rTTnrnn cor\if»r»cl 

J\\^Z,\J. 1 ~£* l . 1, IsWlUJJlwLG 3d|UbllUv [^1 AW111W OOAJ1^113J 


0.16 


206 


>jA/f OIQOSJ. 
iNivi \j Lzt\J j*t 


Homo oanipn^ hvnothptiral nrotein MOPS 560 (MGC5560I mRNA 

XXU111W oapiCllO liy LJULllwLlWal Ljiv/l&lll IviVJV/JJwv ^ivivjv^.'Vv iiuvi ^ r^. 


9E-35 


207 




\Aitc mncr*n1iic 1ft Hsii/c np»r\riiitp» clrtn r*DM A RTTCF^T fii11-1pno r th 
aVxUo illUol*UlUo 1W Uajs iicwiidic 31V111 vJL/i^i/Tt, rvixvj_#iN iuii ibiigui 

pririrhpiH lihrjirv Hnnp*4.7H 4-1 1O05 fiill in<;PTt ^ennence 

CllllvllCU HWlCliy, Llx/llv.^ / J Itl JVJV/Jj 1U11 lllOWl L Dbl^UvllUv 


0.82 


208 


AK"097 1 9R 


Unmn onnipn*: rDNA' FT T71475 fi« clone HST 11659 

nwlllW oaUlCllo ^lyi^ rx. x i—tJ / -J 11 o, viuiit/ HJHJUJ7 


0.021 


209 


T0077R 


Rat •n^^*nr*r^*^lti^ , trvticiti T crprip fninrilptP pHd 

XvclL JJcllldl Calit llj'Liolll 1 g^ll^, WWlllLJlblb wuo 


0.016 


210 


D49412 


Human gene for interleukin 3 receptor alpha subunit, exon 1 1 


0.067 


211 


r\ I JJ / o 


ArnhiHnnciQ thalinna AT4.<xl7000/P7A10 160 mRNA comnlete cds 

r\i dUlvAWUolo Lllallollcl r\l tgJ /wu/^//niu JW iilivi ^ -tv, w/iiiljiwlw wuo 


2 


213 


AJ011013 


Cicer arietinum epicotyl EST, clone Canl33 


4.7 


214 




Unmn «:jiti1(*iiq mpnrifi A hf*ta rMFPIRi mRNA 


5.6 


215 


AT 091R10 


f*of»TinrliiihHitic plptran^ pft^mid Y76F6A cnmrilete ^pnnence 

VvCldlWl lldULU LlO Cl&Kdll3 wWOllllU X ArVJX^Wxl., VWllILilvLV OWV[UWlvv 


0.084 


216 


AK027136 


Homo sapiens cDNA: FLJ23483 fis, clone KAIA04052 


6.6 


217 


U03554 


oacuius inuringiensis mux 1 isuni c<vjz ijo ixaiiapuauii nutwi aiic" 
specific recombinase (tnpl) and transposase (tpnA) genes, complete 
cds 


0.46 


218 


YU 041584 


TTomn canipnc nnkvrin rpnpat anH SOPS box-rnntaiiiiTiff 1 rASRl^ 

xxWliiW ocllJidia ctiiivjfiiii i x^yjy^cn. cixiu ov/v^ij L/uiv^^v^xiLCUiixxig a y*ri.iju i. j 9 

mRNA 


2.2 


219 


AE001177 


Borrelia burgdorferi (section 63 of 70) of the complete genome 


0.094 


220 


AK018148 


Mus musculus adult male medulla oblongata cDNA, RIKEN full- 
length enriched library, clone: 63 30408P1 1, full insert sequence 


1.6 


221 


AF271148 


Botryosphaeria ribis isolate 96-13 1 small subunit ribosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.65 


222 


XM_008090 


Homo sapiens KIAA0419 gene product (KIAA0419), mRNA 


0.25 


223 


Z54541 


H. sapiens CpG island DNA genomic Msel fragment, clone 132c 11, 
forward read cpgl32cll.ftlb 


6.4 



224 







Table 3A Nearest Neighbor (BlastN vs. Genbanto 


SEQIE 
NO 


) 

ACCESSN 




P VALUE 


225 


XM 042689 


Homo sapiens fasciculation and elongation protein zeta 2 (zygin II) 


0.69 


226 


AF3 17083 


Anopiosioma viviparum isolate BrnlO large subunit nbosomal RNA 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.22 


227 


AE006436 


Lactococcus lactis subsp. lactis IL1403 section 198 of 218 of the 
complete genome 


0.029 


228 


AK027535 


Homo sapiens cDNA FLJ14629 fis, clone NT2RP2000448, weakly 
similar to PROTPTM 


7E-76 


229 


BC004209 


Homo sapiens, NIMA (never in mitosis gene a)-related kinase 6, clone 
MGC:4434 IMAGE:2958695, mRNA, complete cds 


3E-36 


230 


AC002183 


fiomo sapiens (suoclone 2_n8 from BAC Hill) DNA sequence, 

com nipt p cfvmpnr'** 


0.0000004 


231 


AF242489 


Prochlorococcus marinus UreC (ureC), UreB (ureB), UreA (ureA), 
UreD (ureD), UreE (ureE), UreF (ureF), and UreG (ureG) genes, ' 


0.0003 


232 


AF385396 


UrtTtlf^ Compile ncAnocic citp/«Ai\'f<K<T'!fir _ , . ,.i * . t 

numu ^apiciib pboridsis susceptiuiiity gene, partial sequence 


0.027 


233 


AE006749 


ouuuiuuub buudiancus section iuo 01 z 11 ol tne complete genome 


2.1 


234 


X73501 


H. sapiens gene for cytokeratin 20 


0.25 


235 


AJ296801 


uiyptotencupes pallens 18S rRNA gene (partial), 2S rRNA gene, 28S 
rRNA gene (partial), internal transcribed spacer 1 (ITS1) and internal 
LioiiaLriiucu bpdcer z ^iioz^, cione iipaitsy 


0.49 


236 


X17627 


C. thummi DNA transposable element TECthl 


0.59 


237 


AF157924 


nuianuas aorsaiis cytochrome b (cytb) gene, complete cds; 
uuLuciiuiiunai gene ior mitocnonanai product 


0.19 


238 


XM 040836 


Homo sapiens similar to calcium channel, voltage-dependent, alpha 
in E>uDumi ^ti. sapiens) (LUCVloUy), mRNA 


0.38 


239 


AF097103 


Oreina melanocephala 16S large subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.23 


240 


AK013974 


Mus museums 13 days embryo head cDNA, RIKEN full-length 
eiinuiiea iiorary, cione.JiiuuU2U09, lull insert sequence 


0.73 


241 


AK022073 


Homo sapiens cDNA FLJ1201 1 fis, clone HEMBB 100 1653 


0.26 


242 


XM_012392 


numu 2>«ipici^ uDiquiun specinc protease o (UoFs), mRNA 


2 


243 


AB048604 


Mus museums Neu-2 mRNA for sialidase, complete cds 


0.54 


244 


AF275235 \ 


Drepanaphis utahensis 12S small subunit ribosomal RNA gene, 
partial sequence, tRNA-Val gene, complete sequence; and 16S large 
suoumi noosomai km a gene, paruai sequence; mitochondrial genes 
for mitochondrial products 


0.076 


245 


] 

XM_032956 ( 


Homo sapiens cat eye syndrome chromosome region, candidate 2 
;CECR2), mRNA 


2.2 


246 


] 

XM 050246 i 


iomo sapiens secretory protein SEC8; KIAA1699 protein (SEC8), 
tiRNA 


0 


247 


X69200 I 


3.tigrina Dth-1 gene, exon 2 


0.49 


248 


M33582 ( 


D.cuniculus beta-casein gene, complete cds 


0.006 


250 


AB048604 I 


tfus musculus Neu-2 mRNA for sialidase, complete cds 


0.46 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


251 


U48594 


Heliothis virescens inject storaw nrnt^in f^lPd^ mPMA m rfioi r*Ae 


0.022 


252 


AL162583 


Human DNA sequence from clone RP1-250J21 on chromosome 

Xoll 22-11 4 comnlete sennpnc^ rRnmn canii^ncl 


o o 
0.2 


253 


AF284038 


Cucurbita maxima nhlopm cpmin- 1 mR\J A rr\x\\r\\f*\f* r>Ac 


A 1 A 


254 


AK026699 


Homo sapiens cDNA: FLJ23046 fis, clone LNG02491 


0.6 


255 


AF069199 


± i d u&iuiag ii/vun uciiyuiugcxicisc i gene, miiocnonunai gene 
encoding mitochondrial protein, partial cds 


0.19 


256 


Z 12028 


L Dimoinellifblium pene encoding vamolnr invprtacf* 


u. / 


257 


AB041256 


Panonychus mori mitochondrial COl gene for cytochrome oxidase 
subunit 1 oartial cds strain 'TfDe 


U.Zl 


258 


AK022820 


Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 


0.25 


259 


AE006782 


Sulfolobus solfataricus section 141 of 272 of the complete genome 


1.4 


260 


AF027656 


Homo saoiens cholestervl ester transfer nrotpin pf*np nrnmntpr rfcrion 




261 


Z57957 


H.sapiens CpG island DNA genomic Msel fragment, clone 22b3, 
reverse read cpg22b3.rtla 


n noon'? 


262 


AF266097 


Orthetrum cancellatum large subunit ribosomal RNA gene, partial 
&cquciiwc, uviNrt.- vdiiiic gene, compieie sequence, ana sman suoumt 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


o n.77 


263 


XM_003119 


Homo sapiens dynein light chain-A (LOC51 143), mRNA 


5.6 


264 


AF099990 


Rattus norvegicus Ste-20 related kinase SPAK mRNA, complete cds 


0.54 


265 


AE007394 


Streotococcus nneumotiiae section 77 of* 1 Q4 nf* thp mmnlptp opnnmp 


z.z 


266 


L07545 


Leishmania tarentolae kinetoplast mitochondrial MURF2 edited 
mRNA comnlete cds 




267 


L09706 


Homo sapiens complement component 2 (C2) gene allele b, exons 1 
through 8 




268 


AF 1743 26 


Panesia iinilachni \f%9> rihocomal RMA a^-ne* r^artiol cpnnpn^p* 

i a Lipoid. uimci\*/iLLLi iv/o l iLFUoWlllcll XvlN r\ KCIIC, Udl lltil aGUUCIlC6, 

mitochondrial t?ene for mitochondrial nrnrlnrt 


U.UOj 


269 


Z79794 


M.musculus dystrobrevin gene, exon 8 


1.8 


270 


AB029325 


Oryza sativa gene for water channel protein RWC3, promoter region 
and complete cds 


1.7 


271 


U42461 


Xenopus laevis neural specific DNA binding protein (Xgli3) mRNA, 
complete cds 


1.6 


272 


XM 029784 


Homo sapiens CGI-53 protein (LOC51098), mRNA 


6E-23 


273 


D14011 


Mouse reg II gene for regenerating protein II, complete cds 


0.071 


274 


AF179235 


Homo sapiens short-chain dehydrogenase/reductase 1 (SDR1) gene, 
exons 2, 3, and 4 


2.1 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


275 


AF264911 


Camnvlobacter ieiuni mitative TJDP-N-acetv1mnrannat£*-a1»ninp licracp 
(miirC) gene, partial cds; putative integral membrane protein, putative 
MutS (mutS), restriction and modification enzyme Cjel (cjel), putative 
transferase, and putative integral membrane > 


0.22 


276 


AF266074 


Gomphus exilis large subunit ribosomal RNA gene, partial sequence; 
tRN A- Valine gene, complete sequence; and small subunit ribosomal 
RNA gene, partial sequence; mitochondrial genes for mitochondrial 
products 


0.021 


277 


AF229988 


Cryptosporidium parvum P-type ATPase3 gene, complete cds 


1.9 


278 


AF250346 


Xenopus laevis early growth response protein 1 (egrl) gene, partial 
cds 


0.25 


279 


AL157448 


Homo sapiens mRNA; cDNA DKFZp761E03 1 1 (from clone 
DKFZp761E0311) 


0.00001 


280 


AF386078 


Homo sapiens serine-cysteine proteinase inhibitor clade C member 1 
(SERPINC1) gene, complete cds 


2.1 


281 


S80932 


MVAT5-RX2 VSG=variant surface glycoprotein {promoter} 
[Trypanosoma brucei= African trypanosomes, Genomic, 6583 nt] 


0.003 


282 


U09478 


Dictyostelium discoideum KAx-3 LagC protein (lagC) mRNA, 
complete cds 


0.008 


283 


U63580 


Oxytricha fallax micronuclear actin I gene, partial sequence 


0.022 


284 


AF348411 


Macaca mulatta Per4 pseudogene sequence 


0.0008 


285 


AF053645 


Homo sapiens cellular apoptosis susceptibility protein (CSE1) gene, 
exons 3 through 10 


0.0002 


287 


U90928 


Arabidopsis thaliana glyoxalase II mitochondrial isozyme (Glx2-1) 
mRNA, nuclear gene encoding mitochondrial protein, complete cds 


2.2 


288 


AB050751 


Carabus pseudopusio mitochondrial ND5 gene for NADH 
dehydrogenase subunit 5, partial cds 


1.6 


289 


AF275220 


Procinhilus firaxinifblii 1 2S small *nibimit ribosomal RNA f?ene 
partial sequence; tRNA-Val gene, complete sequence; and 16S large 
subunit ribosomal RNA gene, partial sequence; mitochondrial genes 
for mitochondrial products 


0.003 


290 


NM_014358 


Homo sapiens C-type (calcium dependent, carbohydrate-recognition 
domain) lectin, superfamily member 9 (CLECSF9), mRNA 


0.009 


291 


D25274 


Homo sapiens mRNA, clone:P02ST9 


1E-38 


292 


AF111936 


Callithrix jacchus isolate MJ50037 alpha (1,2) rucosyltransferase 
(FUT1) gene, partial cds 


1.8 


293 


AL592159 


Human DNA sequence from clone RP13-237P14 on chromosome X, 
complete sequence [Homo sapiens] 


0.18 


294 


AKO 13928 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3100003M19, full insert sequence 


1.9 


295 


AF289199 


Mus musculus junctin and aspartyl beta-hydroxylase (Asph) genes, 
partial cds 


0.079 
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SEQID 
NO 
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DESCRIP 


P VALUE 


296 


AE001425 


Plasmodium falciparum chromosome 2, section 62 of 73 of the 
complete sequence 


0.008 


297 


XM_0 11029 


Homo saciens LIM domain-containinf? nreferred translocation naTtne*r 
in lipoma (LPP), mRNA 


0.23 


298 


AF105034 


Arabidopsis thaliana delta7 sterol C-5 desaturase (STE1) gene, 
complete cds 


0.009 


299 


L77040 


Homo sapiens (subclone 8_cl 1 from PI H22) DNA sequence 


9E-14 


300 


AF278536 


Parastrongyloides trichosuri heat shock 70 protein gene, complete cds 


0.61 


302 


Z28216 


S.cerevisiae chromosome XI reading frame ORF YKL216w 


1.7 


303 


AJ008049 


Chrysolina colasi 16S rRNA gene 


0.0007 


304 . 


AF280812 


Glycine max putative Hslpro-1-like receptor mRNA, complete cds 


0.67 


305 


AK007831 


Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone: 1810048N21, full insert sequence 


0.047 


306 


AL163543 


Human DNA sequence from clone RP1 1-424E21 on chromosome 13, 
complete sequence [Homo sapiens] 


0.64 


307 


AF045022 


Bos taurus phosphatide acid-preferring phospholipase Al mRNA, 
complete cds 


5 


308 


AB052731 


Oikopleura longicauda OilBra mRNA for brachyury protein, complete 
cds 


1.8 


309 


AF 194275 


Holbrookia maculata clone HMFL 12S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


6.3 


310 


AJ400814 


Dictyostelium discoideum srfA gene, alternative promoters 


0.024 


311 


AK002854 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610039P13, full insert sequence 


0.68 


312 


U25177 


Helobdella robusta cyclin A mRNA, partial cds 


0.023 


313 


AK011102 


Mus miiscnlns 1 3 davs embrvo liver cDNA R1TCEN full-leneth 
enriched library, clone:2510044F14, mil insert sequence 


0.51 


314 


AK002181 


Homo sapiens cDNA FLJ11319 fis, clone PLACE1010293 


0.0006 


315 


U67282 


Human mis5 homolog (MCM6) gene, exon x 


0.014 


316 


XM_040357 


Homo sapiens chromosome 6 open reading frame 5 (C6orf5), mRNA 


1.5 


317 


AK006073 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 17000 17M07, full insert sequence 


6.7 


318 


AE001381 


Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 


0.18 


319 


NM 017678 


Homo sapiens hypothetical protein FLJ20127 (FLJ20127), mRNA 


3E-57 


320 


BC004680 


Mus musculus, Similar to coxsackievirus and adenovirus receptor, 
clone MGC:5878 IMAGE: 3 500491, mRNA, complete cds 


0.24 


321 


AK002181 


Homo sapiens cDNA FLJ1 13 19 fis, clone PLACE1010293 


0.007 


322 


U90946 


Dictyostelium discoideum myosin heavy chain kinase B (MHCK-B) 
mRNA, complete cds 


0.003 


323 


Y14277 


Drosophila melanogaster mRNA for nuclear protein SA 


0.062 
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SEQID 
NO 
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DESCRIP 


P VALUE 


324 


Z17272 


H. sapiens DNA segment containing (CA) repeat; clone AFM248wcl; 
single read 


0.25 * 


325 


M37526 


E.gracilis chloroplast P-700 chlorophyll alpha apoprotein (psaA and 
psaB) genes, complete cds 


0.025 


326 


Y14952 


Mus musculus gene encoding immunoglobulin J chain precursor 


0.24 


327 


AE006596 


Streptococcus pyogenes Ml GAS strain SF370, section 125 of 167 of 
the complete genome 


6.8 


328 


AF079804 


f^anfiiHa albir^an*; N-flp^*tv1-plnpnoji'minp-f>-nhn<;nhatp HeapptvlaQP 

(DAC1) and glucosamine-6-phosphate deaminase (NAG1) genes, 
complete cds; and hexokinase (HXK1) pseudogene, complete 
sequence 


2 


329 


XM_029315 


Homo <winien^ Machario-To<tfinh disease fsryinocerebellar ataxia 3 
olivopontocerebellar ataxia 3, autosomal dominant, ataxin 3) (MJD), 
mRNA 


7.8 


330 


AK022820 


Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 


0.24 


331 


AF059275 


Mus musculus heat shock transcrintion factor 1 fHsfl) cene nartial 
cds 


0.085 


332 


AL049263 


Homo sapiens mRNA; cDNA DKFZp564F133 (from clone 
DKFZp564F133) 


0.059 


333 


AP001233 


Homo sapiens genomic DNA, chromosome 2pll.2, clone:cos607/2 


0.003 


334 


AC017021 


Homo sapiens BAC clone RP11-208M4 from 7, complete sequence 


0.0000005 


335 


U03248 


Human chromosome 21 clone pVC 1.23c 


3E-56 


336 


XM_046758 


Homo sapiens tensin (TNS), mRNA 


2 


337 


AJ279150 


Homo sapiens partial TFNR gene for transcription factor-like nuclear 
regulator, exon 3 1 


0 


338 


AF282896 


Rattus norvegicus chemokine RANTES gene, promoter sequence 


0.001 


339 


AK000939 


Homo sapiens cDNA FLJ10077 fis, clone HEMBA1001864 


0.21 


340 


AF188517 


Staphylococcus aureus Bit-like protein SbtA (sbtA) gene, complete cds 


0.008 


341 


AF202562 


Homo sapiens DNA methyltransferase (DNMT1) gene, exon 13 


0.22 


342 


XM_0 17967 


Homo sapiens hypothetical protein FLJ 13732 similar to tensin 
(FLJ13732), mRNA 


2.7 


343 


AF051685 


Homo sapiens clone SBS-12 SATB1 binding sequence in vivo 


0.21 


344 


M77188 


Campylobacter jejuni argininosuccinate lyase (argH) gene, complete 
cds 


0.22 


345 


U95137 


Rattus norvegicus Na+-Ca+ exchanger (NCX1) gene, exon 1-Ht 


0.24 


346 


AKO 19943 


Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430403G16, full insert sequence 


0.74 


347 


BC005571 


Mus musculus, ubiquitin specific protease 14, clone MGC:7106 
IMAGE:3 157723, mRNA, complete cds 


0.41 


348 


XM_026968 


Homo sapiens hypothetical protein MGC2668 (MGC2668), mRNA 


4E-16 
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NO 


ACCESSN 


DESCRIP 


P VALUE 


349 


• 

AK024903 


Homo sapiens cDNA: FLJ21250 fis, clone COL01253, highly similar 
to AB020527 Homo sapiens mRNA for Na/P04 cotransporter 
homolbg 


7E-28 


350 


U85645 


Oryctolagus cuniculus fructose 1,6, bisphosphate aldolase (AldB) 
gene, complete cds 


0.075 


351 


AF388026 


Homo sapiens fibrinogen, B beta polypeptide (FGB) gene, complete 
cds 


5.8 


352 


AY028321 


Homo sapiens pro-melanin-concentrating hormone-like 2 protein 
(PMCHL2) gene, exons 4, 5a, and 5b, alternatively spliced 


0.0003 


353 


AL513491 


Human DNA sequence from clone RP1 1-79A21 on chromosome X, 
complete sequence [Homo sapiens] 


1.9 


354 - 


M92068 


Chimpanzee retrovirus-like sequence-isoleucine b (RTVL-Ib) gene, 5* 
and 3' LTR 


1.5 


355 


XM_002120 


Homo sapiens zinc finger protein 262 (ZNF262), mRNA 


0.007 


356 


X02435 


Tetrahymena mitochondrial gene for tRNA-Phe (GAA) 


0.028 


357 


XM_033459 


Homo sapiens syntaxin binding protein 1 (STXBP1), mRNA 


2.1 


358 


NM_015770 


Mus musculus non-agouti (a), mRNA 


0.64 


359 


AJ296103 


Staphylococcus aureus repNVH99 gene for replication protein and 
smr gene 


0.15 


360 


Z12840 


O.cuniculus mRNA for protein of unknown function 


e-131 


361 


XM 038370 


Homo sapiens hypothetical gene supported by AK001938; AK001941 
(LOC91424), mRNA 


0.0002 


362 


AF108684 


Rhipidomys mastacalis cytochrome B (cytB) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.009 


363 


NM_020045 


Mus musculus HIRA-interacting protein 5 (HIRIP5), mRNA 


1.8 


364 


AK024439 


Homo sapiens mRNA for FLJ00029 protein, partial cds 


0.73 


365 


AK025445 


Homo sapiens cDNA: FLJ21792 fis, clone HEP00441 


2.1 


366 


AP000307 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:f6C7, complete sequence 


0.00004 


367 


AE000818 


Methanobacterium thermoautotrophicum from bases 264585 to 
276866 (section 24 of 148) of the complete genome 


2.3 


368 


AK001164 


Homo sapiens cDNA FLJ10302 fis, clone NT2RM2000042 


0.13 


369 


XM 029101 


Homo sapiens KIAA0947 protein (KIAA0947), mRNA 


0.95 


370 


AK022191 


Homo sapiens cDNA FLJ12129 fis, clone MAMMA1000198 


0.029 


371 


Z54280 


S.scrofa gene for skeletal muscle ryanodine receptor 


1.9 


372 


AY037259 


Arabidopsis thaliana AT3g52120/F4F15_230 mRNA, complete cds 


9.6 


373 


XM_034599 


Homo sapiens acid sphingomyelinase-like phosphodiesterase 
(ASM3A), mRNA 


0 


374 


XM_046914 


Homo sapiens SWI/SNF related, matrix associated, actin dependent 
regulator of chromatin, subfamily f, member 1 (SMARCF1), mRNA 


3E-16 


375 


M86340 


Mycoplasma-like sp. 23S and 16S ribosomal RNAs, Tyr-transfer RNA 
and lie-transfer RNA, complete cds 


0.24 


376 


AB019507 


Drosophila kanekoi gene for glycerol-3 -phosphate dehydrogenase, 
complete cds 


1.3 
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SEQIE 
NO 
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P VALUE 


377 


AF106915 


x nuuu^us auii^ui ua iuicnuz,iiig nuniione DGUJ suDunii iTUvJNA, partial 
cds 


0.61 


378 


AL3 54800 


Human DNA sequence from clone RP1-3 1615 on chromosome 20 

(^nntjliriQ tllf* ^ ' PnH /YF til** VTT71T3 nana £t-rkr-tnsJi«sv n Iri -C *1 

v^uuuuiia uic j ciiu oi ine jvlt jd gene encoding a kinesin tamily 
member 3B (KIAA0359) and ESTs, complete sequence [Homo 
sapiens] 


0.0001 


"379 


AL133430 


Homo Sflllien*! TtlRNA fiill lf*noth inc^rt rHMA r*1r\na rjT TDfYTfc K A i~*T2 
xiwiiivF aapicua iiuvL^rt. mil lcllgul inSeiT CJLJINA Clone liUK.iJlJVlAOxi 

233360 


U.oo 


380 


AF085894 


Homo sapiens full length insert cDNA clone YP97D1 1 


0.0002 


381 


AJ286750 


Sesbania rostrata mRNA for phosphoenolpyruvate carboxylase (pepc 
eene) 


U.UOo 


382 


XM_040415 


Homo sapiens region containing hypothetical protein; mutL (E. coli) 
homolog 3 (LOC82389), mRNA 


e-izo 


383 


Y09794 


Arabiei microsatellite DNA 


1.2 


384 


AB047905 


Macaca fascicular! s brain cDNA rlnnp-OnnA-i ^oi i 


1 © 
l.O 


385 


AK001451 


Homo sapiens cDNA FLJ10589 lis, clone NT2RP2004389, weakly 
similar to PRORART F M7TOPT-TfYNJT>P tat ahq DTPncn\>r at 

annual «j i xvwJ_j J rYJ_J.L,JL_/ 1VJJ. 1 \^^OV^lNl^l\J./VL, 4Uij xvlljlJ^i JIVl Al > 

PROTEIN S9 PRECURSOR 


1.5 


386 


AFO 10473 


ouiiiz.uo<x^uii<iiuiiiyLA;& pumuc niyobin-iiKe protein ops (sptsj niKJN A. 
partial cds 


A 1 O 

u. iy 


388 


AL133604 


Homo sapiens mRNA; cDNA DKFZp434G1615 (from clone 
DKFZp434G1615) 


U.UUUUo 


389 


XM 009252 


Homo sapiens methyl-CpG binding domain protein 3 (MBD3), 
mRNA 


0.055 


390 


AB053116 


Polvandrocama micalHpncic mTJ AFA fn-r nutat-ix/** onUonrAtt/* nAti/ia 

A \fi-j aj.i\M±\j\^ai ^jcl iiiAoaJVidialo IIlxvl> r\ IUI UUlct 11 VC dlKaryOIlC PCllQe 

chain release factor subunit 1, complete cds 


1.5 


391 


AL 13 8666 


S Dombe chromo<!Otnp T pnQmiH r&Qd. 

• pv/niUv N/lll W-lllV/oWlllV^ X lAJollUU CU7t 


i o 

1.3 


392 


AF091933 


Pleurotus ostreatus small subunit ribosomal RNA gene, partial 
seonence* mitochondrial ppn** "For mitor-TionHt-i q1 nrnHiw 

ijv\juviiw, iliii\JW11\J11\X1 Idl gCllC LKJL IIl1L\J\^IL\J LL\JLl Idl |jrUu.UCL 


o mo 
U.U /o 


393 


Z37964 


Staphylococcus sp. genes encoding QacC and replication protein 
(rep827) 


U.U to 


394 


AB020712 


Homo SaoienS mRNA for KTAA090^ nmfrin rnmnlptp' rHc 




395 


BC009023 


Mus museums, Similar to prostaglandin E receptor 4 (subtype EP4), 
clone MGC'7183 IMAGE'3481696 mRNA romnl^tprdc 


1 < 

L.J 


396 


XM 010128 


Homo sapiens odz (odd Oz/ten-m, Drosophila) homolog 1 (ODZ1), 
mRNA 


U.JJ 


397 


XM 011386 


Homo sapiens hypothetical protein PRO0813 (PRO0813), mRNA 


5.3 


398 


XM_040694 


Homo sapiens AD03 1 protein (AD03 1), mRNA 


1.3 


399 


AE006177 


Pasteurella multocida PM70 section 144 of 204 of the complete 
genome 


5 


400 


AL113079 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


1.2 


401 


U39694 ] 


Mycoplasma genitalium section 16 of 51 of the complete genome 


0.84 


402 


XMJH6314 ] 


Homo sapiens ankyrin 2, neuronal (ANK2), mRNA 


4.6 
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403 


XM_045362 


Homo Cjini Pn<» lilf^lv fifvmrilney nf rot L-inopA T\ ~ i , _ — . 

iiwiiiu acipx&ifs ii&ciy xiuiiiuiug or rai Kinase u -interacting substance of 
220 kDa; KIAA1250 protein (KIAA1250), mRNA 


7E-77 


404 


NM 028513 


lu "^uiua rvixvjjj.'y Cl^iN^-V I j gene {I /UUU5zJvl5J\lK) 

mRNA 


9 


405 


U47416 


ouicuium aiiupiryiiuiii iNAun aenyarogenase suounit (nanr) gene, 
chloroplast gene encoding chloroplast protein, partial cds 


0.2 


406 


NC_002502 


Lactococcus lactis subsp. lactis plasmid pCI305, complete sequence 


5.3 


407 


AF269561 


StaDflvl0C0CC1l<I PniHprmi Hie ctroin QD 1 r*\r\ne* c^n lOtAaA/: 

uwl r , v ivA-uwtua ^jlfiuci iiuuia audllL orvl ClOUC Step. lUlUeUu genomic 

sequence 


0.19 


409 


AF336128 


LuniDV skin disease vim*; strain Tslppthlino v r»at-Hai c&mtan^a. 


1.5 


410 


AB058397 


Orvza sativa CHS f?ene for rhulrrmp cvntfiac#=» rr\mn\t*+n 
vjt*j^c» ijauTo viu ^v/iiv^ iui uiicu^viiic oyiiiiictsc, complete cos 


0.16 


411 


AE006830 


Sulfolobus solfataricus section 189 of 272 of the complete genome 


0.21 


412 


AY035064 


AxabidoDsis thaliana unknown nrntpin ^TOfiOin inn/AT"3/*^'2nno.\ 
mRNA, complete cds 


0.22 


413 


AF158049 


Platvbrachvs decern manil a 16^1 riHncrrmQl pkta nono nqr fi<ti 

A ciwiijr o uvwv/iiiiiiat«uia 1UO 1 lUVJoUlIldl x\J.N/\ gene, pcUTiai 
SeQUence* mitochondrial ffPTlP for mitnrVirvnHrifil T»rr»Hnr-t 


0.063 


414 


NM 026444 


Mus musculus citrate synthase (C<iS mRNA 


1 H 
1. / 


415 


BC007820 


Homo sapiens, clone IMAGE: 43 03 165, mRNA 


u 


416 


AJ295228 


Caenorhabdius elegans mRNA for MAGUK protein DLG-1 (dlg-1 
cene) 


0.2 


411 


AB028633 


Flammulina vehitine*; fnitrwhrvnHrial opnp fr»r- tykia -rmKrmA-vncA d\t a 
* icuimiuiiiia vuuupca iiinu^iiuiiuridi gune ior uin/v polymerase, .kjn a 

polymerase, complete and partial cds 


0.6 


418 


Y09794 


A.rabiei microsatellite DNA 


1.2 


419 


AB054063 


PaffTUS maior lol pene for linrwvmtfMn tinacp rr\mr\1<=it^ oHc 


€ C\ 

J.y 


420 


AF323928 


Plasmodium falciparum GcpE (gcpE) gene, complete cds 


0.14 


421 


M31012 


Chicken MHC class I B-FIV-B12 alpha-chain gene, complete cds 


1.2 


422 


Z49806 


B taurus mRNA for adenvlvl cvpIjiq** tvn*» vtt 


6.1 


423 


AB009907 


Luciola kiiroiwae mitnrhnnHrijit DMA fnr l^Q rDKTA «ir*ioi 

i^u^ium Aiuuiwac iiiiiv/viiuiiviiiai J-/IN/A. ior ioj ixviN/v, parxiai sequence 


0.003 


424 


X76693 


C Coli (TJA585^ KOdB aenp fnr <;iin£*rnviHf Hicmiitacp 




425 


AF284942 


v^arcx Ldpiidid lkjna- i nr, partial sequence, tm 1 -trnJL intergemc 
a^awbi, ixvi^.rt.-j-fcu, diiu LiiLL> - LriLr iniergenic spacer, complete 
seouence* and tRNA-Php n^rtinl cpnn^nrp' ^Viirtmi-iioct Aanor f>vr 

j\a|uviiv^, culu uvi^i^ lit, \JCLL licli ov^UCIlCC, UlilUruplaal gcncS IOr 

chloroplast products 




426 


AK017986 


Mus musculus adult male thvmns cDNA RTK"FM fnill.T^natfi t*rtrir>hf*A 

L **'** 4J ^"uovuiuj dumi nielli Li i y in uo Vvl-Zl^l/A., XvlX^-C/lN 1.UA1 ICIlliUl eiUTCUeU 

library, clone:5830443G21, full insert sequence 


5.1 


427 


AL5 12549 


S.pombe chromosome I BAG pB2B4 


1.6 


428 


AF283669 ] 


Homo sapiens kallikrein 14 (KLK14) gene, complete cds 


2.3 


429 


Z 12842 ( 


O.cuniculus mRNA for protein of unknown function 


5E-73 


430 


XM_0 16894 ] 


Homo sapiens LOC86241 (LOC86241), mRNA 


3E-65 


431 


X96616 ] 


P.primaurelia gene encoding 156D surface antigen 


0.23 


432 


X02175 ! 


Schizosaccharomyces pombe cdclO start gene 


0.027 


433 


AK021807 ] 


Homo sapiens cDNA FLJ1 1745 fis, clone HEMBA1005526 


0.001 
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NO 
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DESCRIP 


P VALUE 


434 


XM 005233 


Homo sapiens protein phosphatase 2 (formerly 2A) reeulatorv subnnii 
B (PR 52), alpha isoform (PPP2R2A), mRNA 


5E-09 


435 


X52623 


Rice 4-CL gene for 4-coumarate-CoA ligase (EC 6.2. 1. 12) 


v.UJ 


436 


AK005294 


Mus musculus adult male cerebellum cDNA, RIKEN full-length 
enriched library, clone: 1500019016, full insert sequence 


1E-16 


437 


AE002337 


Chlamydia muridarum, section 65 of 85 of the complete genome 


O 04. Q 


438 


AF106574 


Caenorhabditis elegans cosmid E02D9, complete sequence 


0.7 


439 


X56866 


S.alba5SrRNA gene 


0.74 


440 


AF127158 


Drosophila mauritiana Cu-Zn superoxide dismutase (Sod) gene, exons 
1 and 2 and complete cds 


1.7 


441 


U08466 


Nephila clavipes alanine tRNA gene, complete sequence 


2.1 


442 - 


AY021236 


Oryza sativa microsatellite MRG3561 containing (TA)X22, genomic 
sequence 


1.5 


443 


L09262 


Arabidopsis thaliana phytochrome b gene, exons 2 and 3 


0.079 


444 


AC007095 


Homo sapiens BAC clone RP1 1-303A22 from 7, complete sequence 


0.23 


445 


AKO 12607 


Mus musculus 1 1 days embryo cDNA, RKEN full-length enriched 
library, clone: 2700094F01, full insert sequence 


0.19 


446 


AF309415 


Cyprinus carpio ovarian fibroin-like substance-2 mRNA, partial cds 


0.008 


447 


X03956 


Potato (tetraploid variety Maris piper) patatin gene 


0.003 


448 


BC008617 


Mus musculus, clone IMAGE:3588380, mRNA, partial cds 


0.07 


449 


AJ242146 


Mauritiella armata 18S rRNa gene (partial), 5.8S rRNA gene, 26S 
rRNA gene (partial) and internal transcribed spacers 1 and 2 (ITS1, 
ITS2), clone 1 


0.07 


450 


M67489 


Bovine desmocoliin mRNA, complete cds 


0.066 


451 


AB048954 


Macaca fascicularis brain cDNA, clone:QnpA-10509 


0.48 


452 


AE001136 


Borrelia burgdorferi (section 22 of 70) of the complete genome 


0.18 


453 


AE001398 


Plasmodium falciparum chromosome 2, section 35 of 73 of the 
complete sequence 


0.025 


454 


AF2 18904 


Homo sapiens attractin precursor (ATRN) gene, exon 19 


1.6 


455 


AK025078 


Homo sapiens cDNA: FLJ21425 fis, clone COL04162 


1E-71 


456 


Ml 1449 


Yeast mitochondrial ori2-ori7 region DNA with putative peptide 


0.34 


457 


XM 031156 


Homo sapiens hypothetical gene supported by AB007970 
(LOC90360), mRNA 


0.006 


458 


AK015180 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930422I22, full insert sequence 


0.007 


459 


AC090422 


Homo sapiens clone RP1 1-691 A12, complete sequence 


0.013 


460 


U06154 


Human clone 2004V-I-1 from chromosome lq terminal region 


1E-19 


461 


AE001123 


Borrelia burgdorferi (section 9 of 70) of the complete genome 


0.83 


462 


Y17797 


Enterococcus faecalis gph, ydjH, ydjG, ydjl, pbp4 and ydiC, ORF2 
and ORF3 genes, partial 


0.22 


463 


AB026906 


Homo sapiens SDHD gene for small subunit of cytochrome b of 
succinate dehydrogenase, complete cds 


0.042 


464 


L13942 ] 


Human (clone Cos35) glycerol kinase (GK) gene, exons 1-3 


0.042 


465 


U81510 


Spodoptera frugiperda caspase-1 mRNA, complete cds 


0.041 
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NO 
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DESCRIP 


P VALUE 


466 


AJ003222 


Borrelia burgdorferi flaK flhF thdF oidA <riHR ttioyR orfl nrf? 
orf3, orf4 and orf5 genes 


0.58 


467 


AY034379 


f^ansiciim annniim rirjinched-chain aminn acid amitintranofpraQf* 

V/ClL/dlvUlll OllllUUlU L/X (UlwilvU^llCUll CXAXXXALVS tlvlU CUIUIIUU CUliJivl ClOv 

mRNA, complete cds 


1.8 


468 


AF120317 


Mus musculus mahogany (mg) gene, exons 


0.56 


469 


AE007451 


Streptococcus pneumoniae section 134 of 194 of the complete genome 


0.005 


470 


AF305819 


Homo sapiens PRO0777 mRNA, complete cds 


0.055 


471 


AF045950 


Mus musculus D16Jhu5 YAC 183G11 acentric end, partial sequence 


0.22 


473 


XM 010416 


Homo sapiens GTP-binding protein ragB (RAGB), mRNA 


0.14 


474 - 


XM 028004 


Homo sapiens amyotrophic lateral sclerosis 2 (juvenile) chromosome 
region, candidate 3 (ALS2CR3), mRNA 


0.00005 


475 


AK021820 


Homo sapiens cDNA FLJ1 1758 fis, clone HEMBA1005609, highly 
similar tn Wnmo sanieiis mRNA" cDNA T")TCKZr)S64TC1 33 ffroiTi clone 

OAAIAAACLA LVs A AV/AAlvr OClL/AV-IiiJ A 1 IX VI. ^ -£*, wl/ X ^ XX MS 1V1 £JUi^UTl\.l~r^ ^IIVIII vlvllv 

DKFZd564K133 , > 


0.004 


476 


AE002579 


Drosophila melanogaster genomic scaffold 142000013385638, 
comnlete seouence 


6.7 


477 


AF047677 


Homo sapiens dystrophin (DMD) gene, deletion breakpoints 1-3 in 
intron 5 


0.64 


478 


AL589203 


Human DNA sequence from clone RP1 1-391F23 on chromosome 6, 
roirmlete sentience H-lorno sar)iensl 


0.00009 


479 


AF265343 


Danio rerio protein kinase Npk mRNA, complete cds 


0.14 


480 


XM 031879 


Homo sapiens CREB/ATF family transcription factor (CREB-H), 
mRNA 


0.44 


481 


XM 050760 


Homo sapiens ATP-binding cassette, sub-family C (CFTR/MRP), 
member 2 (ABCC2), mRNA 


0.077 


482 


AF069508 


T-Tomo ^aniens hi i man endogenous retrovirus HML6 28 *?aff and ool 
pseudogenes, complete sequence 


0.23 


483 


AK022914 


Homo sapiens cDNA FLJ12852 fis, clone NT2RP2003445 


2 


484 


XM_032753 


1-Tnmn saniens hvnothetical j?ene simnorted bv ATC022412 
(LOC90584), mRNA 


1.9 


485 


XM 018436 


Homo sapiens hypothetical protein FLJ14327 (FLJ14327), mRNA 


0.66 


486 


AF130764 


Equus caballus FN1 gene, partial sequence 


4 


487 


AK024286 


Homo sapiens cDNA FLJ14224 fis, clone NT2RP3004028 


2E-18 


488 


AF348342 


Plasmodium malariae caseinolvtic orotease C ( cloC) sene Dartial cdsi 
apicoplast gene for apicoplast product 


0.23 


489 


XM_029336 


Homo sapiens similar to potassium voltage-gated channel, subfamily 
H (eag-related), member 2; ether a go-go related (M. musculus) 
(LOC90 134), mRNA 


6.2 


490 


BC005951 


Homo sapiens, clone MGC: 14588 1MAGE:4249174, mRNA, 
complete cds 


5E-35 


491 


AJ391749 


Vicia melanops genomic repetitive element, clone M29 


0.13. 


492 


AF154170 


Sceloporus dugesii 12S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.21 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT T IF 


493 


X84060 


H. sapiens TCF1 1 gene, exon 3-6 


0.21 


494 


AP000418 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MPK17 


0.003 


495 


AJ277428 


Scenedesmus obliauus mitochondrial tRNA-Ser tRNA-Asn 16S 
rRNA, 23S rRNA genes, strain KS3-2 


0.65 


496 


LI 1447 


Mvcoolasma hvorhinis reneat reirions in notential metal hinriinp 
protein gene region 


0.046 


497 


AY029764 


Mus musculus TRAM1 mRNA, complete cds 


0.23 


498 


XM_031173 


Homo sapiens parathyroid hormone (PTH), mRNA 


0.22 


499 


AF270168 


Staphylococcus epidermidis strain SRI clone step. 105 lf02 genomic 
sequence 


0.24 


500 


AF336131 


Lumpy skin disease virus strain Neethling, 3* partial sequence 


3.5 


501 - 


AF195272 


Mus musculus serine/threonine kinase AIE1 gene, complete cds 


6.7 


502 


NC 002322 


Lanneii*; mbellii*! mitochondrion conrolete pennme 


0.26 


503 


AK007179 


miisciiliis adult male testis cDNA RTTCFN fiill-lenpth enrit*heH 
library, clone: 1700 112J16, full insert sequence 


0.24 


504 


U32706 


Haemophilus influenzae Rd section 21 of 163 of the complete genome 


0.68 


505 


AF247555 


Drosophila mauritiana heat shock protein 68 gene, partial cds 


1.9 


506 


NM_011146 


Mus musculus neroxi some nrol iterator activated recentor pamma 
(Pparg), mRNA 


0.024 


507 


M26940 


Mouse beta-casein gene, complete cds 


0.002 


508 


AB003043 


Phycomyces blakesleeanus classll chitin synthase (PbCHSl) gene, 
complete cds 


0.68 


509 


U89348 


Human papillomavirus type 16 variant, complete sequence 


0.62 


511 


AF270160 


Staphylococcus epidermidis strain SRI clone step. 1051 e03 genomic 
sequence 


0.077 


512 


AF1 19554 


Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
complete cds 


0.078 


513 


AYO 14991 


Waterstoniella sp. CAM-2000 cytochrome oxidase subunit I (COI) 
gene, partial cds; mitochondrial gene for mitochondrial product 


0.68 


514 


NM_013037 


Rattus norvegicus Fos-responsive gene 1 (St2), mRNA 


0.085 


515 


AF 169003 


Hepatitis C virus isolate G2aKl polyprotein gene, complete cds 


0.028 


516 


U23442 


Tetrahymena thermophila RR internal deletion sequence 


0.25 


517 


AF381638 


Homo sapiens SNP ECU psoriasis susceptibility gene candidate 
interval, partial sequence 


0.0003 


518 


NM_023755 


Mus musculus Tcfcp2-related transcriptional repressor 1 (Crtrl- 
pending), mRNA 


0.009 


519 


NM_004706 


Homo sapiens Rho guanine nucleotide exchange factor (GEF) 1 
(ARHGEF1), mRNA 


3.4 


520 


X67164 


P.hybrida T-DNA integration region 


0.063 


521 


AB042275 


Canine herpesvirus CICP0 gene for infected cell protein 0, complete 
cds 


0.66 


522 


AB019942 


Arabidopsis thaliana gene for sigma factor SigA, complete cds 


1.4 
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SEQID 
NO 
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DESCRIP 
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523 


AF270382 


Staphylococcus epidermidis strain SRI clone step. 404 lf02 genomic 
sequence 


0.23 


524 


AF284897 


Carex supina tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-trnF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
chloroplast products 


0.23 


525 


AK000493 


Homo sapiens cDNA FLJ20486 fis, clone KAT08039 


0.17 


526 


U67535 


Methanococcus jannaschii section 77 of 150 of the complete genome 


0.6 


527 


AE002754 


Drosophila melanogaster genomic scaffold 142000013385431, 
complete sequence 


1.8 


528 - 


AF241656 


Caenorhabditis elegans SR protein specflc kinase SPK-1 mRNA, 
complete cds 


1.7 


529 


AF173828 


Caenorhabditis elegans putative potassium channel Slo-2 mRNA, 
complete cds 


0.21 


530 


AE005382 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 1 of 
290 


2.3 


531 


Z36154 


S.cerevisiae chromosome II reading frame ORF YBR285w 


0.68 


532 


AF054126 


Rattus norvegicus polymorphic marker D6UI A6 sequence 


0.061 


533 


AK004780 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 12000 15C19, full insert sequence 


0.7 


534 


AF352555 


Staphylococcus aureus subsp. aureus strain ATCC31889 leukocidin 
LukS component gene, partial cds 


0.2 


535 


AF352555 


Staphylococcus aureus subsp. aureus strain ATCC31889 leukocidin 
LukS component gene, partial cds 


0.1 


536 


J05246 


Mouse alpha-fetoprotein (AFP) gene, 5 1 flank 


0.024 


■ 537 


AF021935 


Rattus norvegicus mytonic dystrophy kinase-related Cdc42-binding 
kinase (MRCK) mRNA, complete cds 


e-141 


538 


Y12573 


D.melanogaster Jun and 14-3-3 zeta gene 


0.5 


539 


AF248584 


Moraxella catarrhalis UDP-glucose dehydrogenase gene, partial cds; 
and UDP-glucose 4-epimerase (galE) and L-glutamine:D-fructose-6- 
phosphate aminotransferase (glmS) genes, complete cds 


0.026 


540 


AL136554 


Homo sapiens mRNA; cDNA DKFZp761D0712 (from clone 
DKFZp761D0712) 


0.089 


541 


AF158049 


Platybrachys decemmacula 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.082 


542 


AY023618 


Oryza sativa microsatellite MRG5943 containing (TTA)X55, closest 
to marker RG908, genomic sequence 


0.00009 


543 


AL591498 


Human DNA sequence from clone RPll-113L12on chromosome 13, 
complete sequence [Homo sapiens] 


3E-14 


544 


AF183908 


Danio rerio P450 aromatase (cypl9b) mRNA, complete cds 


0.069 


545 


AKO 16658 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933405A14, full insert sequence 


0.53 


546 


ABO 14084 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, Cosmid clone:TY7A5, complete sequence 


0.011 
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SEQID 
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547 


AF285076 


aciena reticularis tKNA-.Leu, partial sequence; tmL-txnr mtergeruc 
spacer, complete sequence; and tRNA-Phe, partial sequence; 

^iuuiu|/iasi g^iiCa 1UI UlllUlUpiabl D rOQUClS 


0.19 


548 


XM 047604 


Homo sapiens coproporphyrinogen oxidase (coproporphyria, 

iimucxupuipiiyiicij \\,rvj) y lll2viN/\ 


0.17 


549 


Z81538 


Caenorhabditis elegans cosmid F45H10, complete sequence 


0.24 


550 


AF026800 


runuuius netcrociinis cyiocnrome 1t4jU1A (CY-rlA) rriRNA, 
cornnlete cd^ 

vWlll^/lVlv vUj 


1.9 


551 


XM_051897 


nuiiiu ^dpiciib p j lu^piio lipase az, group ivA (^cytosoiic, calcium- 
dependent) (PLA2G4A), mRNA 


9E-08 


552 


XM 015200 


Hnmn ^flrvipriQ livrw*thf*tir*ii1 rwr\\(^\ n T7T 101*70 A /ttt TOT7'5/i\ m D\i a 
x luinu aupxciia xi^puuicuwii proiem rj^jzz/z^l- ^rl_/JZZ mxvlNA 


2.1 


553 


AF266244 


vjiniciiuiYb imidDiiis adenylate cyclase type v-iiKe protein mRNA, 
complete cds 


6 


554 


Z2 1 8 1 8 

* • X O X o 


n.&apiciib t<di cinueiiiuryonic antigen gene 


0.23 


555 


AE002100 


Ureaplasma urealyticum section 1 of 59 of the complete genome 


0.023 


556 


XM 04S17Q 


nomo sapiens nypotneucai gene supported by AL 13 7724 


0.19 


557 


X16509 


Rice alpha-amylase gene 


1.8 


558 


M33753 


xy.iucidiiugaMer trumus protein nuviNA, complete Cds 


5.4 


559 


D32022 


nuiiicui iiiivrN/\ ior i ceu receptor v oeta o lukj, parciai sequence 


1.5 


560 


AF1 88893 


Homo sapiens guanidinoacetate N-methyltransferase (GAMT) gene, 

f*AmiVlf*i"f* r*Hc 
^vtxiipicLC uia 


0.18 


561 


AF367310 


Arabidopsis thaliana At2g36880/TU8.6 gene, complete cds 


0.11 


562 


AF207699 


maeis guineensis agamous-iiKe jylaljo oox protein URMADSl 
mRNA, complete cds 


0.45 


564 


ABO 14586 


riKjiLHj acipiciia uixviN/\ ior xvi/vtt.uooo protein, partial cqs 


0.17 


565 


L77040 


Homo sapiens (subclone 8_cll from PI H22) DNA sequence 


8E-14 


566 


NM_024942 


Homo sapiens hypothetical protein FLJ13490 (FLJ13490), mRNA 


0.013 


567 


U88158 


i cudiiymciid uiermopmia micronuciear aeveiopmentaily eliminated 
sequence region 


0.18 


568 


AF2 10744 


Nannochloropsis oculata large subunit ribosomal RNA gene, partial 


0.057 


569 


AL590384 


nullum sequence xrom cione Kr 1 1-J4VA10 on cnromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


2E-15 


570 


AY034379 


Capsicum annuum branched-chain amino acid aminotransferase 
mRNA, complete cds 


1.2 


571 


AF038283 


Campylobacter jejuni cytolethal distending toxin A (cdtA), cytolethal 
distending toxin B (cdtB), and cytolethal distending toxin C (cdtC) 
genes, complete cds 


0.4 


572 


X98807 


A. thaliana mRNA for peroxidase ATP21a 


1.8 


573 


AF1 11941 


Dictyostelium discoideum AX4 development protein DG1 148 
(DG1148) gene, complete cds 


0.0009 
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SEQ ID 
NO 


ACCESSN 




P VALUE 


574 


AE002247 


^nidmyaopniia pneumoniae AKJ9, section 74 of 94 of the complete 


1.7 


575 


AF247555 


Drosophila mauritiana heat shock protein 68 gene, partial cds 


1.4 


576 


AJ237599 


CrflllllC 051 line mRMA -fXr frnrtiiAic tiAmnl O 

vjouua &cuAuo iiiivtN/\ ior iroquois nomoiogue-.z 


0.056 


577 


NM 030995 


xxduua iiuxvcgi^ub iviicroiuDuie-associaiea protein la (Map la), mRNA 


0.21 


578 


XM 027638 


Homo sapiens KIAA0240 protein (KIAA0240), mRNA 


1.5 


579 


AF3 10886 


DlCtVfWtfililllTl Hi ^fOlH^lim T? OP A /voaA\ nana AA-mnlA^-a n 4« 

i/iuijrudibiiuiii uiavAJtucuiii ixdc-A ^rac/\j gene, complete cos 


0.065 


580 


AB001090 


Homo sapiens gene for H-cadherin, exon 1 


0.2 


581 


BC009101 


ivAur> iiiuauuiua, oiiiuidi 10 nypotneiicai protein ±>isj H Z<p/olcl21, clone 
MGC:12146 IMAGE: 3 7 10846, mRNA, complete cds 


0.0008 


582 


XM 011473 


numu ^dpiciii serine <pr cysteine^ proteinase innioitor, clade JB 

{ovalbumin^ mpmhpr 0 f^PTJPTNTRQ^ mDMA 


1.8 


583 


AF004051 


Galium hypocarpium rpsl6 gene, chloroplast gene, partial intron 
semi en e£* 


0.068 


584 


AE007440 


outpujuuv^L-ua piicuiiiuiudc section izo oi iy4 or me complete genome 


0.066 


585 


L25647 


Homo sapiens fibroblast growth factor receptor gene (located in the 
M/iiuoi lv±± l\s j aigiidi pepuue diiu consecutive exon 


0.069 


586 


AF191079 


Stealth virus 1 clone 3B632, genomic sequence 


0.009 


587 


L20465 


wit^nuccicd vuivuius nicotinic acetyicnoiine receptor mRNA, partial 
cds 


0.72 


588 


NM_008713 


Mus musculus nitric oxide synthase 3, endothelial cell (Nos3), mRNA 


2 


589 


NM_004052 


nun 10 sapiens dljlz/ adenovirus £lld iykL; -interacting protein 3 
(BNIP3), nuclear gene encoding mitochondrial protein, mRNA 


4E-26 


590 


U40228 


Plasmodium falciparum ADP-ribosylation factor 1 (pARFl) mRNA, 

pnmnl f»tf» rHc 


0.009 


591 


AL049435 


Homo sapiens mRNA; cDNA DKFZp586B0220 (from clone 


0.008 


592 


AC000981 


nuiuu bdpiciib ^buocione z_co irom r*i tiji) una sequence, complete 
sequence 


0.68 


593 


AF304319 


iNeospora camnum small suoumt noosomal RNA and large subunit 
iiu^auiiicu ivtn/\ genes, pamdi sequences ana mtergenic region, plastic 
genes for plastid products 


0.009 


594 


BC006517 


iiuinu atiijicus, oiiiui<u to rrAK oinuing protein, clone 
IMAGE:3546031, mRNA 


7E-43 


595 


AJ249162 


Homo sanien*; tirnmntnr f*nri5inoj*r from TQfV)n o/»no 


2.1 


596 


AJ297708 


Rattus norvegicus RT6 gene for T cell differentiation marker RT6.2, 
exons 1-8 


0.024 


597 


Y11779 


R.prowazekii ygjT-like gene and 4 open reading frames 


0.078 


598 


AE005660 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 279 
of 290 


0.53 


599 


] 

XM_041617 


Homo sapiens RAB1 IB, member RAS oncogene family (RAB1 IB), 
mRNA 


4.8 


600 


AK023165 J 


Homo sapiens cDNA FLJ13103 fis, clone NT2RP3002304 


0.25 
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NO 
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601 


Z69388 


Human DNA seauence from cosmid L60G9A Hnntinaton'*; r>i«*p«* 
Region, chromosome 4pl6.3 contains ESTs and a CpG island 


0.084 


602 


U84760 


Chromatium vinosum dsr locus, complete sequence 


0.022 


603 


AF122050 


Hordeum vulgare limit dextrinase gene, complete cds 


1.8 


604 


AJ308471 


Mus musculus partial gene for acetyl-CoA carboxylase-alpha, 
promoter I 


0.022 


605 


AF158032 


Cixius inflatus 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.008 


607 


XMJH2733 


Homo sapiens KIAA1012 protein (KIAA1012), mRNA 


0.005 


608 


D37781 


Human mRNA for protein-tyrosine phosphatase HPTPeta, complete 
cds 


0.059 


609 


AC002252 


Homo sapiens (subclone 1 g7 from BAC H76) DNA sequence, 
complete sequence 


7E-11 


610 


AF160964 


Peanut witches'-broom DhvtODlasma RNA nolvrnerase siiema factor 
(rpoD) gene, complete cds 


0.63 


611 


U67528 


Methanococcus jannaschii section 70 of 150 of the complete genome 


0.67 


612 


AF063134 


Justicia caudata tRNA-Leu (trnL) gene, chloroplast gene for 
chloroplast RNA, partial sequence 


1.2 


613 


U67527 


Methanococcus jannaschii section 69 of 150 of the complete genome 


1.5 


614 


AK022455 


Homo sapiens cDNA FLJ12393 fis, clone MAMMA1002711 


0.000001 


615 


AF272082 


Poiretia punctata trnK gene, intron, 3' partial sequence; and maturase 
(matK) gene, partial cds; chloroplast genes for chloroplast products 


0.075 


616 


AF 112461 


Homo sapiens G protein-coupled receptor 57 (GPR57) gene, complete 
cds 


0.025 


617 


AF3 10884 


Dictyostelium discoideum RaclB (raclB) gene, complete cds; racK 
pseudogene, complete sequence; and unknown genes 


5.6 


618 


AF348342 


Plasmodium malariae caseinolytic protease C (clpC) gene, partial cds; 
apicoplast gene for apicoplast product 


0.22 


619 


AB043965 


Danio rerio gene for dharma, promoter and partial cds 


0.054 


620 


NC_001839 


Petunia vein clearing virus, complete genome 


0.61 


621 


AK023959 


Homo sapiens cDNA FLJ13897 fis, clone THYRO1001706 


0.67 


622 


AF028835 


Drosophila pallidosa vermilion (v) gene, complete cds 


0.13 


623 


U67598 


Methanococcus jannaschii section 140 of 150 of the complete genome 


0.072 


624 


AF283103 


Pineapple mealybug wilt associated virus-2 polyprotein (ORFla) and 
RNA-dependent RNA polymerase genes, partial cds; and hydrophobic 
protein p5, heat shock protein 70, p46, coat protein, diverged coat 
protein, p20, p22, and p6 genes, complete cds 


1.5 


625 


XM_008161 


Homo sapiens hypothetical protein FLJ20345 (FLJ20345), mRNA 


1E-15 


626 


XM_035599 


Homo sapiens adaptor-related protein complex 4, epsilon 1 subunit 
(AP4E1), mRNA 


6E-22 
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627 


AF143066 


Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 


0.61 


628 


U89283 


Biomphalaria glabrata myoglobin gene, complete cds 


0.46 


629 


U44949 


Xenopus laevis zona pellucida A glycoprotein homolog (xlZPA) 
mRNA, complete cds 


1.6 


630 


AE007395 


Streptococcus pneumoniae section 78 of 194 of the complete genome 


0.15 


631 


BC006963 


Mus musculus, clone IMAGE:3708410, mRNA, partial cds 


1.6 


632 


XM031120 


Homo sapiens peptidylglycine alpha-amidating monooxygenase 
(P AM), mRNA 


4E-40 


633 


AF3 16122 


Kinosternon arizonense isolate KariAZ2 mitochondrial control region, 
complete sequence 


0.69 


634 


U67561 


Methanococcus jannaschii section 103 of 150 of the complete genome 


0.22 


635 


D88010 


Human DNA for ribosomal nrotein S13 comnlete cds TJ14 small 
nucleolar RNA, complete sequence 


0.24 


636 


AE006321 


Lactococcus lactis subsp. lactis IL1403 section 83 of 218 of the 
complete genome 


2.2 


637 


AF045595 


Carassius auratus clone gf-40 glutamic acid decarboxylase isofonn 67 
(GAD67) mRNA, complete cds 


0.74 


638 


D38378 


Human DNA, novel tandem repeat sequence 


0.21 


639 


AF260819 


Plasmodium falciparum CGI protein gene, complete cds 


0.003 


640 


NC_000857 


Ceratitis capitata complete mitochondrial genome 


0.23 


641 


AK015383 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930444M15, full insert sequence 


2.1 


642 


AF286367 


Homo sapiens HMGIY gene, promoter 


1.6 


643 


AK019143 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2600001P13, full insert sequence 


0.69 


644 


AE001421 


Plasmodium falciparum chromosome 2, section 58 of 73 of the 
complete sequence 


2 


645 


D 16904 


Human HepG2 3' region cDNA, clone hmd3a02 


0.21 


646 


XMJ)03799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.69 


647 


NC_001438 


Bean golden mosaic virus A component DNA, complete sequence 


0.66 


648 


AKO 13971 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3110001P07, full insert sequence 


2 


649 


AF078880 


Prosimulium formosum dopa decarboxylase (DDC) gene, partial cds 


0.076 


650 


AJ009600 


Begonia taipeiensis rbcL promoter, clone 16320 


0.22 


651 


AF042834 


Homo sapiens phosphodiesterase delta subunit gene, exons 2, 3 and 4 


5.8 


652 


AF244214 


Dysdera ratonensis 16S ribosomal RNA gene, partial sequence 


0.075 


653 


XM_043252 


Homo sapiens PCAF associated factor 65 alpha (PAF65A), mRNA 


2E-33 
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NO 
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x v ruj \s x-y 


654 


AF105116 


otreptococcus pneumoniae type L,psiycK (cpsivuiy) gene, 
partial cds; putative oligosaccharide repeat unit transporter (cpsl9CJ), 

T TTYP-N^Jir'Ptvl crlnr*nQ5iminp-'?— f ni mprricp' ["mcl Qf^T^^ anH fYiitsitit/f* 
uur i>i a\s\siy l giu^AJocxixxiixv ^^*ljixix&i asc ^paiy^xv^, cxiiLl UUUlUVw 

glucosyl transferase (cpsl9CS) genes, complete cds; and gluc> 


1.9 


655 


AF181720 

ill X %3 X / A^\J 


Hnmn oanipn^ RTT2 fRTT?^ qpiip rnmnlptp pHs* and RTT?^ fftTT?^ 
ppne nartial cds 


0.23 


656 


AL392164 


Human DMA «;pfnipnf*p from plnnp T3P1 1-96QT,6 nn rhmmniomp 

XXU11XCLIX 1V1 "1 il 3W\|Ubllvv VIvllv XVIT X 1 ZV/71-/W V/1X ^xxl \JlXlU3VXlXlb 

cfimiflete sennence ITfoiTio sanien*!! 

vUllll/lvlv ijvUUvIIvv ^XXV/XllV/ k^ClLVlvllkJ J 


0.0003 


657 


AF189720 


Oryctolagus cuniculus chloride channel (CFTR) mRNA, complete cds 


0.18 


658 


AF077R9 1 


f^jiTiic faini linric inHnpiWIp nitrip nviHp cvntlmcp mRNA frvmnlptp r*Hc 

\^dX!XS XuiillllCtl lo lllUIXl^lUlW 1L1 Ll UA1UC ayXllXlaaG 11IXV1N .TA. ? \AJXJXLJXbl.& VsUd 


0.077 


659 


NC_001327 


Ascaris suum mitochondrion, complete genome 


0.074 


660 


yu 003164 


Hnmn sanipn*; KTAA0218 eerie nroduct HCTAA0218. mRNA 


5.8 


661 


M22207 


T.gratilla 217g gene encoding 217g protein, complete cds 


0.071 


662 




ivius muscuius, oiiiuidr 10 coidcior required xur opi iniiiauiipuuixcii 
activation, subunit 6 (77kD), clone IMAGE: 3 5903 09, mRNA, partial 


0.22 


663 


AF007132 


Homo sapiens clone 23551 mRNA sequence 


0.023 


666 


XM_007847 


rioiiio Sapiens iiypuuicut/cii piuieiii xium L-iuiic / y\j / ituj, 
mRNA 


5E-32 


667 


AF00^7SS 


Drosophila melanogaster genomic scaffold 142000013385531, 

UUiiipicic od^uciitc 


5.8 


668 


U70780 


Clostridium botulinum unidentified protein P-48 gene, complete cds 
and neurotoxin binding protein gene, partial cds 


0.075 


669 




Homo sapiens similar to KIAA0773 gene product (H. sapiens) 
fLOC90115. mRNA 


2 


670 


XM_003091 


xxOiiiu sapiens i\ir\Auuui gene pruuiiLi, puuiiive vj-pr(jiciii"cuupicu. 
r^ppntftr' fx irmtpin rvMit^lpH rprpntnr finr T 1 1 YP-fflnr*ncp ^TCTA AOOOl^ 

IC^CUtVil, VJ Lyl \JLv>lll \*\J IX Ly 1 1%^\A^^JWJL XVJ1 \J LSI ^1 LilsVJot^ ^XVX^LrVV/V/V/ X^, 

mRNA 


1.7 


671 


AF284881 


i^arex paucmora uviN/\-inr, pamai sequence, im i -uilu lniergcmc 
spacer, LrviN /\-LiC <uiu li ilt liiicrgciiic spavci, t-umpicic 
cpniiPTi(?p' anH tRNA-Php nartial Qpnupnpp* pnlnrnnlaot pptip*i fnr 
phlnmnla^it nroduct^ 


1.6 


672 


NC_001381 


Plasmid pAL5000, complete sequence 


6.1 


673 


AF081201 


Arabidoosis thaliana villin 1 TVLNl^ mRNA comnlete cds 


0.21 


674 


AK023674 


Homo sapiens cDNA FLJ13612 fis, clone PLACE1010833, weakly 
similar to CALTRACTIN 


0.54 


675 


X60097 


Streptococcus sp. (group C) emm gene for M protein 


0.063 


676 


AF026087 


Schizosaccharomyces pombe Satl gene, partial cds 


0.22 


677 


X00117 


Galago Alu repeat type I, GAL9 


0.0001 


678 


AJ276629 


Rattus norvegicus Sacm21/RT1-A intergenic region, haplotype RTln 
and partial RT1-A gene for MHC Class I antigen 


1.9 


679 


AK021058 


Mus musculus adult male corpus striatum cDNA, RDCEN full-length 
enriched library, clone:C030009O12, full insert sequence 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


680 


BC003727 


Mus musculus, Similar to Interleukin 1 alpha, clone MGC:5780 
IMAGE:3599550, mRNA, complete cds 


1.9 


681 


XM 048261 


Homo sapiens similar to ring finger protein 23; RING-B box-coiled 
coil-B30.2 (M. musculus) (LOC92947), mRNA 


5E-18 


682 


XM_038154 


Homo sapiens KIAA1016 protein (KIAA1016), mRNA 


0.23 


683 


XM_049918 


Homo sapiens similar to hypothetical protein MGC 10940 (H. sapiens) 
(LOC93230), mRNA 


0 


684 


X14727 


Sheep beta-B globin gene 


0.68 


685 


AB052187 


Macaca fascicularis brain cDNA, clone:QnpA-12170 


0.024 


686 


AK004783 


Mus musculus adult male lung cDNA, RDCEN full-length enriched 
library, clone: 1200015E 14, full insert sequence 


5.8 


687 


AE002799 


Drosophila melanogaster genomic scaffold 142000013385566, 
complete sequence 


0.003 


688 


AF190395 


Corymbia calophylla tRNA-Leu and tRNA-Phe genes, partial 
sequence; chloroplast genes for chloroplast products 


1.9 


689 


U57896 


Candida albicans cytochrome C gene (CYC1) gene, complete cds 


0.22 


690 


AE006710 


Sulfolobus solfataricus section 69 of 272 of the complete genome 


2 


691 


AL392164 


Human DNA sequence from clone RP1 1-269L6 on chromosome X, 
complete sequence [Homo sapiens] 


0.0003 


692 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.008 


693 


Y13853 


Drosophila erecta clone Erl inactive Bari-1 family transposon 


0.16 


694 


AB022927 


Oryctolagus cuniculus HAC4 mRNA for hyperpolarization activated 
cation channel, complete cds 


0.66 


695 


AF307327 


Araraquara virus medium RNA segment, G1/G2 glycoprotein 
precursor gene, partial cds 


2 


696 


AF269369 


Staphylococcus epidermidis strain SRI clone step.l002g02 genomic 
sequence 


0.57 


697 


XM_043252 


Homo sapiens PCAF associated factor 65 alpha (PAF65A), mRNA 


2E-33 


698 


XM_008640 


Homo sapiens lethal giant larvae (Drosophila) homolog 1 (LLGL1), 
mRNA 


0.22 


699 


AC024829 


Caenorhabditis elegans cosmid Y55F3BM, complete sequence 


0.023 


700 


M83660 


Dictyostelium discoideum clathrin heavy chain (chcA) mRNA, 
complete cds 


1.9 


701 


X95481 


S.cerevisiae ARC1 gene 


1.9 


702 


AJ408897 


Homo sapiens chromosomal breakpoint fragment derivative 4, 
AF4/MLL fusion, patient UPN006 


4.9 


703 


AF1 10007 


Molgula oculata p68 RNA helicase (bobcat) and transcription factor 
(manx) genes, complete cds 


0.68 


704 


AL133339 


Human DNA sequence from clone RP4-807J1 on chromosome 20. 
Contains a GSS and a CpG island, complete sequence [Homo sapiens] 


2 


705 


U10116 


Human superoxide dismutase (SOD3) gene, complete cds 


0.23 


706 


AB037771 


Homo sapiens mRNA for KIAA1350 protein, partial cds 


0 
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SEQID 
NO 


ACCESSN 


DESCWP 


P VALUE 


707 


AF023840 


Homo saniens natural killer ptouti nrntein 9- A (Knccii- a\ n^n^ 
complete cds 


0.68 


708 


U49023 


L tgwi-i^uia iiLacuiaui c Audited, cyiucnrome oxiaase suDumt l, 
cytochrome oxidase subunit II genes, mitochondrial genes encoding 
mitochondrial oroteins oartial cds and tRNA-T ph <rpnp 
mitochondrial gene 


0.001 


709 


XM_044768 


Homo saniens similar to Heath rt*rpntnr f\ (Vk ca-ni^nc^ n nro^ft^ 
mRNA 


e-izo 


710 


D50006 


Human DNA for alpha-platelet-derived growth factor receptor, exon 6- 
10 


2.1 


711 


Y14524 


Tetrahvmena thermonhila tRNA-Tvr ppnp pvnn<; l »nH 9 Hr»np 
pTetY2 




712 


AL592159 


Human DNA sequence from clone RP13-237P14 on chromosome X, 
complete sequence [Homo sapiens] 


0.52 


713 


U88825 


HW-1 isolate 92NG003 from Nigeria complete genome 


3.1 


714 


U05813 


Blastocrithidia culicis ATCC 30268 kinetoplast (MURF1) and 
aooevtochrome B ( cvtB") eenes partial cds and ATPasp snhnnit f\ 
(MURF4) gene, complete pre-RNA-edited DNA sequence 


0.23 


715 


AF048990 


Homo sapiens MutS homolog 5 (MSH5) gene, exons 11 and 12 


0.0009 


716 


AF3 17082 


Adoncholaimus thalassophygas isolate Sou39 large subunit ribosomal 
RNA cene oartial seauence* mitochondrial ffene for mttorhnnHrial 
product 


0 0008 


717 


AKO 13269 


Mus musculus 10 11 davs embrvo cDNA RTKEN fnll-lenpth 
enriched library, clone:2810439K08, full insert sequence 


0.024 


718 


L14769 


Phvsarum DolyceDhalum cvtochrome oxidase subunit 1 mRNA 
complete cds; mitochondrial gene for mitochondrial product 


1.6 


720 


U45453 


Xenopus laevis transcription factor xGATA-4 mRNA, complete cds 


0.61 


721 


AF090432 


Danio rerio serrateB mRNA, complete cds 


0.075 


722 


M62504 


Spiroplasma citri fibril protein gene, complete cds 


0 094. 


723 


AK000939 


Homo sapiens cDNA FLJ10077 fis, clone HEMBA1001864 


0 


724 


XM 032587 


Homo sapiens 2127 (KIAA1754), mRNA 


n 99 


726 


AF012899 


Sambucus nigra ribosome inactivating protein precursor mRNA, 
complete cds 


0 000001 


727 


S82175 


RAR-gamma 2.2=retinoic acid receptor gamma isoform 2.2 

if alternative lv solicedl TXenoous laevis embrvos staee 94. mRNA 

2154 nt] 


0.008 


728 


AK002856 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:06 10040 A09, full insert sequence 


0.44 


729 


AF077828 


Oryctolagus cuniculus indolethylamine N-methyltransferase (INMT) 
gene, exon 3 and complete cds 


1.4 


730 


BC006633 


Mus musculus, Flt3 interacting zinc finger protein 1, clone 
MGC:7201 IMAGE:3482191, mRNA, complete cds 


1.3 


731 


AK024670 


Homo sapiens cDNA: FLJ21017 fis, clone CAE05907 


0.5 


732 


Z35823 


S.cerevisiae chromosome II reading frame ORF YBL062w 


1.1 


733 


Z48636 


C.novyi gene for alpha-toxin 


0.065 
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SEQID 
NO 


ACCESSN 




1** 1 T 1 X T 1 1 l 

P VALUE 


734 


BC008580 


Homo sapiens, clone IMAGE;4 179986, mRNA, partial cds 


0.71 


735 


L37746 


Homo sapiens lamin Bl gene, exon 10 


0.11 


736 


AF142372 


r-Liiuxia iuv^u^i yyiunuumc u gene, pdiiidi cos, nuiocnonQriai gene tor 
mitochondrial product 


0.95 


737 


AF107454 


Homo sapiens clone cD622 mRNA sequence 


0.48 


738 


XM 010112 


nvjinu bct^icii^ buiuic uimcr ianuiy o vneuroiransniiixer transporter}, 
member 14 (SLC6A14), mRNA 


0.5 


739 


AF 188067 


Arabidopsis thaliana clone AtDP5 isopentenyl 
pjriv/|jiiuapiuiic.tuiucuiyii<iiiyi pyrupnoapnaie isomerase (lpi/vtzj 
mRNA narfial rrfc 


l.O 


740 


X90383 


A thaliana DNA for VI 3 pptip 




741 


AE004309 


viuiiu ciiuiciiie ciiromosome i, seciion zi / or Zji oi tne complete 
chromosome 


0.18 


742 


AF236642 


v^diuuiiix pdneuna cione iuz-z/\ ioia-zjo internal transcribed spacer, 
complete sequence; and tRNA-Ile and tRNA-Ala genes, complete 

liwu Uvllwv 


l.l 


744 


AF107699 


HomO <mnieHQ "RrWfir'? nrotPin rRTXTf^TV m=»n#» narfiol r*Ac 
iiumu oa^Jiwiio xviinvjj jJlULClli ^XxJ.i> vJJ ) gCllC, pdlUal COS 


O f\^"7 
U.UA 1 


745 


XM 035624 


Homo QanifMlQ TCTA A0R7fi nrntpi n fkTl A AnR J 7^;\ rnDMA 


1 A 


746 


XM 001795 


Homo ^aniettQ lamin R rpr^ntnr /T RR'V mPTSJA 


O 1 < 

U.l D 


747 


AF395180 


rtuiiciiciia uit-uiui cyiuciiiunic uxiutisc suuumi i gene, paruai cos,. 

mitochondrial pf*ne* for m i tor* hnnH rial r»rr»Hnr , t 


O fiQI 


748 


AF262633 


Homo sapiens clone NIGMS NA1 1418 chromosome 15 paralogous 
^ennetice variant ffptiomir* cpnnpnrp 




749 


AF086375 


Homo saniens fiill leneth in^rt rDNA Honp 7T>fiRRl 9 

ixviuu uUJ/ivlU 1U11 IvllgUl lllOvlL vL/l~X\ wlVSHCr £jL/UOJJX^ 


U.UUUUU3 


750 


AJ271002 


Mus musculus TFFl/pS2 gene for Trefoil Factor l/pS2, exons 1-3 


0.8 


751 


AF063681 


l\4us musculus tvne VTTT follaa^n /rnll^jil^ cr^n** pvnnc on 

ivj.uo iiiuo^uius ijf^/^f vviii wuilagvll ^UllJal^ gCIlC, CAVJILb 1 


U.jj 


752 


AL031634 


Caenorhabditis elegans cosmid Y32B12C, complete sequence 


0.053 


753 


Y00165 


Sns scrofa mRNA for link - nrotpin 

JvivlH 11HVL ^ 1U1 1111.KV UlUlvlll 


O OA^ 


754 


AP000693 


Homo sapiens genomic DNA, chromosome 21q22.2, PCR fragment 
from BAC clone*KB739C1 1 CRR 1 -NT rpoinn 


u.uzz 


755 


AK021535 


Homo sapiens cDNA FLJ11473 fis, clone HEMBA1001712 


0.064 


756 


AF305539 


ivxua inusuuiud jj ijiiipiiuuyic liiuuLcu maiurauon proiein i ^"romi ) 
gene, exon 8 and complete cds 


4E-12 


757 


J05493 


S.pombe no message in thiamine protein (nmtl) gene, complete cds 


0.55 


758 


AE001480 


T-Tf k lif*ohar , t^r nvlori ctrain TOO cf»r , tirm A. 1 r%f m r>f tVio /«nmn1ofo 
xa^iicuLra^>Ld pyiuii, on <uu J7? accLion fi oi ijz oi uie compicie 

genome 


0.51 


759 


AF072685 


Bos taurus matrix metalloproteinase 13 (MMP13) mRNA, complete 
cds 


0.072 


760 


XM_038408 


Homo sapiens hypothetical protein FLJ1 133 1 (FLJ1 133 1), mRNA 


0.6 


761 


XM_041148 


Homo sapiens hypothetical gene supported by AF037219 
(LOC91856), mRNA 


1.7 


762 


AF234897 


Haemulon plumieri Hpl224 NADH dehydrogenase I (ND-1) gene, 
complete cds; mitochondrial gene for mitochondrial product 


3 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


763 


NM_008399 


Mus musculus integrin, alpha E, epithelial-associated (Itgae), mRNA 


1.4 


764 


Z74898 


S.cerevisiae chromosome XV reading frame ORF YOL156w 


1.8 


765 


AF280812 


Glycine max putative Hsl pro- 1 -like receptor mRNA, complete cds 


0.59 


766 


AB013615 


Broad bean wilt virus 2 genomic RNA, complete sequence of RNA1 


0.68 


768 


AF079169 


Nicotiana tabacum feedback-insensitive anthranilate synthase alpha-2 
chain (ASA2) gene, promoter region and 5UTR 


5.9 


769 


ABO 14079 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, Cosmid clone:TYlEll, complete sequence 


0.007 


770 


XM_012336 


Homo sapiens MBEP protein (MB IP), mRNA 


0.0003 


771 . 


U41509 


Caenorhabditis elegans cosmid W03H1, complete sequence 


0.18 


772 


U23478 


Dictyostelium discoideum phosphatidylinositol-4,5-diphosphate 3- 
kinase (PDC3) mRNA, partial cds 


0.00002 


773 


AP000380 


Arabidopsis thaliana genomic DNA, chromosome 5, genomic PCR 
product GA469 


0.39 


774 


AF1 15772 


Danio rerio basic helix-loop-helix transcription factor Ndrla (ndrla) 
mRNA, complete cds 


0.2 


775 


XM_007458 


Homo sapiens butyrate response factor 1 (EGF-response factor 1) 
(BRF1), mRNA 


0.42 


776 


L31955 


Human (clone 1NQ3-2) normalized cDNA library sequence 


1.6 


777 


D49711 


Populus kitakamiensis (P. sieboldii X P. grandidentata) homt3 gene 
for caffeic acid O-methyltransferase, complete cds (exonl-4) 


0.046 


778 


L49234 


Manduca sexta chitinase precursor gene, complete cds 


0.05 


779 


AY008848 


Ictalurus punctatus clone Icpu-UA/3 MHC class I antigen gene, 
complete cds 


0.015 


780 


AB048375 


Oserya coulteriana chloroplast matK gene for maturase K, complete 
cds 


1.9 


781 


XM_0 10323 


Homo sapiens toll-like receptor 7 (TLR7), mRNA 


0.21 


782 


X69121 


P. falciparum Pfgrp mRNA for heat shock protein 


0.18 


783 


AE001570 


Helicobacter pylori, strain J99 section 131 of 132 of the complete 
genome 


0.05 


784 


M31756 


Human t(9;22) chromosome 9 breakpoint DNA 


0.77 


785 


M37889 


Staphylococcus aureus replication (rep), control of replication (cop), 
and resistance protein (QacC) genes, complete cds 


0.2 


786 


AF091326 


Plasmodium falciparum histone deacetylase (HDAC1) mRNA, 
complete cds 


0.19 


787 


Z74851 


S.cerevisiae chromosome XV reading frame ORF YOL109w 


0.2 


788 


Z85396 


Hsapiens Ig lambda light chain variable region gene (34-34SWIIF32) 
rearranged; Ig-Light-Lambda; VLambda 


0.56 


789 


AF075681 


Hemorrhagic enteritis virus hexon protein gene, partial cds 


2.1 


790 


AY029613 


Mus musculus GTP binding protein 3 gene, complete cds; nuclear 
gene for mitochondrial product 


5.8 


791 


NC 001338 


Sulfolobus virus 1, complete genome 


0.63 
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SEQID 
NO 
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DESCRIP 


P VALUE 


792 


U67563 


Methanococcus jannaschii section 105 of 150 of the complete genome 


0.11 


794 


U67549 


Methanococcus jannaschii section 91 of 150 of the complete genome 


0.21 


795 


AF293450 


Cards fanuliaris BPAGl-like gene sequence 


0.53 


797 


XM_029942 


Homo sapiens KIAA0877 protein (KIAA0877), mRNA 


1.4 


798 


BC009775 


Homo sapiens, Similar to hypothetical protein FLJ 10842, clone 
MGC: 13398 IMAGE:4092662, mRNA, complete cds 


0.061 


799 


NM_008399 


Mus musculus integrin, alpha E, epithelial-associated (Itgae), mRNA 


0.24 


800 


NC_001733 


Methanococcus jannaschii small extra-chromosomal element, 
complete sequence 


2 


801 . 


Z98532 


S.pombe chromosome I cosmid clBl 


0.66 


802 


AL158132 


Human DNA sequence from clone RP3-323A24 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence [Homo 
sapiens] 


0.003 


803 


XM_007847 


Homo sapiens hypothetical protein from clone 24796 (LOC57146), 
mRNA 


5E-32 


804 


XM_0 15759 


Homo sapiens hypothetical protein MGC2615 (MGC2615), mRNA 


1E-48 


805 


XM 003799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.71 


806 


AF098697 


Mandarina mandarina from Anijima population 16S ribosomal RNA 
gene, mitochondrial gene encoding mitochondrial product, partial 
sequence 


0.086 


807 


AE007509 


Streptococcus pneumoniae section 192 of 194 of the complete genome 


1.6 


808 


AY035064 


Arabidopsis thaliana unknown protein (T20O10_100/AT3g63000) 
mRNA, complete cds 


0.2 


809 


AE000595 


Helicobacter pylori 26695 section 73 of 134 of the complete genome 


0.046 


810 


Z75093 


S.cerevisiae chromosome XV reading frame ORF YOR185c 


0.11 


811 


M30825 


Drosophila melanogaster ovarian tumor protein isoforms (otu) gene, 
complete cds, alternatively spliced 


0.11 


812 


Y17004 


Suberites domuncula mRNA for monoubiquitin 


0.005 


813 


BC005104 


Homo sapiens, Similar to RUCEN cDNA 5830420C20 gene, clone 
IMAGE:3633379, mRNA, partial cds 


1.7 


814 


XM 029941 


Homo sapiens cadherin-like 22 (CDH22), mRNA 


0.0006 


815 


NM_003246 


Homo sapiens thrombospondin 1 (THBS1), mRNA 


0.23 


816 


U38414 


Cards familiaris junctional sarcoplasmic reticulum protein mRNA, 
complete cds 


0.21 


on 
ol / 


XM_004000 


Homo sapiens chromodomain helicase DNA binding protein 1 
(CHD1), mRNA 


0.63 


818 


AF006062 


Cryptococcus neoformans var. grubii strain H99 urease (URE1) gene, 
complete cds 


1.7 


819 


XM J) 18303 


Homo sapiens hypothetical protein FLJ11565 (FLJ11565), mRNA 


0.07 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


820 


AE001164 


Borrelia burgdorferi (section 50 of 70) of the complete genome 


1.6 


821 


AF022969 


Caenorhabditis elegans cosmid C29G2 


0.2 


822 


AK021462 


Homo sapiens cDNA FLJ1 1400 fis, clone HEMBA1000673 


1.6 


823 


Y10908 


B.cereus cysA and wapA genes 


0.0003 


824 


U36892 


Triticum aestivum clone AC29 microsatellite sequence 


3.4 


825 


AF258615 


Dictyostelium discoideum LagC protein (lagC) gene, complete cds 


0.011 


826 


Z99763 


Flaveria pringlei gdcsH gene 


0.023 


827 


X71543 


B.taurus microsatellite sequence INRA118 


7E-15 


828 


AF349526 


Arabidopsis thaliana putative 60S ribosomal protein (T6C23.18) 
mRNA, complete cds 


1.7 


829 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.007 


830 


XM_006470 


Homo sapiens radixin (RDX), mRNA 


2.4 


831 


AF191609 


Homo sapiens filamin (FLNB) gene, exon 21 


e-140 


832 


AF064754 


Homo sapiens nuclear inhibitor of protein phosphatase-1 (PPP1R8) 
gene, exon 4 


5.5 


833 


NC_001438 


Bean golden mosaic virus A component DNA, complete sequence 


0.57 


834 


D14567 


Penicillium urticae mitochondrial 1-rRNA (large rRNA) gene and its 
flanking region 


0.23 


835 


AP000301 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:D16B8, complete sequence 


0.5 


836 


NM_014147 


Homo sapiens HSPC047 protein (HSPC047), mRNA 


1E-90 


837 


AF296100 


Porcine teschovirus strain DS 562/91 polyprotein gene, complete cds 


1.3 


838 


AK020740 


Mus musculus adult male spinal cord cDNA, RIKEN full-length 
enriched library, clone:A330106M24, full insert sequence 


1.6 


839 


XM_045585 


Homo sapiens hypothetical protein FLJ11085 (KIAA1434), mRNA 


3E-16 


840 


AF115104 


Homo sapiens glutaredoxin (GLRX) gene, exon 1 


1.3 


841 


AF094673 


Mus musculus Sosl gene, intron 


3E-26 


842 


AK025101 


Homo sapiens cDNA: FLJ21448 fis, clone COL04473 


5E-93 


843 


M85276 


Homo sapiens NKG5 gene, complete cds 


0.039 


844 


AK022390 


Homo sapiens cDNA FLJ12328 fis, clone MAMMA 1002 145 


0.018 


845 


AF269767 


Staphylococcus epidermidis strain SRI clone step.l023b06 genomic 
sequence 


0.47 


846 


NMJ)05916 


Homo sapiens minichromosome maintenance deficient (S. cerevisiae) 
7 (MCM7), mRNA 


4.5 


847 


AF209976 


Rattus norvegicus stromal cell-derived factor 1 (SDF-1) mRNA, 
complete cds 


1.3 


848 


NM_0 17078 


Rattus norvegicus Acetylcholine receptor alpha 5 (Chrna5), mRNA 


0.95 


849 


AF269324 


Staphylococcus epidermidis strain SRI clone step.l002al2 genomic 
sequence 


0.16 
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SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


850 


AK008669 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched library, clone:2210008F06, full insert sequence 


4.3 


851 


XM_044217 


Homo sapiens Ran binding protein 1 1 (LOC51 194), mRNA 


0.44 


852 


XM 011510 


Homo sapiens CDC5 (cell division cycle 5, S. pombe, homolog)-like 
(CDC5L), mRNA 


0.056 


853 


AF068574 


Arabidopsis thaliana ferulate-5-hydroxylase (F5H) gene, complete cds 


0.042 


854 


AE003072 


Drosophila melanogaster genomic scaffold 142000013386025, 
complete sequence 


1.4 


855 


AF 159462 


Escherichia coli EHEC factor for adherence (efal) gene, complete cds 


0.42 


856 


AF 110797 


Rattus norvegicus GABAB1 receptor (GABABR1) gene, exons 10 
through 20 and alternatively spliced products, complete cds 


1.6 


857 


AF3 19172 


Homo sapiens cGMP-specific phosphodiesterase type 5 A (PDE5A) 
gene, partial cds 


0.6 


858 


U90299 


Human chromosome 10 duplicated adrenoleukodystrophy (ALD) 
gene, 3* breakpoint sequence 


0.0000003 


859 


AL050065 


Homo sapiens mRNA; cDNA DKFZp566M043 (from clone 
DKFZp566M043) 


0.006 


860 


U32802 


Haemophilus influenzae Rd section 1 17 of 163 of the complete 
genome 


1.4 


861 


AK001127 


Homo sapiens cDNA FLJ10265 fis, clone HEMBB1001014 


0.17 


862 


AB042995 


Plecoglossus altivelis DNA, microsatellite, clone:Pal-3* 


0.45 


863 


NM_004668 


Homo sapiens maltase-glucoamylase (alpha-glucosidase) (MGAM), 
mRNA 


0.001 


864 


NM_021697 


Rattus norvegicus neuronal potassium channel alpha subunit (Kv8.1), 
mRNA 


0.19 


865 


Z28173 


S.cerevisiae chromosome XI reading frame ORF YKL173w 


0.035 


866 


XM_051783 


Homo sapiens hypothetical protein FLJ11712 (FLJ11712), mRNA 


1.1 


867 


X92498 


M. musculus fkh-6 gene 


0.99 


868 


XM_041339 


Homo sapiens similar to phosphodiesterase 4D interacting protein 
(myomegalin) (H. sapiens) (LOC91892), mRNA 


1.5 


869 


AL590401 


Human DNA sequence from clone RP1 1-466P12 on chromosome 6, 
complete sequence [Homo sapiens] 


0.16 


870 


Z68758 


Human DNA sequence from clone LL22NC03-85E10 on chromosome 
22 Contais GSSs, complete sequence [Homo sapiens] 


0.021 


871 


Z34916 


O.aries DNA for natural resistance associated macrophage protein 


4.6 


o 12. 


NM_009253 


Mus musculus serine protease inhibitor-2 related sequence 1 (Spi2- 
rsl), mRNA 


1.5 


873 


AY024283 


Oryza sativa microsatellite MRG6608 containing (TGTA)X6, 
genomic sequence 


3.8 


874 


Z36108 


S.cerevisiae chromosome II reading frame ORF YBR239c 


1.6 


875 


XM 008791 


Homo sapiens hypothetical protein (LOC51320), mRNA 


0.14 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


876 


X89383 


Rnorvegicus mRNA for SNFl-related kinase 


0.18 


877 


XM 002378 


Homo sapiens uncharacterized hypothalamus protein HT010 (HT010), 
mRNA 


0 


879 


XM_047537 


Homo sapiens DKFZP434C153 protein (DKFZP434C153), mRNA 


0.003 


880 


XM 006700 


Homo sapiens checkpoint with forkhead and ring ringer domains 
(CHFR), mRNA 


4E-27 


881 


NC_002756 


Mertensiella luschani mitochondrion, complete genome 


2.1 


882 


M90915 


Human immunodeficiency virus type 1, viral sample LC0LDA1 1, V3 
region 


0.019 


883 


AB011534 


Rattus norvegicus mRNA for MEGF8, partial cds 


0.19 


885 . 


AF364468 


Simian-Human immunodeficiency virus clone 201/26w 1 envelope 
glycoprotein (env) gene, partial cds 


1.4 


886 


XM 005320 


Homo sapiens cerberus 1 (Xenopus laevis) homolog (cysteine knot 
superfamily) (CER1), mRNA 


0.63 


887 


AC079875 


Homo sapiens clone CTD-2333B13, complete sequence 


0.024 


888 


NM_032394 


Mus musculus myosin Vllb (Myo7b), mRNA 


6.6 


889 


AF209448 


Borrelia hermsii plasmid cp32-like EcoRI-cut fragment #3 


1.8 


890 


AK008242 


Mus musculus adult male small intestine cDNA, RIKEN full-length 
enriched library, clone:2010015A19, full insert sequence 


0.065 


891 


AF305851 


Homo sapiens clone 16 ribosomal RNA intergenic spacer region, 
partial sequence 


2 


892 


XM_047604 


Homo sapiens coproporphyrinogen oxidase (coproporphyria, 
harderoporphyria) (CPO), mRNA 


0.22 


893 


AF211133 


Carsonella ruddii natural-host Ctenarytaina eucalypti tryptophanyl- 
tRNA synthetase (trpS) gene, partial cds; 16S ribosomal RNA, 23S 
ribosomal RNA, and 5S ribosomal RNA genes, complete sequence; 
transaldolase (tal) and alkyl hydroperoxide reductase smal> 


0.2 


894 


AY047513 


Drosophila melanogaster GH01880 full length cDNA 


8.8 


896 


AL158132 


Human DNA sequence from clone RP3-323A24 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence [Homo 
sapiens] 


0.0001 


897 


Z21978 


B.napus Myr2.Bnl gene encoding myrosinase, thioglucoside 
glucohydrolase 


1.4 


898 


X95904 


L.esculentum mRNA for 14-3-3 protein, TFT6 


0.81 


899 


Z48930 


C.caldarium mitochondrial genes (lOkb) 


0.27 


900 


XM_038535 


Homo sapiens KIAA1074 protein (KIAA1074), mRNA 


0.4 


901 


AF016898 


Homo sapiens B-ATF gene, complete cds 


0.002 


902 


XMJ)51951 


Homo sapiens similar to PRO0097 protein (H. sapiens) (LOC93544), 
mRNA 


0.003 


903 


X89383 


R. norvegicus mRNA for SNFl-related kinase 


0.23 


904 


AF056336 


Danio rerio bone morphogenetic protein 4 precursor (BMP4) gene, 
complete cds 


0.002 


905 


U34339 


Protopterus aethiopicus 28S ribosomal RNA gene, partial sequence 


1E-09 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


906 


XM_041974 


Homo sapiens hypothetical gene supported by AK025102 
(LOC91974), mRNA 


0.06 


907 


D 14567 


Penicillium urticae mitochondrial 1-rRNA (large rRNA) gene and its 
flanking region 


0.26 


908 


AL035632 


Drosophila melanogaster BAC clone BACN32G1 1 


0.52 


909 


AF220354 


Drosophila melanogaster mitotic kinesin-like motor protein CENP- 
ana (cana) mRNA, complete cds 


0.86 


910 


U67484 


Methanococcus jannaschii section 26 of 150 of the complete genome 


2.6 


911 


Y13853 


Drosophila erecta clone Erl inactive Bari-1 family transposon 


0.17 


912 


AL137643 


Homo sapiens mRNA; cDNA DKFZp586A2324 (from clone 
DKFZp586A2324) 


1.3 


913 


AE006618 


Streptococcus pyogenes Ml GAS strain SF370, section 147 of 167 of 
the complete genome 


1.4 


914 


Z46663 


M.musculus DNA for growth hormone gene and promoter 


0.005 


915 


AF197225 


Emericella nidulans TUPA (tupA) gene, complete cds 


0.65 


916 


L20594 


Human annexin III (ANX3) gene, exon 4 


0.028 


917 


AB040730 


Clostridium haemolyticum genes for 16S rRNA, tRNA-Ala, tRNA-Ile, 
23 S rRNA, partial and complete sequence 


0.31 


918 


L10162 


Human variable number tandem repeat (VNTR) region, allele 13R4 3' 
to collagen type II (COL2A1) gene 


0.19 


919 


L35061 


Bacteriophage phi-41 orfs LI- 12, complete cds's, orf 13, 5* end 


0.75 


920 


AK018618 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130014E20, full insert sequence 


0.72 


921 


AC024865 


Caenorhabditis elegans cosmid Y74C10AL, complete sequence 


0.083 


922 


Z65965 


H. sapiens CpG island DNA genomic Msel fragment, clone 69g7, 
reverse read cpg69g7.rtlb 


0.46 


923 


AF364335 


Influenza A virus (A/Goose/Guangdong/3/97(H5Nl)) segment 6 
neuraminidase (NA) gene, complete cds 


0.46 


924 


M83534 


A.thaliana isocitrate lyase gene, 5* end 


0.003 


925 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0.0001 


926 


AK023864 


Homo sapiens cDNA FLJ13802 fis, clone THYRO1000186 


0 


927 


XM_010575 


Homo sapiens LOC87729 (LOC87729), mRNA 


1.3 


928 


L16771 


Drosophila simulans suppressor of forked (su(f)) gene sequence 


0.17 


931 


BC001815 


Homo sapiens, clone IMAGE:2989261, mRNA 


2E-33 


932 


BC008580 


Homo sapiens, clone IMAGE:4179986, mRNA, partial cds 


1.4 


933 


D32063 


Human gene for 2-oxoglutarate dehydrogenase, exon 14, 15, 16, 17, 
18 and 19 sequence 


0.058 


934 


XM_028551 


Homo sapiens hypothetical gene supported by AK026706 
(LOC90063), mRNA 


0.003 


935 


M87298 


Chicken neuropeptide Y gene, exons 1-3 


2.1 


936 


AE006100 


Pasteurella multocida PM70 section 67 of 204 of the complete genome 


1.5 


937 


L77040 


Homo sapiens (subclone 8_cl 1 from PI H22) DNA sequence 


9E-14 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


938 


AFO 16349 


Homo sapiens hexokinase I (HK1) gene, exon 1R and exon 1, 
alternatively spliced 


2 


939 


BC001393 


Homo sapiens, hypothetical protein, clone MGC:782 
IMAGE:3051397, mRNA, complete cds 


0.054 


940 


AK024092 


Homo sapiens cDNA FLJ14030 fis, clone HEMBA1004086 


0.025 


941 


AJ244013 


Ectromelia virus proviral spi-3 gene for serine proteinase inhibitor, 
strain Hampstead (egg) 


4.3 


942 


AK000184 


Homo sapiens cDNA FLJ20177 fis, clone COL09966, highly similar 
toY08136H 


0 


943 


U67528 


Methanococcus jannaschii section 70 of 150 of the complete genome 


0.22 


944 


AF065599 


Schistosoma mansoni annexin mRNA, complete cds 


0.64 


945 


AK020851 


Mus musculus adult retina cDNA, RIKEN full-length enriched 
library, clone:A930013F10, full insert sequence 


2.7 


946 


AF067228 


Buchnera aphidicola bacterioferritin comigratory protein (bcp) gene, 
partial cds; chorismate synthase (aroC), ATP 
phosphoribosyltransferase (hisG), histidinol dehydrogenase (hisD), 
histidinolphosphate aminotransferase (hisC), 
imidazoleglycerolphosphate> 


0.021 


947 


XM 049755 


Homo sapiens hypothetical gene supported by AL137578 
(LOC93 193), mRNA 


0.19 


948 


M14657 


Bovine thrombomodulin mRNA, 3' end 


0.004 


949 


AF312917 


Plasmodium falciparum reticulocyte binding protein 2 homolog B 
gene, complete cds 


4.6 


950 


U13067 


Drosophila melanogaster Twin of m4 (Tom) gene, complete cds; and 
Brd gene, promoter region and 5'UTR, partial sequence 


0.025 


951 


AL1 13476 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


4.1 


952 


XM 048492 


Homo sapiens talin 1 (TLN1), mRNA 


5E-50 


953 


M19686 


Bovine interstitial retinol binding protein (IRBP) mRNA, 3 1 flank 


0.0003 


954 


M69188 


Cotton legumin A D-genome alloallele gene, complete cds 


0.072 


955 


AK016519 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932411G06, full insert sequence 


0.65 


956 


X53545 


T. cruzi mRNA for SA85-1.1 protein 


0.023 


957 


U67557 


Methanococcus jannaschii section 99 of 150 of the complete genome 


0.015 


958 


AL136165 


Human DNA sequence from clone RP5-863M1 1 on chromosome 6, 
complete sequence [Homo sapiens] 


7E-12 


959 


AE006697 


Sulfolobus solfataricus section 56 of 272 of the complete genome 


2 


960 


AB056323 


Macaca fascicularis brain cDNA, clone:QflA-10392 


0.56 


961 


XM_015216 


Homo sapiens cytochrome b5 outer mitochondrial membrane 
precursor (CYB5-M), mRNA 


0.25 


962 


U88513 


Flexamia pyrops NADH dehydrogenase 1 gene, mitochondrial gene 
encoding mitochondrial protein, complete cds 


0.025 


963 


NMJ)07869 


Mus musculus DnaJ-like protein 1 (Dnajll), mRNA 


1.5 


964 


AE002349 


Chlamydia muridarum, section 76 of 85 of the complete genome 


0.51 
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SEQID 










NO 


ACCESSN 


DESCRIP 


P VALUE 




965 


AF039714 


Caenorhabditis elegans cosmid T13E5 


1.8 




966 


AF111813 


Plasmodium falciparum 1 -deoxy-D -xylulose 5 -phosphate 
reductoisomerase (dxr) gene, complete cds 


1.3 








Escherichia coli K12 MG1655 section 315 of 400 of the complete 






967 


AE000425 


genome 


0.002 




968 


AF108658 


Homo sapiens p5327 mRNA, complete cds 


0.57 




969 


X82329 


A.hypogaea chi2.1 gene for chitinase (class II) 


3.6 




970 


AK026575 


Homo sapiens cDNA: FLJ22922 fis, clone KAT06722 


1.5 




971 


ABO 16605 


Bombyx mori gene for peptidoglycan recognition protein, complete 
cds 


0.0002 




972 


AB009629 


Avian rotavirus RNA for VP1, complete cds 


1.4 




973 


AK020413 


Mus musculus 12 days embryo embryonic body between diaphragm 
region and neck cDNA, RIKEN full-length enriched library, 
clone:9430013L14, full insert sequence 


0.095 




974 


AF013754 


Gallus gallus transcription factor RREB-1 (rreb-1) mRNA, complete 
cds 


3.4 








Trypanosoma brucei kinetoplast DNA maxicircle fragment encoding 
two very small ribosomal RNAs, a 12S rRNA homologous to E.coli 






975 


V01390 


23S rRNA and a 9S rRNA homologous to 16S rRNA 


0.18 


y. 3 






Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 






976 


AL1 12360 


deprivation 


0.22 


«-« 
3 


977 


AF389904 


Danaus chrysippus orientis 12S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.022 


ff\ 


979 


X96999 


P. vulgaris YprlO gene 


1.5 


8 """3 


980 


AL1 10480 


Caenorhabditis elegans cosmid Y24F12A, complete sequence 


0.16 








Weldenia Candida chloroplast tRNA-Leu intron and trnL-F intergenic 






981 


AJ387746 


spacer 


1.8 








Plasmodium falciparum chromosome 2, section 14 of 73 of the 






982 


AE001377 


complete sequence 


0.002 




983 


U26946 


Gallus gallus inhibin beta A subunit mRNA, complete cds 


2.1 




984 


AC091651 


Homo sapiens clone RP1 1-17D6, complete sequence 


2E-76 




985 


AF143382 


Arabidopsis lyrata pistillata gene, complete cds 


0.02 








E. histolytica plasmid genes for ribosomal RNA and hemolysins 






986 


Z29969 


HLY1, HLY5mcl HLY5mc2 HLY4 


0.02 




987 


XM_001301 


Homo sapiens KIAA0469 gene product (KIAA0469), mRNA 


0.022 




988 


AB007638 


Bacillus subtilis genomic DNA containing gutA to cotA region, 48 
degree 


0.021 




989 


XM_0 12394 


Homo sapiens myosin IXA (MY09A), mRNA 


7E-09 




990 


AF258322 


Zenaida galapagoensis beta fibrinogen gene, intron 7 and partial cds 


5 








Homo sapiens similar to coagulation factor VIII, procoagulant 






991 


XM_048778 


component (hemophilia A) (H. sapiens) (LOC93021), mRNA 


0.73 








Macaca fascicularis brain cDNA clone:QflA-15307, full insert 






992 


AB056815 


sequence 


6.4 
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SEQIE 
NO 
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993 


AJ403948 


Homo sapiens partial SLC22A3 eene for omanir ratinn trancnnrt^r i 
exons 3-4 


0.008 


994 


AF069188 


Ephedrus laevicollis NADH dehvdroeena«;e 1 m»np mitnrhnTiHriai 
gene encoding mitochondrial protein, partial cds 


0.082 


995 


AF282920 


Staphylococcus aureus auinone oxidoredncta<;e and alvrinf* H^tQin^ 
transporter genes, complete cds 


6.6 


996 


AB031325 


Homo sapiens gene for calcium -sensine recentor exonQ nrnmnter 
region 


0.081 


997 


AE001421 


Plasmodium falciparum chromosome 2, section 58 of 73 of the 
complete sequence 


0 74 


998 


AF323615 


Rattus norvegicus phosphoinositide-sneciflc ohosnholinaQp C pn<:ilnn 
mRNA, complete cds 


9 'K 


- 

999 


AF305103 


Picea abies clone 1PABC5D3 hypermethylated genomic sequence 


0.25 


1000 


AY022117 


Oryza sativa microsatellite MRG4442 containing fTClX 17 clo«ie<tf tn 
marker RG450, genomic sequence 


0.7 


1001 


NM 023324 


Mus musculus pellino 1 (Pelil), mRNA 




1002 


X06329 


Mouse repetitive sequence LIMd-F type 5 'end (1G-6) 


0.73 


1003 


AE001173 


Borrelia burgdorferi (section 59 of 70) of the complete genome 


0 94 


1004 


AK021535 


Homo sapiens cDNA FLJ1 1473 fis, clone HEMBA1001712 


0 9S 


1005 


U90529 


Plasmodium vivax SV-5 Pv200 hieh-bindine reeion I eene nartial cdc 


0 94 


1006 


XM_003560 


Homo sapiens MAD2 (mitotic arrest deficient, yeast, homolog)-like 1 
(MAD2L1), mRNA 


4F-96 


1007 


AF315106 


Homo sapiens isolate HERV-H-PA9 lone terminal rmeat narfial 

sequence 


0.000001 


1008 


U25177 


Helobdella robusta cyclin A mRNA, partial cds 


0.025 


1009 


NM_021401 


Mus musculus secreted and transmembrane 1 (Sectml), mRNA 


0.084 


1010 


AF339813 


Homo sapiens clone IMAGE: 2 97403, mRNA sequence 




1011 


AF020599 


Equus caballus insulin-like growth factor 2 (IGF2) gene, partial cds 


0.38 


1012 


BC001815 


Homo sapiens, clone IMAGE:2989261, mRNA 


7F-4^ 


1013 


AJ297945 


Frankia sp. ArI3 cryptic plasmid pFQ3 1 ORF AF, korSAF gene, ORF 
CrF, ORF DrF, renF eene ORF FrF ORF GF ORF HF ORF TF 
ORF KrF, ORF LrF, ORF MF, ORF NrF, ORF OF, ORF PrF, parAF 
gene and ORF SrF 


0.64 


1014 


Z71327 


S.cerevisiae chromosome XIV reading frame ORF YNL051w 


0.61 


1015 


XM 031156 


Homo sapiens hypothetical gene supported by AB007970 
(LOC90360), mRNA 


0.003 


1016 


XM 007274 


Homo sapiens hypothetical protein FLJ10242 (FLJ10242), mRNA 


0.029 


1017 


AE001495 


Helicobacter pylori, strain J99 section 56 of 132 of the complete 
genome 


0.076 


1018 


AK024606 ] 


Homo sapiens cDNA: FLJ20953 fis, clone ADSE01979 


e-109 


1019 


] 

AF049434 


Melaleuca lateritia 5S ribosomal RNA gene, partial sequence; and 5S 
ribosomal RNA intergenic spacer, complete sequence 


1.9 
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NO 
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P VAT TTF 


1020 


AF078543 


Pestivirus type 4 strain MN fetus polymerase (NS5b) gene, partial cds 


u.uo 


1021 


M63636 


Streptococcus thermophilus beta-D-galactosidase (lacZ) gene, 
complete cds 


\J.\J 


1022 


AJ270226 


Entodinium caudatum partial mRNA for putative glycosyltransferase, 
clone L48 


0.67 


1023 


M59429 


T. thermophila F-antigen (tfa) gene, complete cds 


0.009 


1024 


X77801 


S.beecheyi genomic DNA with direct repeats and integrated Hepatitis 
B virus 


1.9 


1025 


AL590109 


Human DNA sequence from clone RP1 1-405J10 on chromosome 10, 
complete sequence [Homo sapiens] 


6E-84 


1026- 


AF269904 


Staphylococcus epidermidis strain SRI clone step.l029hl0 genomic 
sequence 


0.076 


1027 


AL591343 


Human DNA sequence from clone RP1 1-260H5 on chromosome 6 
complete sequence [Homo sapiens] 


0.22 


1028 


212986 


O.aries rearranged T-cell receptor delta 


0.78 


1029 


AY042836 


Arabidopsis thaliana Unknown protein mRNA, complete cds 


0.25 


1030 


X00117 


Galago Alu repeat type I, GAL9 


0.0001 


1031 


XM 015858 


Homo sapiens Werner syndrome (WRN), mRNA 


6.6 


1032 


M27444 


Bos taurus (clone pTXD7) dopamine and cyclic AMP-regulated 
neuronal phosphoprotein (DARPP-32) mRNA, complete cds 


0.63 


1033 


AF272852 


Arabidopsis thaliana beta 1,2-xylosyltransferase mRNA, complete cds 


0.77 


1034 


L08174 


Romanomermis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


0.084 


1035 


AF387598 


Nicotiana tabacum EPSP synthase 1 (EPSPS1) gene, partial cds 


0.007 


1036 


AF157094 


Sarcophilus harrisii mitochondrial DNA control region, partial 
sequence 


0.23 


1037 


XM_046761 


Homo sapiens hypothetical gene supported by AK000735 
(LOC92705), mRNA 


0.66 


1038 


AF348479 


Bos taurus myostatin gene, partial cds 


6.1 


1039 


XM_050269 


Homo sapiens solute carrier family 1 (glial high affinity glutamate 
transporter), member 3 (SLC1A3), mRNA 


0.21 


1040 


AF164967 


Canine distemper virus strain A75/17, complete genome 


7.1 


1041 


AB028634 


Flammulina velutipes mitochondrial gene for RNA polymerase, DNA 
polymerase, complete and partial cds 


0.086 


1042 


AF3 12925 


Homo sapiens cryptic (CFC1) gene, complete cds 


0.000004 


1043 


U67596 


Methanococcus jannaschii section 138 of 150 of the complete genome 


0.26 


1044 


L04631 


Human cholesterol 7 alpha-hyroxylase (CYP7) gene, exon 3 


0.009 


1045 


U50336 


Mus musculus Ah receptor gene, promoter region and exon 1, partial 
sequence 


2.1 


1046 


AF105001 


Mus musculus Fab 1098 immunoglobulin light chain variable region 
mRNA, partial cds 


0.8 
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NO 
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P VAT TTF 


1047 


U55276 


Rattus norvegicus microtubule-associated protein IB gene, promoter 
region 


n 70 
u. /y 


1048 


AF056302 


Drosophila melanogaster eIF-2alpha kinase (GCN2) mRNA, complete 
cds 


U.OJ 


1049 


NC_001322 


Drosophila yakuba mitochondrion, complete genome 


0.24 


1050 


AF097036 


Oreina coenil&i cvtrvliromp fwiHace* ciiHunit T <jf*nF> nar+tai pHc* 

viviiia vuviuis^a vjr iv/wiiiuuib UAiUflDC OUUU111L 1 gCHC, pdillal COS, 

mitochondrial gene for mitochondrial product 


U.ZJ 


1051 


L12148 


Pasteurella haernolvtir^i-likp on nlllrtP nlllrtR r»l llrfF> anH l^nirrktnvi« 
structural protein (pllktA) genes, complete cds 


0.21 


1052 


AB005153 


MilS musculus DNA for DNA denpnripnt nrntpin Vinncp min\\it\r 

subunit, exon 5, 6, 7 


0.006 


1053- 


AF126733 


Brassica oleracea var. italica floral homeotic protein pseudogene, 
partial sequence 


1 1 


1054 


AF090900 


Homo sapiens clone HQ0189 PRO0189 mRNA, complete cds 


0.000001 


1055 


AJ223260 


Chromodoris purpurea mitochondrial COI gene, partial 


0.007 


1056 


AF089052 


Pseudoleistes vire^cen*; cvtochromp h fcvfh^ optip iriitnr'lirrnHrial apnp 
encoding mitochondrial protein, partial cds 


0 77 


1057 


AL 138748 


Human DNA ^ennence frnm Hnnp PP4.-717"F1Q nn rhrnmncnmp 

AUliliCUl. J—' -L ^ •£>. OvV^UWllvV i.1 Will ^lUltb l\T*r / JZi1j17 \Jll Will UillUoUlliC 

Xq21. 1-21.33 Contains GSSs, complete sequence [Homo sapiens] 


0.59 


1058 


AL 132767 


Human DNA sequence from clone RP5-819L10 on chromosome 6ql2- 
13. Contains GSSs, complete sequence [Homo sapiens] 


5 


1059 


AF032670 


Lucilia cuprina Notch homolog (Scl) gene, exon B 


0.055 


1060 


AE001371 


Plasmodium falciparum chromosome 2, section 8 of 73 of the 
complete sequence 


0.062 


1061 


AY039845 


Arabidopsis thaliana At 1 g3 5 1 60/T3 2 G9 30 mRNA, complete cds 


4.6 


1062 


AF365189 


Sindoropsis le-testui tRNA-Leu (trnL) gene, partial intron sequence; 
chloroplast gene for chloroplast product 


0.11 


1063 


U60591 


Drosoohila melanocaster kuzbanian flcuz 1 * mRNA cnmnletp cd<; 


0.41 


1064 


AJ297052 


Homo sapiens SDS-stable vimentin-bound DNA fragment 
HEF42VIM12 


4E-09 


1065 


AB023191 


Homo sapiens mRNA for KIAA0974 protein, partial cds 




1066 


AK021989 


Homo sapiens cDNA FLJ1 1927 As, clone HEMBB 1000402 


0.33 


1067 


XM_038524 


Homo sapiens hypothetical protein FLJ23059 (FLJ23059), mRNA 


e-169 


1068 


XM_045967 


Homo sapiens estrogen receptor 1 (ESR1), mRNA 


4.9 


1069 


AF354258 


Homo sapiens polynucleotide kinase-3 '-phosphatase (PNKP) gene, 
complete cds 


0.66 


1070 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.038 


1071 


NC_002756 


Mertensiella luschani mitochondrion, complete genome 


4.9 


1072 


Z68886 


Human DNA sequence from cosmid L21F12, Huntington's Disease 
Region, chromosome 4pl6.3 


2E-10 


1073 


AE001370 


Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 


0.025 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT TTF 


1074 


XM_033521 


Homo sapiens hypothetical protein FLJ 10357 (FLJ10357), mRNA 


1 S 


1075 


U02239 


Mycoplasma genitalium random genomic clone xd5, partial cds 


0.006 


1076 


AE006395 


Lactococcus lactis subso lactis IL1403 section 1 S7 of 218 nf th** 
complete genome 


0.32 


1077 


AY023826 


Oryza sativa microsatellite MRG6151 containing (ATAC)X8, 
genomic sequence 


0.031 


1079 


X63213 


B.taurus CI-B12 mRNA for ubiquinone oxidoreductase complex 


7 


1080 


AK021648 


Homo sapiens cDNA FLJ11586 fis, clone HEMBA1003720 


3E-13 


1081 


AF246689 


Dictyostelium discoideum ABC transporter mdrAl (MDRA1) and 
ABC transporter mdrA2 (MDRA2) genes, complete cds 


0.67 


1082- 


AB016100 


Oryza sativa LINE retrotransposon, endonuclease region of RILN5 


0.068 


1083 


U93157 


Orestias species mitochondrial control region, allele SRT1, partial 
sequence 


6.9 


1084 


AF173373 


Caenorhabditis eleeans transmembrane nrotein NDfi-4 fnnV_4A 
mRNA, complete cds 


1.8 


1086 


AJ3 10640 


Turkey coronavirus genomic RNA for partial spike glycoprotein (s 
ffene^ 3a orotein ff?ene 3^ 3h nrotein fpent* 31 T* nrntHn f<r*»np ^1 anH 
partial M protein (gene 4), strain turkey/UK/4 12/00 


2 


1087 


M21953 


Duck hepatitis B virus, complete genome 


0.086 


1088 


AF332583 


Homo saoiens stratum comenm chvmotrvntir pn7vmp (TCI TC*7} pptip 
complete cds, alternative transcripts 


2.1 


1089 


AY037894 


Glomus intraradices phosphate transporter mRNA, partial cds 


0.084 - 


1091 


NM_020078 


Rattus norvegicus a disintegrin and metalloproteinase domain 1 
(fertilin alpha) (Adaml), mRNA 


0.51 


1092 


AF282531 


Homo sapiens clone 18qtel c30t7 sequence 


0.003 


1093 


AF324424 


Ichthyophthirius multifiliis immobilization antigen isoform (IAG52 A) 
gene, complete cds 


0.56 


1094 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.075 


1096 


Z46803 


V.narbonensis mRNA for legumin A precursor 


2.2 


1097 


XM_050195 


Homo sapiens similar to hypothetical protein FLJ14751 (H. sapiens) 
(LOC93274), mRNA 


5.7 


1098 


AJ246004 


Homo sapiens RET gene, 5 1 genomic fragment 


0.00004 


1099 


AY031051 


HIV-1 isolate NC4057-1999 from USA pol polyprotein (pol) gene, 
partial cds 


0.23 


1100 


AE002146 


Ureaplasma urealyticum section 47 of 59 of the complete genome 


0.26 


1101 


Z23650 


H. sapiens (D1S436) DNA segment containing (CA) repeat; clone 
AFM217zc3; single read 


0.001 


1102 


D73382 


Sugarbeet mitochondrial gene for ORFB, complete cds 


0.73 


1103 


U67503 


Methanococcus jannaschii section 45 of 150 of the complete genome 


0.69 


1104 


AL590376 


Human DNA sequence from clone RP1 1-22P2 on chromosome Xq23- 
24, complete sequence [Homo sapiens] 


0 


1106 


D 14636 


Mouse mRNA for PEBP2al protein, complete cds 


0.61 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


1107 


Y14622 


Mus musculus Tnfr2 gene, exon 4 to exon 8 


U. /I 


1108 


AF1 30049 


Homo sapiens clone FLB341 1 PRO0852 mRNA comnlete cds 


U.UtO 


1109 


AJ011009 


Glycine max cytosolic glutamine synthetase fesl5^ eene nromntpr 


U.UU / 


1110 


XM 043523 


Homo sapiens Rho guanine exchange factor fGFF^ 1 1 fAPW^FFi i 1 * 
mRNA 


U. / 


1111 


Z77974 


H. sapiens flow-sorted chromosome 6 Hindlll fragment SC6nA^nn 




1112 


AJ002513 


Schizosaccharomyces pombe cmbl gene 




1113 


AF098878 


Homo sapiens APAF1 gene, exons 6 and 7 


1.9 


1114 


M14080 


Herpesvirus saimiri thymidylate synthase gene complete cds 


ft ft?** 


1115 


U44838 


Glycine max extensin (SbHRGP3) gene, complete cds 


0.76 


1116. 


Y15491 


Pongo pygmaeus gene encoding fertilin alpha 


9 1 


1117 


U43145 


Plasmodium chabaudi repeat organellar protein gene complete cds 




1118 


Z81524 


Caenorhabditis elegans cosmid F32H5, complete sequence 




1119 


AF052957 


Homo sapiens type XV collagen (COL15A1) gene, exons 3 and 4 


2.2 


1120 


U67523 


Methanococcus jannaschii section 65 of 150 of the complete genome 


0.001 


1121 


AL121807 


S.pombe chromosome III cosmid cl32 


0.19 


1122 


X59280 


S.douglasii gene for cytochrome b 




1123 


XM_030915 


Homo sapiens KIAA0673 protein (KIAA0673), mRNA 


SF 1Q 


1124 


X99254 


P.falciparum gene encoding primase, small subunit 


0.084 


1125 


AC004214 


Homo sapiens clone UWGC:m32agap from 6p21, complete sequence 


0.009 


1126 


AF061244 


Agrocybe aegerita B type DNA polymerase (Mtpol) gene, complete 
cds; tRNA-Asn gene, comnlete seauence* and linknnwn ^npc 
mitochondrial genes for mitochondrial products 


0.001 


1127 


AK001125 


Homo sapiens cDNA FLJ10263 fis, clone HEMBB1000991 


n 


1128 


L41046 


Pisum sativum endo-l,4-beta-glucanase (EGL1) eene comnlpt** rHc 


n nnni 

U.UUUi 


1129 


XM 008099 


Homo sapiens integrin, alpha L (antigen CD1 1A fnlfcrn lvmnhnrvt** 
function-associated antigen 1; alpha polypeptide) (ITGAL) mRNA 


ZrJJr— 1 O 


1130 


AE000049 


Mycoplasma pneumoniae M129 section 31 of 63 of the complete 
genome 


0.75 


1131 


( 

AY019627 i 


Dryza sativa microsatellite MRG1952 containing (AT)X40, closest to 
marker L128, genomic sequence 


0.024 


1132 


Z46383 ] 


Bovine herpesvirus type 4 DNA for nonconserved region A 


4.3 


1133 


Y07893 1 


5.cerevisiae DNA fragment, right arm of chromosome VII 


4.2 


1134 


XM 040867 ] 


flomo sapiens sperm acrosome associated 1 (SPACA1), mRNA 


0.003 


1135 


] 

XM 038693 i 


lomo sapiens solute carrier family 9 (sodium/hydrogen exchanger), 
soform 2 (SLC9A2), mRNA 


2 


1136 


1 

NM 033074 c 


vlus musculus DNA segment, Chr 15, Wayne State University 59, 
;xpressed (D 1 5 Wsu59e), mRNA 


0.64 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



1137 
1138 



M97168 



D13973 



|Homo sapiens X (inactive)-speciflc transcript (XIST) complete exon 



Dictyostelium discoideum DNA for Dp87 protein, complete cds 



1139 I AF303254 jCaenorhabditis elegans gene XL 153 mRNA, complete cds 



1140 
1141 



1144 
1145 



1146 
1147 



1152 
1153 



AB009926 



1161 
1162 



ICyphonocerus ruficollis mitochondrial DNA for 16S rRNA, partial 
I sequence 



NC_002660 jCochliomyia hominivorax mitochondrion, complete genome 



1142 AL096756 



Human DNA sequence from clone RP3-477J10 on chromosome 
22ql3.3 1-13.33 Contains a GSS, complete sequence [Homo sapiensl 



AE003025 
Z11702 



Drosophila melanogaster genomic scaffold 142000013385389, 
| complete sequence 



XM_049924 
XM 047707 



V.faba mRNA for polyphenol oxidase 



Homo sapiens calcium binding protein 5 (CABP5), mRNA 



Homo sapiens KIAA1265 protein (KIAA1265), mRNA 



1148 AY005256 



Xylocopa erythrina cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 



AJ001677 
J03252 



1154 AK007179 



Mus musculus adult male testis cDNA, RKEN full-length enriched 
library, clone: 17001 12J16, full insert sequence 



1155 AK004093 



Mus musculus 18 days embryo cDNA, RKEN full-length enriched 
library, clone: 1110034K09, full insert sequence 



1156 AF370142 



Arabidopsis thaliana unknown protein (T5I7.1/At2g39710) mRNA, 
[complete cds 



1157 AF053649 



Homo sapiens cellular apoptosis susceptibility protein (CSE1) gene, 
exons 15 and 16 



1158 AL080199 



Homo sapiens mRNA; cDNA DKFZp434E082 (from clone 
|DKFZp434E082) 



1159 AF267225 



Candidates Carsonella ruddii natural-host Tainarys sordida ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma | 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
|(atpD) gene, partial cds 



Mus musculus adult male testis cDNA, RKEN full-length enriched 
AK007179 library, clone: 17001 12J16, full insert sequence 



1163 
1164 



NM_008787 [Mus musculus pericentrin (Pent), mRNA" 



AK024592 [Homo sapiens cDNA: FLJ20939 fis, clone ADSE01583~ 



D 1 3 1 97 | Mouse gene for immunoglobulin diversity region D 1 



P VALUE 



3.4 



0.075 



0.24 



0.0009 



0.26 



1143 XMJ)31937 Homo sapiens hypothetical protein FL Jill 83 (FLJ1 11 83), mRNA 2.2 



0.22 



0.76 



5.6 



1.6 



0.086 



Rsapiens telomeric DNA sequence, clone 7PTEL026 read 
1151 | Z96679 |7PTELOO026.seq 



9E-34 



Branchiostoma floridae mRNA for phenylalanine hydroxylase, partial | 0.067 
[Human alkaline phosphatase (ALPP) gene, complete cds ~ 065~ 



0.24 



6.7 



1.7 



0.006 



0.081 



0.15 



1160 AE000654 Helicobacter pylori 26695 section 132 of 134 of the complete genome 0.51 



0.25 



1.7 



0.077 



0.076 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



SEQID| 
NO 



ACCESSN 



DESCRIP 



P VALUE 



1165 AE001381 



Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 



0.073 



1166 



X68360 



M.fascicularis gene for apolipoprotein A-II 



0.0009 



1167 



X73032 



B.juncea ribosomal intergenic spacer 



0.023 



1168 XM 037745 



Homo sapiens hypothetical gene supported by AL137266 
(LOC91332), mRNA 



0.21 



1169 NM 017755 



Homo sapiens hypothetical protein FLJ20303 (FLJ20303), mRNA 



0.58 



1170 XM 045819 



Homo sapiens cofilin 2 (muscle) (CFL2), mRNA 



1171 AE006349 



Lactococcus Iactis subsp. lactis IL1403 section 111 of 218 of the 
complete genome 



0.12 



1172- XM 030637 



Homo sapiens G protein-coupled receptor kinase 7 (GPRK7), mRNA 



1.6 



1173 AK017298 



Mus musculus 6 days neonate head cDNA, RIKEN full-length 
enriched library, clone:5430411K23, full insert sequence 



0.18 



1174 AF154915 



Homo sapiens transcription factor HOXD12 (HOXD12) and 
transcription factor HOXD11 (HOXD1 1) genes, complete cds 



0.52 



1175 AF143066 



Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 



0.51 



1176 AL391814 



Human DNA sequence from clone RP1 1-49A4 on chromosome 6, 
complete sequence [Homo sapiens] 



0.06 



1177 AF003511 



Toxoneuron sp. 16S ribosomal RNA gene, partial sequence 



0.0000001 



1178 AK022350 



Homo sapiens cDNA FLJ12288 fis, clone MAMMA1001783 



1180 AF307986 



Mus musculus small inducible cytokine A21b (Scya21b) gene, 
complete cds 



0.00002 



0.021 



1181 NM 022242 



Rattus norvegicus niban protein (Niban), mRNA 



1182 



Z57380 



H.sapiens CpG island DNA genomic Msel fragment, clone 173c2, 
forward read cpg!73c2.ftla 



0.13 



1183 XM 043937 



Homo sapiens hypothetical protein MGC13033 (MGC13033), mRNA 



6.4 



1184 



L15439 



Geukensia demissa neuropeptide mRNA, complete cds 



0.24 



1185 



AJ251399 



Proechimys cayennensis mitochondrial cytb gene for cytochrome b 
(specimen voucher V-815) 



0.7 



1186 



U00053 



Caenorhabditis elegans cosmid K03F8 



0.62 



1187 



J00277 



Human (genomic clones lambda-[SK2-T2, HS578T]; cDNA clones 
RS-[3,4, 6]) c-Ha-rasl proto-oncogene, complete coding sequence 



7E-55 



1189 



X70677 



X.laevis XFG 5-1 mRNA for zinc finger protein 



0.23 



1190 



AJ223323 



Saccharomyces cerevisiae mitochondrial tRNA-Tyr, tRNA-Asn, & 
tRNA-Met genes 



0.083 



1191 XM 030479 



Homo sapiens KIAA1462 protein (KIAA1462), mRNA 



0.76 



1192 AK016425 



Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4931403E03, full insert sequence 



0.00003 



1193 



AF092854 



Saguinus mystax visual pigment gene, P562 allele, intron 4 



0.0001 



1194 AC025732 



Homo sapiens BAC clone RP1 1-1 16J19 from Y, complete sequence 



1.7 
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JSEQID 




NO 


ACCESSN 


1 


AFOfl') 1 11 


1196 


M94732 


1197 


X76311 


1 H98 


i-fOl / JO 


1199 


Z82625 


1200 


XMJ)12528 


1201 


M27082 


1202 


AK007856 


1203 


M69188 


1204 


AB043132 


1205 AF019084 


1206 


U32784 



Table 3A Nearest Neighbor (BlastN vs. Genbank^ 

DESCRIP 

Chlamydophila pneumoniae AR39, section 13 of 94 of the comnlete 
I genome y 

Plasmodium falciparum 15 kDa vesicular-like antigen gene exons 1 



H.sapiens endot helial nitric oxidase synthase gene, exons 15 *nrt i* . 
Homo sapiens (subclone l_g5 from PI H16) DNA sequence, complete 

[sequence 1 

R.prowazekii genomic DNA fragment (clone A396F)" 
iHomo sapiens protein kinase C, beta 1 (PRKCB1), mRNA ' 
IS.cerevisiae GCN2 gene (which couples GCN4 expression to amino 

acid availability), complete cds 

|Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone: 18 10054D07, full insert sequence 
Cotton legumi n A D-genome alloallele gene, complete cds 
Thunnus thyn nus DNA, microsatellite Ttho-6*, partial sequ ence 
Homo sapiens keratin 2e (KRT2E) gene, corn^tertT 



0.061 
0.76 
2.1 

0.21 

6E-73 
0.076 
0.001 



1207 
1208 



Haemophilus inf luenzae Rd section 99 of 163 of the complete geno me I 

™ M „ | HOm ° SapienS ubi( l uitin -conjugating enzyme E2H (homologous to 
XM 004699 yeast UBC8) (UBE2H). mRNA 



6 

0.026 



1209 



AF375643 Pongo pygmaeus HERV-K-GC1 preintegration site sequence 

Am717Q r us musculus adult male testis cDNA, RIKEN full-length enriched 
AK007179 library, clone: 17001 12J16, full insert sequence 



1210 AF182214 



Glycine max glutamine synthetase gamma 2 subunit gene, 5'-flanking 
region and partial cds 1 



2.2 
0.22 



ion I a it, ™ A »oplectus meridensis ribulose 1,5-bisphosphate carboxylase large 
1211 AF170224 subunit gene, partial cds; chloroplast gene for chloroplast 



: product 



1212INM 022978 ^ ^ 1B Homeric) (SERF1B), 



1213 
1214 



Gromphadorhina portentosa mitochondrion 16S ribosomal RNA 
U17787 partial sequence 

AF144086 Papio hamadryas anubis chromosome 13 repeat region 



6.5 
0.23 



0.027 
0.25 



1215 XM_0101Q6 Homo s apiens DKFZP586N0819 protein (DKFZP58 6NQR1Q^ mRNA 

1216 AF198964 I Arabidopsis thaliana ATP sulfurylase gene, complete cds ! 



Xen ° PUS laeviS mRNA for ™&d helix transcription factor (foxd3b 
AJ298866 |gene) 



0.73 
0.072 



1217 



1218 
1219 



Uroleucon obscurum large subunit ribosomal RNA gene, partial 
sequence; tRNA-Val gene, complete sequence; and small subunit 

AF069 111 ribosomal RNA gene, partial sequence 

X16715 ID. melanogaster gene for chorion protein s!6 



0.2 



0.003 



0.23 



1220 



AF245117 |Mus musculus leukocyte cell-surface molecule (Ly9) gene, exon 1 



0.024 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1221 


AP001310 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MU022 


0.23 


1222 


AF092375 


Rivulus gransabanae tRNA-Ile, tRNA-Gln, and tRNA-Met genes, 
comolete seouence* NADH dehvdrof*f i n5i<»p cnHnnit TT <rf»n*» mmniptp 
cds: tRNA-Tnr tRNA-Ala tRNA-Asn tRNA-Cvs and tRNA-Tvr 
genes, complete sequence; and cytochrome c oxidase subunit I gene, 
par> 


0.26 


1223 


U84864 


HIV-1 strain M14 from USA, envelope glycoprotein (env) gene, 
partial cds 


0.76 


1224 


XM_051684 


Homo sapiens similar to estrogen-related receptor alpha; estrogen 
receptor-like 1 (H. sapiens) (LOC93491), mRNA 


0.57 


1225 


AF282391 


Cucumis sativus clone B99 mitochondrial genomic sequence 


2.2 


1226. 


AK026459 


Homo sapiens cDNA: FLJ22806 fis, clone KAIA2845 


0.024 


1227 


AJ003131 


Spodoptera littoralis nuclear polyhedrosis virus fgf, egt genes and 3 
ORFs 


0.022 


1228 


XM 028308 


Homo sapiens helicase-moi (KIAA0928), mRNA 


1.8 


1229 


AF219168 


Fritillaria camtschatcensis clone cam36 retrotransposon Tfcl, partial 
sequence 


2 


1230 


AF325699 


Homo sapiens ataxia-telangiectasia and Rad3 related protein (ATR) 
gene, partial cds, alternatively spliced 


0.069 


1231 


AE007433 


Streptococcus pneumoniae section 116 of 194 of the complete genome 


7.9 


1232 


XM_007429 


Homo sapiens CGI-35 protein (LOC51077), mRNA 


1E-08 


1233 


AF021084 


Dianous nitidulus cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.07 


1234 


Z97349 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-06, complete sequence 


2.1 


1235 


XM 038588 


Homo sapiens like-glycosyltransferase (LARGE), mRNA 


5.5 


1236 


X95276 


P.falciparum complete gene map of plastid-like DNA (IR-B) 


0.008 


1237 


U88570 


Drosophila melanogaster CREB-binding protein homolog mRNA, 
complete cds 


1.7 


1238 


AK024332 


Homo sapiens cDNA FLJ14270 fis, clone PLACE1004491 


0.68 


1239 


AC006678 


Caenorhabditis elegans cosmid R05G9, complete sequence 


1.8 


1240 


AF250958 


Lycopersicon esculentum peptide deformylase-like protein mRNA, 
complete cds 


0.076 


1241 


M26650 


Human islet amyloid polypeptide (hIAPP) gene, complete cds 


1.9 


1242 


XM_037062 


Homo sapiens DKFZP564G202 protein (DKFZP564G202), mRNA 


0.61 


1243 


AE003025 


Drosophila melanogaster genomic scaffold 142000013385389, 
complete sequence 


0.027 


1244 


U39701 


Mycoplasma genitalium section 23 of 51 of the complete genome 


2.1 


1245 


AE000036 


Mycoplasma pneumoniae M129 section 44 of 63 of the complete 
genome 


8.2 


1246 


AE000830 


Methanobacterium thermoautotrophicum from bases 394245 to 
404874 (section 36 of 148) of the complete genome 


0.67 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VAI I IE 


1247 


AL358951 


Human DNA sequence from clone RP3-456L16 on chromosome 6 
complete sequence [Homo sapiens] 




1248 


XM 045757 


Homo sapiens chloride channel 4 (CLCN4), mRNA 


0.56 


1249 


U59683 


Nicotiana tabacum squalene synthase (TSS) gene, complete cds 




1250 


AKO 10434 


Mus musculus ES cells cDNA RIKEN fill 1-1 en Pth enriched lihrarv 
clone:24 10006021, full insert sequence 


4.9 


1251 


XM 045757 


Homo sapiens chloride channel 4 (CLCN4), mRNA 


0.61 


1252 


AE000717 


Aquifex aeolicus section 49 of 109 of the complete genome 


0.024 


1253 


AC024780 


Caenorhabditis elegans cosmid Y43B1 1 AR, complete sequence 


0.027 


1254 


AF343030 


Human herpesvirus 7 isolate CR18 glycoprotein H gene, complete cds 


0.009 


1255 


AK022153 


Homo sapiens cDNA FLJ12091 fis, clone HEMBB 1002582 


0.0000001 


1256 


AF309947 


Dictyostelium discoideum RaclA (racl A) gene, complete cds; and 
unknown gene 


0.009 


1257 


AB051519 


Homo sapiens mRNA for KIAA1732 protein, partial cds 


0.25 


1258 


AC006681 


Caenorhabditis elegans clone R13H9, complete sequence 


2.2 


1260 


Y11648 


Campylobacter jejuni waaC, galE, wla[B,C,D,E,F,G,H,I,K,L,M], 
cheY genes and orfl 


0.25 


1261 


AB009592 


Oryza sativa gene for cytosolic glutathione reductase, complete cds 


2.1 


1262 


X83997 


C.parasitica eapC gene 


0.22 


1263 


AJ404320 


Posidonia oceanica medium repetitive sequence, clone pPopm6 


0.034 


1264 


AF038620 


Caenorhabditis elegans cosmid H10D12 


0.063 


1265 


U05822 


Human proto-oncogene BCL3 gene, exon 2 


0.0000001 


1266 


Z79343 


H. sapiens flow-sorted chromosome 6 TaqI fragment, SC6pA4G2 


0.13 


1267 


S60289 


LeB4=legumin {5' region} [Vicia faba, Genomic, 1222 nt] 


0.0003 


1268 


AE006303 


Lactococcus lactis subsp. lactis IL1403 section 65 of 218 of the 
complete genome 


1.9 


1269 


AE001410 


Plasmodium falciparum chromosome 2, section 47 of 73 of the 
complete sequence 


0.008 


1270 


X96972 


D.buzzatii copia element DNA 


0.2 


1271 


AB042973 


Nicotiana tabacum wizz gene, 5' flanking region, partial cds 


0.073 


1272 


Z95309 


Caenorhabditis elegans cosmid H36L18, complete sequence 


3.5 


1273 


AF091745 


Homo sapiens exonuclease I (EXOI) gene, exon 5 


0.2 


1274 


AF386945 


Arabidopsis thaliana Rubisco subunit binding-protein beta subunit 
(T5A14.11) mRNA, complete cds 


0.7 


1275 


AF269376 


Staphylococcus epidermidis strain SRI clone step. 1002h01 genomic 
sequence 


0.076 


1276 


AK012864 


Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810036I15, full insert sequence 


0.028 


1277 


Z74204 


S.cerevisiae chromosome IV reading frame ORF YDL156w 


0.18 


1278 


AB049851 


Macaca fascicularis brain cDNA, clone:QnpA- 18831 


2E-91 


1279 


AE001761 


Thermotoga maritima section 73 of 136 of the complete genome 


0.19 


1280 


AE002102 


Ureaplasma urealyticum section 3 of 59 of the complete genome 


5.5 


1281 


AK021982 


Homo sapiens cDNA FLJ1 1920 fis, clone HEMBB10003 12 


0.006 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1282 


AE007504 


Streptococcus pneumoniae section 187 of 194 of the complete genome 


0.21 


1283 


X79095 


F.trinervia pdk gene for pyruvate,orthophosphate dikinase 


0.22 


1284 


AE007413 


Streptococcus pneumoniae section 96 of 194 of the complete genome 


1.7 


1285 


AB050513 


Macaca fascicularis brain cDNA, clone:QnpA-17049 


0.41 


1286 


AF339809 


Homo sapiens clone IMAGE:274150 5 mRNA sequence 


0.52 


1287 


AF074991 


Homo sapiens full length insert cDNA YH88A03 


4E-09 


1288 


AF277581 


Campostoma anomalum microsatellite CA9 sequence 


1.6 


1289 


AF035245 


Bacillus lynceorum strain pVIL3 1 Bag320 satellite DNA 


0.23 


1291 


AF298208 


Dictyostelium discoideum transposon thug-T, complete sequence 


0.028 


1292. 


AL133656 


Homo sapiens mRNA; cDNA DKFZp434O2017 (from clone 
DKFZp434O2017) 


0.021 


1293 


AK024666 


Homo sapiens cDNA: FLJ21013 fis, clone CAE05223 


9E-34 


1294 


U55367 


Caenorhabditis elegans cosmid C55H1, complete sequence 


2.1 


1295 


X16715 


D. melanogaster gene for chorion protein sl6 


0.25 


1296 


AC001080 


Homo sapiens (subclone 3_g7 from PAC H74) DNA sequence, 
complete sequence 


0.69 


1297 


XM_051200 


Homo sapiens hypothetical protein MGC5149 (MGC5149), mRNA 


0.58 


1298 


AE000016 


Mycoplasma pneumoniae M129 section 1 of 63 of the complete 
genome 


0.026 


1299 


AF264059 


Homo sapiens RAPD fragment lost in a grade 2 astrocytoma 


0.73 


1300 


AE002308 


Chlamydia muridarum, section 39 of 85 of the complete genome 


0.078 


1301 


U58745 


Caenorhabditis elegans cosmid C10G6, complete sequence 


0.26 


1302 


XM_007768 


Homo sapiens KIAA1018 protein (KIAA1018), mRNA 


2.2 


1303 


AC000102 


Homo sapiens Chromosome 22ql 1.2 BAC Clone 60b5 In GNAZ-BCR 
Region, complete sequence 


0.17 


1304 


AF338734 


Homo sapiens hypothetical PHD zinc finger protein XAP135 
pseudogene, complete sequence 


2.3 


1305 


AF211141 


Carsonella ruddii natural-host Pachypsylla celtidis tryptophanyl-tRNA 
synthetase (trpS) gene, partial cds; 16S ribosomal RNA, 23S 
ribosomal RNA, and 5S ribosomal RNA genes, complete sequence; 
transaldolase (tal) and alkyl hydroperoxide reductase small > 


0.087 


1306 


L81391 


Homo sapiens (subclone 2_a6 from PI H39) DNA sequence, complete 
sequence 


0.18 


1307 


AB013613 


Oryza sativa DNA, centromere sequence RCB1 1 


0.085 


1308 


U03906 


Macroscelides proboscideus aldehyde dehydrogenase I, eta-crystallin 
mRNA, complete cds 


0.26 


1309 


L13470 


Human thyroxine-binding globulin gene, complete cds 


0.21 


1310 


AK026225 


Homo sapiens cDNA: FLJ22572 fis, clone HSI023 13 


3E-27 


1311 


XM_030391 


Homo sapiens dJ467Nl 1. 1 protein (DJ467N1 1. 1), mRNA 


0.001 


1312 


XM_017923 


Homo sapiens hypothetical protein FLJ12660 (FLJ12660), mRNA 


2E-32 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1313 


M57256 


B.hermsii vmp21 gene encoding Vmp21 outer membrane lipoprotein 


0.58 


1314 


Y14051 


Staphylococcus aureus mecA, mecRl, mecl genes and ORF168, 
ORF142, ORF44, ORF145 and ORF224 


0.081 


1315 


XM_032018 


Homo sapiens similar to RIKEN cDNA 2310040G17 gene (M. 
musculus) (LOC90480), mRNA 


0.027 


1316 


AF227514 


Octodon lunatus cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


1.9 


1317 


AE001414 


Plasmodium falciparum chromosome 2, section 51 of 73 of the 
complete sequence 


0.73 


1318 


Z21622 


S.cerevisiae RIB 7 gene encoding HTP reductase 


0.77 


1319. 


XM 040462 


Homo sapiens ATP-binding cassette, sub-family D (ALD), member 3 
(ABCD3), mRNA 


1.8 


1320 


Z65229 


H.sapiens CpG island DNA genomic Msel fragment, clone 28a8, 
forward read cpg28a8.ftla 


0.23 


1321 


XM_040095 


Homo sapiens hypothetical protein (DKFZP434G0310), mRNA 


2.3 


1322 


XM_027314 


Homo sapiens hypothetical protein FLJ22351 (FLJ22351), mRNA 


1E-14 


1323 


AF282064 


Homo sapiens clone 2qtel_edcl7iF-cl7jF sequence 


3E-88 


1324 


AF101098 


Homo sapiens collagen type XI alpha-1 (COL11A1) gene, exon 38 


1.3 


1325 


AB050093 


Neoceratodus forsteri mRNA for pituitary glycoprotein hormone alpha 
subunit, complete cds 


0.009 


1326 


AL591507 


Human DNA sequence from clone RP1 1-289C14 on chromosome 6, 
complete sequence [Homo sapiens] 


0.026 


1327 


AJ400873 


Homo sapiens partial GPLD1 gene for glycosylphosphatidylinositol 
phospholipase D, exons 15-20 


1.9 


1328 


AF378831 


Mus musculus orphan G protein-coupled receptor (Raig2) mRNA, 
complete cds 


1.9 


1329 


XM_030116 


Homo sapiens hypothetical protein MGC3077 (MGC3077), mRNA 


0 


1330 


X54806 


R.norvegicus mRNA for cytokeratin type I (3* end) 


0.77 


1331 


XM_013141 


Homo sapiens ATPase, Cu-w- transporting, alpha polypeptide 
(Menkes syndrome) (ATP7A), mRNA 


0.72 


1332 


NM_026428 


Mus musculus RIKEN cDNA 1810027P18 gene (1810027P18Rik), 
mRNA 


0.019 


1333 


U85448 


Actinidia deliciosa sucrose-phosphate synthase (KSPS-1) mRNA, 
partial cds 


0.001 


1334 


AK013974 


Mus musculus 13 days embryo head cDNA, RIKEN full-length 
enriched library, clone:3110002D09, full insert sequence 


0.74 


1335 


U25641 


Tetrahymena thermophila telomerase component p80 mRNA, 
complete cds 


0.008 


1336 


NC_002102 


Lactobacillus helveticus plasmid pLHl, complete sequence 


0.78 


1337 


M14292 


Human LIHeg repetitive element from the intergenic region of the 
epsilon and G-gamma globin genes 


5E-08 
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SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


1338 


AF239910 


jrcLuiud dAiiidns napiorype qid seii-incompatibility nbonuclease 
mRNA, complete cds 


2.1 


1339 


AF 116872 


v^oiiuiud diuiwiiib aiiernauve oxidase (AUA2) gene, complete cds 


0.25 


1340 


XM 035344 


nuiiiu ^apiciii Ki/\A.ioos protein (IvlAAlooo), mRNA 


1.6 


1341 


XM 046636 


Homo sapiens hypothetical protein FLJ20080 (FLJ20080), mRNA 


6E-74 


1342 


AF292561 


dirongyioiaes siercoraiis u protein alpha subunit 2 (gpa-2) gene, 


0.76 


1343 


AF008597 


\^auidrdninus roseus aesacetoxyvmaoline-4-hydroxylase (CRSD4H) 
gene, complete cds 


0.26 


1344 


AF347671 


equine iiuiuus a virus isolate rvtiKV/oz fi (Fl) gene, partial cds 


6.9 


1345 


U89019 


Hepatitis C virus polyprotein gene, complete cds 


0.27 


1346 


XM 043815 


xzuinu aapicns> nypuuieucai proiein rJLJzl / jo (rLJ2173o) mRNA 


2.2 


1347 


L39930 


Podocoryne carnea fibrillin mRNA, partial cds 


0.7 


1348 


Z21512 


K.lactis GAL80 gene encoding K1GAL80 protein 


0.075 


1349 


AC001490 


nomo sapiens (suocione z_cl(J rrom PI H32) DNA sequence, 
complete sequence 


2E-11 


1350 


L36891 


Saccharomyces cerevisiae mitochondrion transfer RNA-fMet (tRNA- 
fMet) gene; transfer RNA-Pro (tRNA-Pro) gene; 9S ribosomal RNA 
C9S rRNA 1 * ffene 


0.025 


1351 


AKO 17200 


Mus musculus 1 1 days pregnant adult female ovary and uterus cDNA, 
xvjjsjlin iuii-iengin enncnea HDrary, clone:5033428C03, full insert 
sequence 


6.2 


1352 


AB003286 


nomo sapiens l/ina ior cnoiine Kinase like protein and muscle type 
carnitine palmitoyltransferase I, partial and complete cds 


0.069 


1353 


AF312917 


ridsmooium iaiciparum reticulocyte binding protein 2 homolog B 
gene, complete cds 


0.081 


1354 


M65060 


ivi.uiermoauiorropmcum trpli, trpu, trpC, trpF, trpB, trpA, trpD 


0.26 


1355 


AB025642 


.rtxdoiuopsis uianana genomic uiNA, enromosome 5, BAC 


0.34 


1356 


L24957 


^ygu;>atA,iidruinyces cian aipna-gaiactosiaase gene, complete cds 


0.08 


1357 


L38437 


Rattus norvegicus NADH ubiquinone oxidoreductase subunit (IP 13) 

apnp rnmnlptp rHc 


0.27 


1358 


ABO 1686 


rui^yaiuiua ap. iyix/\-ZJ Ho 1, lSOldie UJNA-ZD 


2.2 


1359 


AB002805 ] 


Liuinvj &<ipicii» \jo-y gene, j upsiream region and partial cos 


0.0004 


1361 


XM 030723 ] 


Homo sapiens CGI-107 protein (LOC51012), mRNA 


3.1 


1362 


] 

NM 031881 ] 


Mus musculus neural precursor cell expressed, developmental^ down- 
regulated gene 4b (Nedd4b), mRNA 


3.8 


1363 


] 

AC008855 i 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


0.72 


1364 


NC 002660 < 


"ochliomyia hominivorax mitochondrion, complete genome 


0.074 


1365 


AF072589 ] 


python regius alpha enolase mRNA, complete cds 


1.4 


1366 


AJ243461 1 


^icia faba nod-CCPl gene, exons 1 and 2 


0.017 
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|SEQID| 
NO 



ACCESSN 



DESCRIP 



P VALUE 



1367 XM 011052 



Homo sapiens 28kD interferon responsive protein (IFRG28), mRNA 



1368 AL390157 



Homo sapiens mRNA; cDNA DKFZp434D179 (from clone 
DKFZp434D179) 



1.6 



0.0005 



1369 I AL023822 iCaenor habditis elegans cosmid Y102A5 A, complete sequence | 0.046 

1370 U36918 IMesocricetus auratus mucin (MUC1) mRNA, complete cds IT 



1371 AF081786 



iHomo sapiens lysyl hydroxylase 1 gene, promoter region and partial 
cds 



0.18 



[Homo sapiens similar to hypothetical protein FLJ22639 (H. sapiens) 
1372 | XM_035682 |(LOC9 1047), mRNA 



8E-08 



1373 



Z54645 



Ksapiens CpG island DNA genomic Msel fragment, clone 14h7, 
[reverse read cpg!4h7.rtlb 



0.027 



1374 AB012135 



Schizosaccharomyces pombe gene for catalytic subunit (C5) of 
|proteasome, complete cds 



1.9 



1375 



L47352 



Brassica napus (clone Sta 39-4) arabinogalactan protein mRNA, 
[complete cds 



0.73 



1376 I AB036993 IHomo sapiens RHAG gene for Rh50 glycoprotein, partjaTcds 0.0000001 



1377 



U67535 



Methanococcus jannaschii section 77 of 150 of the complete genome 



0.03 



1378 I AF203637 [Myotis myotis microsatellite A2 sequence 



0.017 



1379 XM_004748 Homo sapiens hypothetical protein FLJ1 1 1 10 (FLJ1 1110), mRNA 1.1 



1380 AJ249744 Campylobacter jejuni cjaE gene for putative membrane protein CjaE 



1381 AL591033 



[Human DNA sequence from clone RP1 1-94L3 on chromosome 6, 
complete sequence [Homo sapiens] 



0.69 



1.7 



1382 I AJ242625 |Mus musculus Dmp-1 gene, exons 1-6 



0.71 



1383 I AB036851 I Ciona intestinalis mRNA for pellino, complete cds 



0.23 



[Homo sapiens transmembrane phosphatase with tensin homology 
1384 | XM_012972 [(TPTE), mRNA 



2E-09 



1385 AF175313 



Oncorhynchus mykiss cardiac sodium-calcium exchanger (NCX) 
|mRNA, complete cds 



0.47 



1386 I AF283669 I Homo sapiens kallikrein 14 (KLK 14) gene, complete cds 044~ 



1387 AY018560 



Oryza sativa microsatellite MRG0885 containing (AT)X18, genomic 
| sequence 



0.63 



1388 AK027272 [Homo sapiens cDNA FLJ14366 fis, clone HEMBA1001020 — tm9 



Homo sapiens roundabout (axon guidance receptor, Drosophila) 
1389 I XM_00305 2 homolog 1 (ROBOl), mRNA 



1390 AF208532 



I Homo sapiens fatty acid omega-hydroxylase CYP4A1 1 (CYP4A11) 
I gene, complete cds 



1E-49 



0.69 



1391 AF233326 



Mus musculus tumor necrosis factor receptor-associated factor (Traf2) 
| gene, exons 2, 3, and 4 



0.72 



1392 AE001413 



Plasmodium falciparum chromosome 2, section 50 of 73 of the 
| complete sequence 



0.15 



1394 



X54889 | Sea urchin developmentally regulated ectodermal gene 



0.023 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1395 


AF378868 


Arabidopsis thaliana AT5gl 1200/F2I1 1 90 mRNA, complete cds 


2.5 


1396 


AF328539 


Homo sapiens clone 12qtel_c64t3 sequence 


0.0003 


1397 


AE006450 


Lactococcus lactis subsp. lactis EL1403 section 212 of 218 of the 
complete genome 


0.47 


1398 


Z12840 


Oxuniculus mRNA for protein of unknown function 


2E-97 


1399 


U31165 


Rattus norvegicus SH3 domain binding protein (CR16) gene, exon 6 


1.9 


1400 


Y15005 


Bison bonasus mitochondrial cytB gene 


1.8 


1401 


AC091267 


Caenorhabditis elegans cosmid Y37B1 1A, complete sequence 


3.7 


1402 


AB055364 


Macaca fascicularis brain cDNA, clone:QflA-12522 


e-119 


1403 


NC_001610 


Didelphis virginiana mitochondrion, complete genome 


0.006 


- 

1404 


AF362372 


Dictyostelium discoideum histidine kinase DhkJ (dhkJ) gene, 
complete cds 


0.055 


1405 


AF027164 


Homo sapiens type II integral membrane protein (NKG2-E) gene, 
partial cds; and type II integral membrane protein (NKG2-F) gene, 
complete cds 


3.6 


1406 


M76376 


Human cysteine-rich protein (CRP) gene, exon 2 


0.027 


1407 


D31863 


Mouse Pig-a gene for GPI-anchor biosynthesis (PIG-A protein), exon 
6 and complete cds 


0.026 


1408 


XM031315 


Homo sapiens hypothetical protein FLJ14033 similar to hypoxia 
inducible factor 3, alpha subunit (FLJ14033), mRNA 


0.000007 


1409 


AYO 19369 


Oryza sativa microsatellite MRG1694 containing (AT)X32, genomic 
sequence 


0.0001 


1410 


AF067610 


Caenorhabditis elegans cosmid F41A4 


0.084 


1411 


AJ401391 


Drosophila melanogaster mRNA for PDGF/VEGF-like protein 
(CG7103 gene) 


0.18 


1412 


XM_008417 


Homo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0.0000001 


1413 


AF242195 


Homo sapiens KLK15 (KLK15) gene, complete cds, alternatively 
spliced 


e-160 


1414 


Y14077 


Bacillus subitlis 10.6 Kb chromosomal DNA: glyB-prsA region 


0.008 


1415 


AE006802 


Sulfolobus solfataricus section 161 of 272 of the complete genome 


0.057 


1416 


AY007149 


Homo sapiens clone CDABP0086 mRNA sequence 


0.092 


1417 


K02628 


Oxytricha nova (hypotrichous ciliate) ( clones LMiC2-(5,6,8)) 
micronuclear DNA, C2 gene (version 3), complete cds 


0.063 


1419 


X14260 


Xenopus tropicalis alpha-globin gene 


1.7 


1420 


AJ401391 


Drosophila melanogaster mRNA for PDGF/VEGF-like protein 
(CG7103 gene) 


0.15 


1421 


XM_032580 


Homo sapiens nebulette (NEBL), mRNA 


1.3 


1422 


XM_008060 


Homo sapiens solute carrier family 7 (cationic amino acid transporter, 
y+ system), member 5 (SLC7A5), mRNA 


2E-12 


1424 


AF035820 


Hordeum vulgare gibberellin action negative regulator SPY mRNA, 
complete cds 


2.1 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1425 


AL353672 


Human DNA sequence from clone RP1-241K1 on chromosome 6, 
complete sequence [Homo sapiens] 


3.8 


1426 


AF070565 


Homo sapiens clone 24425 mRNA sequence 


0.2 


1427 


AK026914 


Homo sapiens cDNA: FLJ23261 fis, clone COL05862 


0.009 


1428 


X14735 


Podospora anserina mtDNA for large subunit rRNA 


0.21 


1429 


XM 004081 


Homo sapiens aldehyde dehydrogenase 5 family, member Al 
(succinate-semialdehyde dehydrogenase) (ALDH5A1), mRNA 


1.4 


1430 


Y10687 


Sulfolobus acidocaldarius gene encoding purine NTPase 


0.019 


1431 


AF168678 


Drosophila melanogaster adenylyl cyclase 78C-S mRNA, complete cds 


0.13 


1432 


AC006666 


Caenorhabditis elegans cosmid H31G24, complete sequence 


0.014 


1433 


X02473 


Calf mRNA for delta subunit of muscle acetylcholine receptor 


1.1 


1434 


AF270148 


Staphylococcus epidermidis strain SRI clone step. 105 lc08 genomic 
sequence 


0.006 


1435 


XM 008084 


Homo sapiens hypothetical protein LOC55565 (LOC55565), mRNA 


3E-36 


1436 


AF007876 


Homo sapiens Na,K-ATPase beta 2 subunit gene, complete cds 


0.008 


1437 


AKO 16592 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933400F21, full insert sequence 


0.61 


1438 


AB023053 


Homo sapiens genomic DNA, chromosome 6p21.3, HLA class I 
region, clone: 53L9, complete sequence 


2E-10 


1439 


XM_035837 


Homo sapiens cAMP responsive element modulator (CREM), mRNA 


1.7 


1440 


Y12710 


Athaliana gene encoding shaggy-like kinase gamma 


0.068 


1441 


AF351620 


Homo sapiens lipocalin-1 interacting membrane receptor (LIMR) 
gene, complete cds 


0.028 


1442 


U67560 


Methanococcus jannaschii section 102 of 150 of the complete genome 


2.1 


1443 


NM_002251 


Homo sapiens potassium voltage-gated channel, delayed-rectifier, 
subfamily S, member 1 (KCNS1), mRNA 


1.9 


1444 


XM 027341 


Homo sapiens similar to retinoblastoma inhibiting gene 1 (M. 
musculus) (LOC89932), mRNA 


0.0008 


1446 


AF039373 


Arabidopsis thaliana ecotype RLD chromomethylase (CMT1) gene, 
complete cds 


1.9 


1447 


U29397 


Rattus norvegicus plasma membrane Ca2+ ATPase isoform 3 
(PMCA3) gene, 5' flanking region 


0.008 


1448 


XM_036536 


Homo sapiens hypothetical protein FLJ10402 (FLJ10402), mRNA 


4.1 


1449 


AK000677 


Homo sapiens cDNA FLJ20670 fis, clone KAIA4743 


1E-09 


1450 


AJ243538 


Glomus mosseae fox2 gene for Fox2 protein 


0.018 


1451 


AJ296103 


Staphylococcus aureus repNVH99 gene for replication protein and 
smr gene 


0.23 


1452 


AF084911 


Bombus ternarius cytochrome b gene, partial cds; mitochondrial gene 
for mitochondrial product 


1.4 


1453 


AC001080 


Homo sapiens (subclone 3_g7 from PAC H74) DNA sequence, 
complete sequence 


0.72 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1454 


AF213160 


Gallus gallus microsatellite C-MNT-7 sequence 


0.019 


1455 


AB056366 


Macaca fascicularis brain cDNA, clone:QflA-14203 


0.024 


1456 


AF129515 


Paramecium tetraurelia SEC7-related protein (SEC7) mRNA, 
complete cds 


0.055 


1457 


XM_037469 


Homo sapiens similar to spastic ataxia of Charlevoix-Saguenay 
(sacsin) (H. sapiens) (LOC91290), mRNA 


1.8 


1458 


AK005404 


Mus musculus adult female placenta cDNA, RIKEN full-length 
enriched library, clone: 1600009D24, full insert sequence 


4.8 


1459 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


4.2 


1460 


AF161253 


Dictyostelium discoideum protein phosphatase 4 catalytic subunit 
(pppC) mRNA, complete cds 


0.77 


1461 


X74912 


M.hominis lysS gene 


0.19 


1462 


AF027148 


Homo sapiens myogenic determining factor 3 (MYOD1) gene, 
complete cds 


0.45 


1463 


BC003261 


Mus musculus, serine/threonine kinase 5, clone MGC:5803 
IMAGE:3501444, mRNA, complete cds 


0.21 


1464 


AC079143 


Homo sapiens Xp BAG RP1 1-346D13 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


2 


1465 


Y07829 


Homo sapiens RFB30 gene for RING finger protein 


4.5 


1466 


BC006448 


Homo sapiens, clone IMAGE:3637085, mRNA 


1.9 


1467 


X05287 


Locusta migratoria mitochondrial gene for 16S rRNA 


0.19 


1468 


XM_052063 


Homo sapiens hypothetical protein DKFZp547A023 
(DKFZp547A023), mRNA 


0 


1469 


AF1 13952 


Campylobacter jejuni multidrug-efflux transporter gene, complete cds; 
and unknown genes 


0.066 


1470 


AE006680 


Sulfolobus solfataricus section 39 of 272 of the complete genome 


5.2 


1471 


AY045578 


Arabidopsis thaliana AT4g37000/C7A10_360 mRNA, complete cds 


2.2 


1472 


AF269209 


Takifugu rubripes clone FTCRAV4S1 T cell receptor V alpha chain 
gene, partial cds 


1.9 


1473 


XM_027952 


Homo sapiens dihydrofolate reductase (DHFR), mRNA 


1.3 


1474 


U80449 


Caenorhabditis elegans cosmid R10A10 


1.8 


1475 


X15634 


C. elegans core histone genes H4(his-10) and H3(his-9) 


0.025 


1476 


Z16938 


H. sapiens (D19S218) DNA segment containing (CA) repeat; clone 
AFM207wc3; single read 


0.02 


1477 


X96783 


Ksapiens Syt V gene (genomic and cDNA sequence) 


0.0007 


1478 


Y08924 


P.falciparum mRNA for AARP2 protein 


0.003 


1479 


XM_036181 


Homo sapiens RAN binding protein 2-like 1 (RANBP2L1), mRNA 


0.21 


1480 


AL390874 


Human DNA sequence from clone RP1 1-238K16 on chromosome 9, 
complete sequence [Homo sapiens] 


0.002 


1481 


AL133157 


S.pombe chromosome I cosmid clB2 


0.38 


1482 


U67598 


Methanococcus jannaschii section 140 of 150 of the complete genome 


0.37 
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SEQK 
NO 


) 

ACCESSN 




P VALUE 


1483 


AJ242525 


Chlamydomonas reinhardtii partial dhclO gene for 1 beta dynein 
heavy chain, exons 16-53 


5.8 


1484 


L19842 


odiioma sp. mitocnondnal NADH dehydrogenase, and cytochrome b 
genes, 3 • end, and transfer RN A-Ser gene 


0.001 


1485 


AJ239534 


^dius idinnians iKiNA-aenvea MNE element, clone D254UC12 


0.009 


1486 


AF044206 


nomo sapiens cyciooxygenase (COX-2) gene, promoter and exon 1 


0.2 


1487 


AB035880 


Chaitoregma tattakana mitochondrial genes for small subunit rRNA, 
uuN/\-vai, large suounit rKNA, partial and complete sequences 


0.23 


1488 


AE006205 


rasieurena multociaa FM70 section 172 of 204 of the complete 
genome 


0.26 


1489 




Homo sapiens hypothetical protein FLJ10775 (FLJ10775) mRNA 


6E-16 


1490 


NM 030714 


ivius museums aeitex 3 nomolog (Drosophila) (Dtx3), mRNA 


0.39 


1491 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
complete sequence 


0.001 


1492 




wius muscuius, Clone liVLALjii; 3 495762, mRNA 


6.4 


1493 


XM 017181 


Homo sapiens Hypothetical protein FLJ22474 (FLJ22474) mRNA 


0.0007 


1494 


AF198964 


Arabidopsis thaliana ATP sulfurylase gene, complete cds 


0.057 


1495 




i-iomo sapiens cdina. rU23128 lis, clone LNG08384 


0.0002 


1496 


AF307860 


Homo sapiens CYR61 protein (CYR61) gene, complete cds 


3.9 


1497 


AL591667 


Human DNA sequence from clone RP1 1-389N9 on chromosome 6, 
complete sequence [Homo sapiens] 


0.004 


1498 


XM 002800 


nomo sapiens Kinesin-iiKe protein 2 (nklp2), mRNA 


2E-53 


1499 


M90Q70 


/\uxpiex nummmana una sequence 


4.8 


1500 


D14610 


Araoiaopsis tnanana mRNA tor acetyl-CoA carboxylase, complete cds 


0.22 


1501 


AC004498 


Homo sapiens chromosome 5, PI clone 1209C1 (LBNL HI 04), 
complete sequence 


3E-13 


1502 


AF104350 


Dicryostelium discoideum prespore-specific protein (pspC) gene, 
partial cds; and unknown gene 


0.017 


1503 


XM_028059 


Homo sapiens KIAA1681 protein (KIAA1681), mRNA 


0.35 


1504 


U83462 


lvius muscuius serotomn JN-acetyltransterase (AANAT) gene, 
complete cds 


0.017 


1505 


AD001534 1 


Durrciid dizeiii ^isolate ujviiji ), sequence tra^ment 


0.7 


1506 


] 
< 
< 
1 

AF092379 j 


Elivulus haraldsiolii tRNA-Ile, tRNA-Gln, and tRNA-Met genes, 

^omDlete seouence* NAOH HphvHrr*<y(*n!ic#» cnHnnit tt nAno ^Amniato 
^vnipiwiv ov^u^uw, m~LisLL ucii^tuugciiaac aUDunii ii gene, complete 

:ds; tRNA-Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr 

genes, complete sequence; and cytochrome c oxidase subunit I gene, 

>a> 


0.002 


1507 


] 

AF397158 j 


lomo sapiens clone 1 1 pur alpha-associated ribosomal RNA gene, 
martial sequence 


1E-39 


1509 


AF370352 c 


\rabidopsis thaliana putative ribosomal protein (AT4gl6720) mRNA, 
;omplete cds 


1.2 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 






P VALUE 


1510 


ArZoOZ4U 


fflV-1 isolate 97CDKFE4 from Republic of the Congo gag protein 
lg**&/ ana poi pruiciii ^puij gcuco, paiu<ii uii 


1.2 


1511 


L29256 


Human dinucleotide repeat polymorphism at the DXS1690 locus 


5E-50 


1512 


A "E") 1/11 1 ^ 




2.1 


1513 


XM 003799 


Homo sapiens protocadherin beta 10 (PCDHB10), mRNA 


0.57 


1514 


AF387030 


xiomo sapiens i-jina puiynicraoc cpaiiuu bu.uiu.111 ±j ) gwic, 
exons 14 and 15 


0.72 


1515 


\A0ift^A 

JVLZ /UD*f 


ITaivia conipnc ocnaraainf^ cvntliptjicf* /T'Rl W CTPTIP PVOTI<» 1 find 
flOIHO SiipiCllo aopdJ. ctgllic ajiiuit-ia.afc yiuiiy £^iiv<, wauiio a -j aim 

pruiuuiti lcgiuu 


0.2 


1516 


yivjt 

AIVL UDIojZ 


Homo sapiens mesoderm development candidate 2 (MESDC2), 

mPTsJ A 
IILiVlN/\ 


5.9 


1517' 


VTV/t ft^lftftA 


Homo sapiens heterogeneous nuclear protein similar to rat helix 


0.00009 


1518 


XIVL U4o/jU 


Homo sapiens hypothetical gene supported by L27560 (LOC92701), 


0.53 


1519 


Ar lJol fy 


JV1US musCUiUS apenmue syimuibc gcuc, ycuuai tus 


0.063 


1520 


U19360 


uncnocerca voivuius nuciear nonnoiie rcccpioi-iiR-c pcpuuc, ^vmijjidt 
cds 


1.6 


1521 


AY039907 


Arabidopsis thaliana AT5g41810/K16L22_9 mRNA, complete cds 


5.9 


1522 


Uooj / J 


r^iasmOQlUm iaiCiparUIIl UIN/\ lur lldvupiuicill auuuiui ui aui/tiuait 

uenyorogenase, cuiupicic wcia 


0.009 


1523 




Acute oee paralysis virus*, L#uiiipicic genuine 


0.07 


1524 




IVLaCaca iaSClCuiana lcnillll UcUl llLl\J.N.rY, ^uiiUJi^it; Wis 


0.3 


1525 


Arz /UU4o 


Staphylococcus epidermidis strain SRI clone step. 1043 h07 genomic 
sequence 


0.76 


1526 


AB047239 


otapnyiococcus aureus l-jin/\, compicic s»uul-luic ui ^aaaciie 
chromosome(SCC)-like element, strain:ATCC25923 


0.72 


1527 


T 11Q£LQA 

U 3yt>o4 


Mir/vmlqcmQ rrpnitalium cprti nr\ £% rvf SI nf thf* rnniTil ptp pp.nnirifi 
jVlyCOpiaoIIla. gClllliUlUlll aCwLlV/ii \j Ui a ui tii^ wv/iiipiwiv/ t>^ llx -' xlAV ' 


0.21 


1528 


XM 030430 


Homo sapiens hypothetical protein (FLJ1 1 127), mRNA 


2 


1529 


D38148 


Rotavirus strain A5-13 NSP1 gene, complete cds 


0.68 


1530 


AJsAJZUo40 


ivyfuc miter-nine aHnlt r*»titi!i rHWA T?TV"1hXT fiill-lpnPth enriched 
JVLUS mUSCUlUS auuil iCllIld. CL/iN AV, XVLlVJ_>iN lull icugui v^iii iv^iiv^u. 

liHrorv rlnn*»* AQ1O01 lPOfi fiill insert sefliience 


0.2 


1531 


AF130049 


Homo sapiens clone FLB341 1 PRO0852 mRNA, complete cds 


0.067 


1532 


AT7"310 1 Aft 


Ayfnc mii«:niilii<: r-Fo<; (TnO ffene 5' flankine and oromoter reffions 


1.8 


1533 


AF231842 


Jasminum odoratissimum tRNA-Leu (trnL) gene, partial intron and 3* 
exon, and txnL-trnF spacer region; chloroplast gene for chloroplast 
product 


1.3 


1534 


NC_001560 


Vesicular stomatitis virus, complete genome 


0.35 


1535 


AF213897 


Homo sapiens cAMP responsive element modulator (CREM) gene, 
exon C 


0.24 


1536 


XM_049251 


Homo sapiens KIAA0903 protein (KIAA0903), mRNA 


0.22 


1537 


AK027713 


Homo sapiens cDNA FLJ14807 fis, clone NT2RP4001760, weakly 
similar to PUTATIVE RHO/RAC GUANINE NUCLEOTIDE 
EXCHANGE FACTOR 


1E-36 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



|SEQID| 
NO 



ACCESSN 



DESCRIP 



P VALUE 



1538 
1539 



AF268091 



Chloris gayana phosphoenolpyruvate carboxylase (ppc) mRNA, 
complete cds 



6.1 



AC023351 



Homo sapiens clone RP1 1-637J20, complete sequence 



1E-49 



1540 NC 002161 



Bovine parainfluenza virus 3, complete genome 



6.4 



1541 AE001092 



Archaeoglobus fulgidus section 15 of 172 of the complete genome 



1542 
1543 



AF044202 



Drosophila melanogaster glutamate receptor DGluRIIB mRNA, 
complete cds 



0.15 



0.008 



Z72767 



S.cerevisiae chromosome VII reading frame ORF YGL245w 



1.8 



1544 AJ277899 



Nicotiana tabacum drepp4 gene, exons 1-4 



0.24 



1545 XM 005131 



Homo sapiens chromatin accessibility complex 1 (CHRAC1), mRNA 



0.71 



1546 AF270048 



Staphylococcus epidermidis strain SRI clone step. 1043 h07 genomic 
sequence 



0.67 



1547 AL1 12743 



Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 



1548 



S68847 



YAP2=AP-l-like stress-induced transcriptional activator 
Saccharomyces cerevisiae, W303, Genomic, 2181 nt] 



1549 AF176634 



1551 



X79988 



Reldia sp. Amaya and Smith 578 NADH dehydrogenase subunit F 
;ndhF) gene, partial cds; chloroplast gene for chloroplast product 



H. sapiens HIP gene, exon 1 



1552 AK015054 



Mus musculus adult male testis cDNA, REKEN full-length enriched 
library, clone:4930402M22, full insert sequence 



0.076 



0.21 



0.54 



0.21 



2.2 



1553 



U07731 



Human quinone oxidoreductase2 (NQ02) gene, introns 1 and 2 and 
exon 2 



0.42 



1554 AE001370 



1555 AF222996 



Plasmodium falciparum chromosome 2, section 7 of 73 of the 
complete sequence 



Danio rerio nocA-like Zn-finger protein (nlz) mRNA, complete cds 



0.02 



0.59 



1556 AJ000080 



Trypanosoma brucei hsplOO and gpi-plc genes 



0.00004 



1557 



D30745 



Xenopus laevis MRP RNA gene 



1558 



X17256 



Yeast THR4 gene for threonine synthase (EC 4.2.99.2) 



0.64 



1559 AF203318 



HIV-1 clone LTNP1-85V3 from USA envelope glycoprotein V3 
region (env) gene, partial cds 



1.1 



1561 XM 027574 



Homo sapiens dystrophia myotonica-protein kinase (DMPK), mRNA 



1.8 



1562 BC006953 



Mus musculus, clone MGC:6936 IMAGE:28 11899, mRNA, complete 
cds 



2.2 



1563 L31848 



Homo sapiens serine/threonine kinase receptor 2 (SKR2) gene, 3 
alternative splices, 3* ends 



2E-10 



1564 AF309471 



Homo sapiens SQX6 (SQX6) gene, exon 10 



0.026 



1565 K02782 



Mouse complement component C3 mRNA, alpha and beta subunits, 
complete cds 



1567 XM 003204 



Homo sapiens leucine rich repeat (in FLU) interacting protein 2 
(LRRFIP2), mRNA 



0.94 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1568 


Z29373 


H saoiens ffene for neural cell adhesion mr»lp>^nlf» t 1 




1569 


AF138876 


Bacillus thuringiensis isolate M15 insertion sequence IS23 IN 


0.48 


1570 


AF043087 


Hordeiim V1 1 1 P"a re Hehvdri H 1 ^Hhnl"\ cw»nf» r>r*mTVI*»t*» r*Ae> 
iiviuviuu viugcub Uwiiyuxxxi x 1 j gCIlC, CUIIipicLc CQS 


z.l 


1571 


AE002293 


Chlamvdia miiridarnrn septinn 9 S rvFR*? rS \\\r± mmnioto (ronAtna 


U.57 


1572 


ABO 17043 


i\auua vcgiout* gene lor nepaiocyie nuciear iactor o gamma, partial 
cds 


J. 1 


1573 


XM_038951 


Homo saoiens hvDothetical nrotein Mfin /Mnn s«7*\ mRxiA 




1574 


AF244979 


A/fiiC tTlllQPllllIC CTSlIfP'ti T>- 1 1 ml? XT A i/Qriont K ^nmnlAta s^Ao 

iviua iiiuo^LiiLio gciicvviiii 1 1 iiuviN/\ ? vdi lciiii o, compieie CQS, 
alternatively spliced 


0.24 


1575 


XM 006069 


Homo saniens wee 1 -f fS mmhp^ homnlno r\X/FP1\ mRMA 


at? in 


1576 


AK025037 


Homo sapiens cDNA: FLJ21384 fis, clone COL03354 


1.7 


1577 


Z71443 


S cerevisiae chromosome XTV read in a fmmp nUF vmt i fHn 

wi w t idi.ttv vm viiLvuviuv .^vj. v l vuUlllg llalllv % /rv I x INijlv) / \s 


a Hi j. 

U.VJ l^f 


1578 


AF263518 


Arabidopsis thaliana protein synthesis initiation factor 4G (EIF4G) 
eerie comolete cds 




1579 


AE000680 


Aniiifex aeoliciis section 1 1 nf lOQ nf th*» mmnlptp apnnmp 


1 .z 


1580 


AE001404 


Plasmodium falciparum chromosome 2, section 41 of 73 of the 
comolete seauence 

Wlill/lviv JVUUvllvv 


1 .J 


1581 


AJ390510 


Candida albicans partial rriRNA for serine/threonine kinase (sha3 


fl 

v. io 


1582 


AC004173 


Homo saoiens clone UWGC*v23x01 1 from 6n91 comnlete cpnupnrp 


n on 1 ? 


1583 


Z71687 


S cerevisiae chromosome XTV reading frame ORF YTMR079w 


Z. 1 


1585 


X95065 


H Saoiens di nucleotide re neat (CiT} frnm rhrnmncnmp 1 "Jn fllQhkii\ 

ll..iXl|/lVl[0 UlllUVlVyUUUv IwpVyCll ^VJ 1 J XXUX1X will UlllVJoUllie 1JU *XXf>*U|JJ 


n i s 

U.lO 


1586 


Z24310 


j:x. aaxyxwila V J - / lvoJyj^ lyiNrt. aCgllieiLl wUIUailllllg J repeal, ClOIlC 

AFM3 1 lyb 1 ; single read 


0.002 


1587 


Y18476 


Trir*rtfYnlivtr*n nihnim mitrvr*V>r\nHri q! r*irtV\ «r**rn^ onH XT A T"\T-J 1 t/\ 
a -iiwiujiyiiy iuii luuiuill liiitUwiiLrllUllal wjrlU 1 gCIie dllu IN/vL/XTl lO 

NADH5 genes 


6.9 


1588 


Z47046 


Human cosmid OLL2C9 from Xn28 


n onoo7 


1589 


Z71687 


S cerevisiae chromosome XTV reading framp ORF VXTROT^w 

•JiVvivvijiaw win. w-iiiv/ouiiiw yvi v xwclUiiiK Ala-lllC V_/X\_l x 1 >t XvV/ / j£W 




1590 


XM_005485 


Homo sapiens KIAA0414 protein (KIAA0414), mRNA 


0.002 


1591 


NM_024364 


Rattus norvefficiis hairless (\xf\ mRNA 


u.zz 


1593 


AF136150 


Caenorhabditis elegans homogentisate 1,2-dioxygenase (hgo) gene, 
comolete cds 


1 A 


1594 


XM_033896 


Homo sapiens insulin-like growth factor 2 receptor (IGF2R), mRNA 


1.4 


1595 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.008 


1596 


BC007460 


Homo sapiens, acid phosphatase, prostate, clone MGC: 12236 
IMAGE:3951204, mRNA, complete cds 


0.029 


1597 


NM_024201 


Mus musculus RDCEN cDNA 061001 1N22 gene (061001 lN22Rik), 
mRNA 


5.3 


1598 


U88166 


Caenorhabditis elegans cosmid M0 1 A 12 


0.61 


1599 


AE001394 


Plasmodium falciparum chromosome 2, section 31 of 73 of the 
complete sequence 


2 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1600 


AJ306417 


Corynebacterium glutamicum partial nrdF gene for ribonucleotide 
reductase beta-subunit and ctaD gene for cytochrome aa3 oxidase 
subunit I 


1.9 


1601 


AF287270 


Homo sapiens mucolipin (MCOLN1) gene, complete cds 


6 


1602 


AE002207 


Chlamydophila pneumoniae AR39, section 37 of 94 of the complete 
genome 


0.083 


1603 


AK003449 


Mus museums 18 days embryo cDNA, RUCEN full-length enriched 
library, clone: 1 1 10004P2 1, full insert sequence 


0.2 


1604 


AB008103 


Arabidopsis thaliana AtERF-1 mRNA for ethylene responsive element 
binding factor 1, complete cds 


0.79 


1605 


AE006406 


Lactococcus lactis subsp. lactis IL1403 section 168 of 218 of the 
complete genome 


0.18 


1606 


BC008122 


Homo sapiens, clone MGC: 18053 IMAGE:4 148889, mRNA, 
complete cds 


1.5 


1607 


D28598 


Chicken mRNA for c-maf proto-oncogene product c-Maf long form, 
complete cds 


0.44 


1608 


X97314 


Msativa mRNA for cdc2 kinase homologue, cdc2MsC 


1.5 


1609 


AP001407 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:10C6, 
LB7T-ERG region, complete sequence 


LI 


1611 


XM_0 17891 


Homo sapiens similar to heterogeneous nuclear ribonucleoprotein L 
(H. sapiens) (LOC91538), mRNA 


1.4 


1612 


U48562 


Corrigiola littoralis ORF2280 gene homolog, chloroplast gene 
encoding chloroplast protein, partial cds 


0.074 


1613 


XM_036274 


Homo sapiens similar to tumor protein p53-binding protein (H. 
sapiens) (LOC65591), mRNA 


2 


1614 


AJ271039 


Plasmodium yoelii yoelii partial mrk gene for MO!5-related kinase 


0.001 


1615 


ABO 16467 


Datisca cannabina chloroplast gene for maturase K, partial cds 


0.25 


1616 


AF027807 


Homo sapiens beta-casein (CSN2) gene, complete cds 


0.074 


1617 


AK002822 


Mus musculus adult male kidney cDNA, RDCEN full-length enriched 
library, clone:0610039A20, full insert sequence 


0.55 


1618 


U67580 


Methanococcus jannaschii section 122 of 150 of the complete genome 


0.032 


1619 


U89140 


Mus musculus aldose reductase gene, promoter region and exon 1 


0.48 


1620 


AE002799 


Drosophila melanogaster genomic scaffold 142000013385566, 
complete sequence 


0.17 


1621 


L14855 


Bos taurus neurexin I-alpha mRNA, complete cds 


0.48 


1622 


AF379854 


Cloning vector p VLH/hsp, complete sequence 


2 


1623 


Z11883 


S.tuberosum UlsnRNA variant genes Ul-1, Ul-2, Ul-3, Ul-4, Ul-5 
andUl-6 


0.54 


1624 


AJ294727 


Mus musculus ILTIFa gene for IL-TIF alpha protein (IL-22), exons la 
5 


0.011 


1625 


XM_043073 


Homo sapiens hypothetical protein DKFZp434A1319 
(DKFZP434A1319), mRNA 


1.9 
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Table 3A Nearest Neighbor (BlastN vs. GenbanJrt 


SEQIE 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1626 


M84732 


Plasmodium yoelii sporozoite surface protein 2 gene , complete cds 


0.007 


1627 


M83304 


G.hirsutum late embryogenesis-abundant protein 2-D (LeaA2-D) 
mRNA, complete cds 


9.8 


1628 


t mil e 

U32725 


Haemophilus influenzae Rd section 40 of 163 of the complete genome 


0.14 


1629 


A "Ci occon 

Ar 185589 


Homo sapiens cytochrome P450 3 A4 (CYP3A4) gene, promoter 
region 


0.027 


1630 




Human DNA sequence from clone RPl 1-389N9 on chromosome 6, 
complete sequence [Homo sapiens] 


0.15 


1631 


A DACC5 m 


Oryza sativa CHS gene for chalcone synthase, complete cds 


0.16 


1632 


Ml 1043 


Mouse DNA fragment that hybridizes to HSV-l Smal A fragment 


1.5 


1633 


A "C 1 A C 41 C\C\ 

Ar 145699 


Staphylococcus aureus putative membrane protein MprF (mprF) gene, 
complete cds; and unknown gene 


1.5 


1634 


AF088902 


Mus museums SLITl protein (Slitl) mRNA, partial cds 


0.5 


1635 


Afc>048882 


Macaca fascicularis brain cDNA, clone:QnpA-l6525 


0.21 


1636 


X98369 


M.musculus mRNA for SOX15 protein 


0.24 


1637 


AF131767 


Homo sapiens clone 24900 mRNA sequence 


0.25 


1638 


AT 1 'X'HZLE*^ 

AL1 17657 


Homo sapiens mRNA; cDNA DKFZp586Fl924 (from clone 
DKFZp586F 1 924) 


1.9 


1639 


AB042973 


Nicotiana tabacum wizz gene, 5' flanking region, partial cds 


0.072 


1640 


AJ272U83 


Staphylococcus aureus empbp gene for extracellular matrix and 
plasma binding protein, strain SA 1 13 


0.008 


1641 


BC009423 


Homo sapiens, clone MGC:15151 IMAGE: 3 53 8202, mRNA, 
complete cds 


1.9 


1642 


AF1 14905 


Saccharomyces sp. CID1 ATP synthase subunit 8 (ATP8) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.18 


1643 


AF270417 


Staphylococcus epidermidis strain SRI clone step.4052d03a genomic 
sequence 


0.063 


1644 


X91340 


G.hybrida chs-like gene 


0.51 


1645 


AF097026 


Homo sapiens autosomal highly conserved protein (AHCP) gene, 
complete cds 


0.008 


1646 


AM UzoolO 


Homo sapiens 67292 (KIAA1755), mRNA 


5.5 


1647 


AJL049339 


Homo sapiens mRNA; cDNA DKFZp564P046 (from clone 
DKFZp564P046) 


2.1 


1648 


AE001398 


Plasmodium falciparum chromosome 2, section 35 of 73 of the 

comnlete ^£*nnpnpp 


0.13 


1649 


M55171 ] 


Mouse opsin (MOPS) gene, complete cds 


0.52 


1650 


AB008103 1 


Arabidopsis thaliana AtERF-1 mRNA for ethylene responsive element 
binding factor 1, complete cds 


0.66 


1651 


] 

AL589705 ( 


Human DNA sequence from clone RPl 1-64M7 on chromosome 6, 
complete sequence [Homo sapiens] 


0.5 


1652 


X79559 


r.trypanoides (Rl) 18S rRNA gene 


0.069 


1653 


XM 048416 1 


tao sapiens PRO0478 protein (PRO0478), mRNA 


0.0001 
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SEQIE 
NO 


> 

ACCESSN 




P VALUE 


1654 


AF082761 


i/aajuiua itia^uictLua ibuidic ViJvi.ui mitocnonctriax i-/-loop, partial 
sequence 


0.063 


1655 


AF282568 


Homo saoiens clone 20nte1 rl Sht7 c*vm#»«/v* 


4E-39 


1656 


AB027617 


Pyrus pyrifoha gene for UDP-glucose pyrophosphorylase, complete 
cds 


0.19 


1657 


AK023864 


Homo saoiens cDNA FL 1 1 380? fis Hnnp thvooi nnn i «a 


0.2 


1658 


NM 009905 


Mus musculus CDC-like kinase (Clk), mRNA 


1.7 


1659 


AL023846 


Caenorhabditis elej?ans f^osmirl if mmniptA romiQ«^ Q 
wv * %, ' xiv ' A *»c*»^vt-iui.ij viwgcuio uuaxiuu x ~j z.i-> x. i v^, woinpieie sequence 


2.3 


1660 


AF334780 


Homo saniens nrp <yn a n r*v- i n H 1 1 r*f»H trmwtli inV»iKi+/-v»- Ayj -30 
xivuuu aapi&iia picgiituic-j-iiiuuLeu growin lnmoitor vJivLrjo gene, 

partial cds 


0.73 


1661 


XM 042066 


iiuiiiu oapiuiia iiiiiugcii-ticuvdLcu piuiein Kinase Kinase Kinase 1 
(MAP3K1), mRNA 


1.9 


1662. 


XM_034872 


Homo saoiens KIAA0202 nrotein (JC1A ACion*)} mPXTA 


ft 


1663 


AC002252 


Homo sapiens (subclone l_g7 from BAC H76) DNA sequence, 
complete sequence 


"2T7 1 A 


1664 


AK023113 


Homo saoiens cDNA F'LJ H05 1 fis Hrmp nn 1 ±A1 


21i-z U 


1665 


BC003552 


Homo sanien*; ealnpvin Hnnp A>1f~J/" , *l 10/; TA>f Af^n>'3^/K'5Qo m n\TA 

xxuiiivr aapi^iia, wiiiiCAxxi, ^.luilc JV1VJ\_^. 1 1VO llVLAOxi. jD4uJoy, ITlKJN A 

complete cds 


ft ft1£. 


1666 


AF097142 


XaoinaSDlS SO 12S small SiiHiinit rihncnm'il D XT A rrono now-vnl 
i aLfniaopis ap. x^.o aiiiaii 0UUUIU.L IlUUbUIIuLl JvtNA gene, paiTiai 

sequence; mitochondrial gene for mitochondrial product 


1.5 


1667 


AL117103 


Botrvtis cinerea strain TJ. pDNT A 1 ihrartr iin^^r s+s\nAi+innf 

uuu^ua uiiicic<i a Li din i*t L/Jl/in/\ iioraiy unaer conaitions 01 mtrogen 
deprivation 


£L 1 

o. 1 


1668 


NM_013825 


Mus musculus lymphocyte antigen 75 (Ly75), mRNA 


0.74 


1669 


NM_007043 


Homo sapiens HIV-1 rev binding protein 2 (HRB2), mRNA 


7E-22 


1670 


Z27359 


Cj hirSUtlirn ( AD 5 2 ^ COnia-lilcP rpvprc** trancrnntacp (ronmantc 


U.24 


1671 


AK021535 


Homo saoiens cDNA FL 1 1 1 471 fis Hnnp wftAvTR a i c\(\ i n i o 


ft 1 c 

0.25 


1672 


Y00145 


Oicrvnstpliiim Hi ^rrnHfiim TTTiP crln/v-ico mrmntinmiinnriAnA a-a**** 

x/ivijrui)ibuuiu uiawuiucuiii ult glucose pyropnospnoiyiase gene 
(UDPGP; EC 2.7.7.9) 


ft n 
U.22 


1673 


M63291 


Drosophila silvestris alcohol dehydrogenase gene, complete cds 


0.028 


1674 


AF100401 


Influenza B virus B/Guangdong/5/94 segment 8 nonstructural protein 

fNSl^ and nonstructural nrntpin /'AIQO^ apnpc rnmntot^ r*Ae> 


ft o 

0.2 


1675 


U80449 


Caenorhabditis elegans cosmid R10A10 


1.6 


1676 


X15634 


C. elegans core histone penes H4ffiis-in , i anH m/iiic-Qi 


U.U22 


1677 


XM 050366 


Homo sapiens zinc finger protein, X-linked (ZFX), mRNA 


0.23 


1678 


AE007046 


Mycobacterium fiihemilnsi*: PTiP I^SI confirm iio A f ocn svT+ii A 

complete genome 


0.062 


1679 


XM_008207 


Homo sapiens glucose-6-phosphatase, catalytic (glycogen storage 
disease type I, von Gierke disease) (G6PC), mRNA 


4E-18 


1680 


] 

AB049118 < 


Hemicentrotus pulcherrimus mRNA for transcription factor HpLiml, 
complete cds 


0.21 


1681 


XM_002421 3 


Homo sapiens hypothetical protein FLJ20254 (FLJ20254), mRNA 


4.1 


1682 


X96783 ] 


FI. sapiens Syt V gene (genomic and cDNA sequence) 


0.000004 
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1683 


U79732 


Plasmodium berghei extrachromosomal plastid PB-2, tRNA-Pro, 
tRNA-Glu, tRNA-Lys, tRNA-Asp, tRNA-Ser, tRNA-Tyr, tRNA-Met, 
tRNA-Leu, tRNA-Cys, and tRNA-His genes, complete sequence, rps4 
gene, complete cds, tRNA-Thr gene, complete sequence, and large 
subu> 


0.001 


1684 


NC_002619 


Pteropus scapulatus mitochondrion, complete genome 


6.3 


1685 


LI 6679 


Caenorhabditis elegans cosmid K07D8, complete sequence 


1.8 


1686 


Y08924 


P.falciparum rnRNA for AARP2 protein 


0.003 


1687 


U32760 


Haemophilus influenzae Rd section 75 of 163 of the complete genome 


2.2 


1688 


AF2 98624 


Dicryostelium discoideum chromosome 2 repeat region 


0.009 


1689 


XM031155 


Homo sapiens RAS protein activator like 2 (RASAL2), mRNA 


0.33 


1690 


AE006899 


Sulfolobus solfataricus section 258 of 272 of the complete genome 


5 


1691 


U67556 


Methanococcus jannaschii section 98 of 150 of the complete genome 


0.42 


1692 


AKO 12631 


Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810001G20, full insert sequence 


0.064 


1693 


BC007399 


Homo sapiens, clone MGC: 16308 IMAGE: 3 836 116, mRNA, 
complete cds 


0.022 


1694 


AL389897 


Human DNA sequence from clone RP1 1-1 10C10 on chromosome 20. 
Contains GSSs, complete sequence [Homo sapiens] 


1E-26 


1695 


Z62421 


Rsapiens CpG island DNA genomic Msel fragment, clone 68h2, 
forward read cpg68h2.ftla 


4E-70 


1696 


BCO 10535 


Homo sapiens, clone IMAGE:3456494, mRNA 


0.002 


1697 


Z68319 


Caenorhabditis elegans cosmid T23G7, complete sequence 


0.18 


1698 


U48363 


Mus musculus transcriptional activator alpha-NAC (Naca) gene, 
complete cds 


0.058 


1699 


AF135430 


Ustilago nuda 18S ribosomal RNA gene, partial sequence; internal 
transcribed spacer 1, 5.8S ribosomal RNA gene and internal 
transcribed spacer 2, complete sequence; and 28S ribosomal RNA 
gene, partial sequence 


1.4 


1701 


XM_007461 


Homo sapiens similar to enhancer of rudimentary (Drosophila) 
homolog (H. sapiens) (LOC65719), mRNA 


0.18 


1702 


NM 025083 


Homo sapiens hypothetical protein FLJ21128 (FLJ21128), mRNA 


e-124 


1703 


XM 050592 


Homo sapiens uncharacterized hematopoietic stem/progenitor cells 

^lUlClli IVLL^OUJ^ ^IVLL/oUJZJ, IIUVLNA 


0.068 


1704 


AF277167 


Homo sapiens Makorin RING zinc-finger protein 2 (MKRN2) gene, 
exon 4 


2 


1705 


] 

AE001433 i 


Plasmodium falciparum chromosome 2, section 70 of 73 of the 
complete sequence 


0.67 


1706 


AF075003 ] 


Homo sapiens full length insert cDNA YH97B03 


0.001 


1707 


] 

NM 011378 i 


VIus musculus transcriptional regulator, SIN3A (yeast) (Sin3a), 
mRNA 


0.62 
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SEQIE 
NO 


> 

ACCESSN 




P VALUE 


1708 


AE001321 


^iuaiii_yui<i udciiunidus secuon 4© 01 0 / 01 trie complete genome 


0.026 


1709 


XM 046388 


Homo sapiens similar to elongation of very long chain fatty acids 
jrcm/mo^, bUK^/nioJ, yeast)-like 1 (H. sapiens) (LOC92652), 
mRNA 


6.1 


1710 


S55685 


MOR6.5=ouabain resistance gene {repeat sequence} [mice, Genomic, 
6395 nt] 


5.7 


1711 


AB055378 


Macaca fascicularis brain cDNA, clone:QflA-13294 


0.22 


1712 


XM 043597 


nuiiiu ^ctpiciib ceniaunn-aipna z protein (rioAz/ziyS), mRNA 


1E-20 


1713 


AE007426 


ouc;jnuiAAA,u& piicuinonide section iuv or 1V4 01 tne complete genome 


0.32 


1714 


AK016519 


mus muscuius aauii mate testis cuna, KIlvbN rull-length enriched 
nuicuy, isHjiK.tyjz,** i iuuo, iuii insert sequence 


e-129 


1715 


AL122033 


S.pombe chromosome II cosmid cl921 


0.53 


1716 


AK012960 


iviufc uiuscuius iu, 1 1 aays emoryo cdjna, kik±in lull-length 
enriched library, cione:2810401G03, full insert sequence 


0.075 


1717 


AL590123 


nuiiiaii i^iN/\ sequence rrom clone kfi 1-4 /ij 14 on chromosome X, 
complete sequence [Homo sapiens] 


0.19 


1718 


D17343 


rvaiiuo liVJi vcgiUUo \yy LUvlLIUlIlC ir'tDK) ZX31 J gene, eXOn 1 


0.66 


1719 


XM 028703 


Homo sapiens KIAA0889 protein (KIAA0889), mRNA 


0.00004 


1720 


AB060865 


ivict^dud xctadcuictns orain cl/in/\ cione. v^trA-iiyj j, tull insert 
sequence 


2.3 


1721 


AL133397 


nuinaxi l/in/\ sequence rrom cione Kri-ziJJip on enromosome 
^iqii.^i-iz.z, cuiiipieie sequence [xiomo sapiens] 


0.083 


1722 


X58913 


Mitochondrion Drosophila eugracilis ND2 and COI genes (partial) 

ariH (TPTIPQ fWr tUXTA -Ttx* fTJ XT A Txrr rtrtA +T>XT A /"V, T „ 

cuiu gcuca iui LrviN/\- 1 ip 5 uvin /\- 1 yr, ano lKJN A-LJys 


0.074 


1723 


AC002359 


nonio sapiens sLpzz cosmia uzjytjj (rrom Liawrence Livermore X 
library) complete sequence 


0.0008 


1724 


AE006322 


LduiuLu^cus laciis suosp. lactis UL14UJ section 84 01 218 of the 


1.8 


1725 


Z71290 


S.cerevisiae chromosome XIV reading frame ORF YNL014w 


0.67 


1726 


XM 005723 


rauiiiu sdpiciib iiypouicllCdl protein risXJZojy (rrsXJZTyjy), mRNA 


0.014 


1727 


AF123488 


South River virus polyprotein mRNA, complete cds 


0.57 


1728 


D78176 


iviouse i/JNA ior pnmary transcript 01 atrial natriuretic clearance 


0.061 


1730 


XM 006871 


numu adpiciib euidnoiamine Kinase (iiJ^-il), rriKJN A 


4.1 


1731 


AL078640 


nuinaii i^ina sequence irom cione jjoro on enromosome 22. 
Contains an EST, complete sequence [Homo sapiens] 


0.07 


1732 


AC092030 


Homo sapiens clone RP1 1-636112, complete sequence 


0.029 


1733 


AK003349 


Mus muscuius 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1110003F05, full insert sequence 


0.16 


1734 


AF131767 ] 


Homo sapiens clone 24900 mRNA sequence 


0.2 


1735 


Z74246 


S.cerevisiae chromosome IV reading frame ORF YDL198c 


0.023 


1736 


NM_021401 ] 


Mus muscuius secreted and transmembrane 1 (Sectml), mRNA 


0.069 


1737 


] 

AF213931 < 


Buchnera aphidicola DNA polymerase III beta subunit (dnaN) gene, 
:omplete cds 


0.68 
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ACCESSN 



DESCRIP 



1738 
1739 



AF288892 
X99719 



Homo sapiens clone 20r DH4-17/BCL-1 gene fusion reciprocal 
[breakpoint sequence 



1740 
1741 



XM_041866 
AY010113 



[S.enterica hsdM, hsdS & hsdR genes 



Homo sapiens KIAA1468 protein (KIAA1468), mRNA 



Homo sapiens unknown mRNA sequence 



1742 
1743 



Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
AK009187 library, clone: 23 10006124, full insert sequence 



AF130074 jHomo sapiens clone FLB9348 PRQ2523 mRNA, completely 



1744 AJ249165 [Plasmodium falciparum gc-beta gene, exons 1-13 



1745 
1746 



AKO 14565 



Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
library, clone:4632411B12, full insert sequence 



Ml 5058 Vaccinia virus (strain WR) Hindlll D fragment DNA, complete 



1747 
1748 



Z16416 



■t 1 - ' — ? -~ ~ — j"*— ■ 

M.pulmonis DNA for insertion sequence IS1 138 encoding putative 
|transposase 



XMJ)47638 Homo sapiens Fzrl protein (FZR1), mRNA 



Anabaena sp. phycobilisome core component (apcF) gene, complete 
1750 | U21853 cds, and glutamine synthetase (glnA) gene, partial cds 



1751 AF169454 Neisseria meningitidis strain Z2491 clone Cm045 unknown sequence 



1752 AF102750 



Apanteles canarsiae 16S large subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 



m 



1753 AF034389 



Plasmodium falciparum sexual stage antigen (sl6) gene, promoter and 
| partial cds 



1754 
1755 



Staphylococcus aureus DNA, complete structure of cassette 
AB047239 chromosome(SCC)-like element, strain:ATCC25923 



X98669 | A.thaliana zatl gene 



1756 X16619 Chlamydomonas reinhardtii ARG7 gene for argininosuccinate lyase 

1757 I XM_005488 [Homo sapiens KIAA0354 gene product (KIAAQ354), mRNA 



1758 AF1 10966 HIV-1 isolate C-96BW04.10 country Botswana, complete genome 



1759 AF247193 |Mus musculus endobrevin (Vamp8) gene, exon 1 



1760 XM 045302 



Homo sapiens proteasome (prosome, macropain) 26S subunit, non- 
|ATPase, 5 (PSMD5), mRNA 



1761 AE006350 



Lactococcus lactis subsp. lactis IL1403 section 112 of 218 of the 
[complete genome 



1762 AJ247001 



Anguilla anguilla mitochondrial DNA for partial tRNA-Thr and 
|tRNA-Pro genes and D-loop, isolate RM1 1 



1763 I AB024996 
1764 



Cicer arietinum mRNA for rac-type small GTP-binding protein, 
[complete cds 



Ml 3945 Mus musculus pim-1 protein kinase gene, complete cds 



1765 AF106842 Platanus racemosa PlaraLFY protein (PlaraLFY) gene, complete cds 



Staphylococcus epidermidis strain SRI clone step,1024f03 genomic 
1766 | AF269822 | sequence 



1767 I AF256830 TSalmo salar clone BHMS549 microsatellite sequence 



P VALUE 



0.23 



5.9 



0.64 



4E-23 



0.14 



0.002 



0.54 



0.15 



0.026 



1.2 



0.076 



0.19 



0.066 



0.18 



1.5 



0.023 



1.5 



0.063 



1.1 



5.4 



0.0008 



0.69 



0.052 



1.6 



5.7 



0.22 



7.5 



1.8 
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SEQIE 

JXKJ 


) 

ACCESSN 


DESCRIP 


P VALUE 


1 /OO 


AE007395 


Streptococcus pneumoniae section 78 of 194 of the complete genome 


1.6 


1 TAO 

i /oy 


AB028896 


Streptococcus agalactiae type-specific capsular polysaccharide 
biosynthesis gene region (cpsIaA to ung gene), complete cds 


0.025 


1 / /U 


AL583831 


Human DNA sequence from clone RP1 1-469L10 on chromosome 6, 
complete sequence [Homo sapiens] 


0.15 


1 / / 1 


AL354683 


Human DNA sequence from clone RP1 1-259K1 1 on chromosome 20, 
complete sequence [Homo sapiens] 


0.21 


1 7T) 

1 / 12. 


XM 047524 


Homo sapiens procollagen (type III) N-endopeptidase (PCOLN3) 
mRNA 


4.6 


I / / J 


AC084500 


Caenorhabditis bnggsae cosmid G13G15, complete sequence 


0.076 


1774 


M13945 


Mus musculus pim-1 protein kinase gene, complete cds 


6.1 


1775 


L27474 


Human arginase (ARG1) gene dinucleotide repeat polymorphism, 
intron 5 


0.21 


1776 


Z79696 


Caenorhabditis elegans cosmid F54F3, complete sequence 


1.9 


1777 


AYO 18282 


Oryza sativa microsatellite MRG0607 containing (AT)X14, closest to 
marker C132, genomic sequence 


0.071 


1778 


AF1 14938 


Saccharomyces sp. IFO 1815 small subunit ribosomal RNA gene, 
mitochondrial gene for mitochondrial RNA, complete sequence 


0.078 


1779 


AF170085 


Homo sapiens Charcot-Leyden crystal protein gene, partial sequence 


6E-18 


1 /oil 


Y08919 


P.flesus mRNA for p53 protein 


0.082 


1 *7Q 1 

1 /ol 


L46868 


Trypanosoma brucei dominant expression site (DES) gene, promoter 
region 


2.2 


1782 


AF157008 


Prunus dulcis self-incompatibility associated ribonuclease gene, 
complete cds 


0.44 


I /o3 


AB017459 


Carabus sylvestris mitochondrial ND5 gene, isolate: Baumgartner 
Austria, partial cds 


0.029 


1 /o4 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.26 


1785 


AF125972 


Caenorhabditis elegans cosmid H25P19 


0.64 


I /OO 


AB015413 


Helicobacter pylori cagA gene, complete cds, strain ATCC43526 


1.5 


1787 


AK021477 


Homo sapiens cDNA FLJ1 1415 fis, clone HEMBA1000942 


7E-12 


1 TOO 
1 / OO 


AC006812 


Caenorhabditis elegans cosmid Y64H9A, complete sequence 


0.63 


1 *7QO 

I /oV 


XM_004181 


Homo sapiens megakaryocyte-enhanced gene transcript 1 protein 
(MEGT1), mRNA 


7E-66 


1790 


] 

AJ236640 J 


Homo sapiens chromosome 22 CpG island DNA, genomic Msel 
fragment, clone 22CGDB49A20 , complete read 


9E-19 


1791 


] 

XM_041751 i 


Homo sapiens discoidin domain receptor family, member 1 (DDR1) 
tnRNA 


3.5 


1792 


] 

AL034546 < 


Human DNA sequence from clone RP5-898I4 on chromosome 
22ql3.33 Contains a GSS and a putative CpG island, complete 
sequence [Homo sapiens] 


0.000005 


1793 


] 

AJ006873 ( 


3uchnera aphidicola plasmid pBUsl, repA2 gene, leu A gene and 
3RF1 


0.23 
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NO 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



1794 | X79779 IS.tuberosum KST1 mRNA 



DESCRIP 



P VALUE 



1795 



AF371373 Cricetulus griseus hypothetical protein Al-3 mRNA, complete cds 



1796 AF165082 



iPollachius virens cytochrome b gene, partial cds; mitochondrial gene 
| for mitochondrial product 



1797 
1798 



AE007418 



1799 
1800 



. ■*» 

w 

fy 

K 4 



1801 
1802 



1804 



1806 
1807 



rn 

■Jr.? * 

H 



1811 
1812 



1813 
1814 



1817 
1819 



Streptococcus pneumoniae section 101 of 194 of the co mplete genome I 



X67082 jH. sapiens AMBP gene transcription regulatory region 



Homo sapiens mRNA; cDNA DKFZp586E2024 (from clone 
AL1 17655 |DKFZp586E2024) 



AB048972 iMacaca fascicularis brain cDNA, clone:QnpA-12040 



Homo sapiens M-phase phosphoprotein 10 (U3 small nucleolar 
XM_030865 ribonucleoprotein) (MPHOSPH10), mRNA 



AF123488 South River virus polyprotein mRNA, complete cds 



1803 AB027590 



Pachliopta aristolochiae mitochondrial mRNA for NADH 
dehydrogenase subunit 5, partial cds 



J00889 I Chicken cellular myc proto-oncogene, complete cds 



AB023486 
D21196 



Homo sapiens gene for histamine H2 receptor, promoter region and 
| complete cds 



pictyostelium discoideum mitochondrial DNA 



1808 AY035029 



Arabidopsis thaliana putative dihydroorotate dehydrogenase 
(MEB5.3/AT3gl7810) mRNA, complete cds 



1809 AB047389 



Clostridium difficile purL gene for formylglycinamidine 
ribonucleotide synthetase (FGAM synthetase), complete cds 



1810 AK012033 



Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2610315E15, full insert sequence 



AF044202 
AK026967 



Drosophila melanogaster glutamate receptor DGluRIIB mRNA, 
I complete cds 



AB050432 
X16871 



Homo sapiens cDNA: FLJ233 14 fis, clone HEP1 1989 



iMacaca fascicularis brain cDNA, clone:QnpA-21861 



Human DNA for elongation factor 1 -alpha (clone lambda-4) 



1815 AE000001 



Mycoplasma pneumoniae M129 section 16 of 63 of the complete 
| genome 



1816 AK002671 



Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone : 06 1 002 5N 14, full insert sequence 



0.67 



0.61 



0.53 



0.027 



AF202903 



Sporopachydermia cereana var. D77-321B internal transcribed spacer 
1, 5.8S ribosomal RNA gene and internal transcribed spacer 2, 
complete sequence; and large subunit ribosomal RNA gene, partial 
I sequence 



1820 



L42167 
XM 027078 



Mus musculus (clone R24) rds gene, partial cds 



[Homo sapiens ATPase, Class V, type 10D (ATP10D), mRNA 



1821 



L27619 



Macaca nemestrina T-cell receptor beta chain V region (VM5.1) 
| gene, partial cds 



0.006 



5.2 



0.53 



0.18 



0.49 



0.024 



0.008 



Homo sapiens similar to protein kinase C binding protein 1 (H 
1805 | XM_030024 | sapiens) (LQC90218), mRNA 



0.009 



0.064 



0.21 



5.3 



0.019 



2.2 



0.008 



0.66 



0.4 



4.7 



1.9 



0.052 



0.5 



0.16 



0.16 



0.016 
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NO 


ACCESSN 


DESCRIP 


P VALUE 


1 ROO 
loZz 


BC003900 


Mus musculus, Similar to hypothetical protein 384D8_6, clone 
MGC:6766 IMAGE:3601298, mRNA, complete cds 


0.51 


1823 


XM_050561 


Homo sapiens KIAA0700 protein (KIAA0700), mRNA 


0.51 


1824 


S66407 


FLT4=receptor tyrosine kinase isoform FLT4 long {3* region, 
alternatively spliced} [human, mRNA Partial, 216 nt] 


8E-36 


lo/D 


AY008805 


Equus caballus retinoblastoma (RBI) gene, partial cds 


0.058 


lozo 


AJ276629 


Rattus norvegicus Sacm21/RT1-A intergenic region, haplotype RTln 
and partial RT1-A gene for MHC Class I antigen 


3.7 


i en 

182 / 


AF029975 


Eucalyptus grandis MADS box protein (EGM1) mRNA, complete cds 


L2 


1828 


AL109708 


Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 
254679 


0.019 


1 OQ 

lozy 


AF270353 


Staphylococcus epidermidis strain SRI clone step.4025a02 genomic 
sequence 


0.051 


1831 


U72236 


Dictyostelium discoideum ModA (modA) gene, complete cds 


0.5 


1832 


L15234 


Mycoplasma pneumoniae Cys-tRNA, Pro-tRNA, Met-tRNA, Ile- 
tRNA, Ser-tRNA, flVlet-tRNA, Asp-tRNA and Phe-tRNA genes 


1.1 




XM_012722 


Homo sapiens RNA (guanine-7-) methyltransferase (RNMT), mRNA 


0.5 


1834 


AF108228 


Ginkgo biloba GinLFY protein (GinLFY) gene, complete cds 


4.2 


15 JO 


XM_049943 


Homo sapiens topoisomerase (DNA) II alpha (170kD) (TOP2A), 
mRNA 


0.47 


i en 


NC_002674 


Aulopus japonicus mitochondrion, complete genome 


3.1 


1838 


AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.008 


1 o-jo 

lojy 


U53695 


Leishmania enriettii multidrug resistance (lemdrl) gene, 5* UTR 


0.5 


i ©/in 


AF257022 


Salmo salar clone BHMS7-009 microsatellite sequence 


0.7 


1841 


AF239571 


Acineta supeiba tRNA-Leu (trnL) gene, partial sequence; and trnL-F 
intergenic spacer region, complete sequence; chloroplast gene for 
chloroplast product 


5.4 


lo4Z 


AL032651 


Caenorhabditis elegans cosmid Y6D1A, complete sequence 


0.019 


1 O A A 

1844 


XM_034229 


Homo sapiens similar to hypothetical protein FLJ14260 (H. sapiens) 
(LOC90803), mRNA 


0.068 


1845 


AL137121 


Human DNA sequence from clone RP1 1-122K21 on chromosome 13, 
complete sequence [Homo sapiens] 


e-147 


1846 


AF137535 


Danio rerio clone Dare-D AXX DAXX protein mRNA, partial cds 


1.7 


154 / 


AK023539 


Homo sapiens cDNA FLJ13477 fis, clone PLACE1003638 


0.21 


lo4o 


AF2 18799 


Gallus gallus clone LDH-BT1 lactate dehydrogenase B mRNA, 
complete cds 


6.4 


1849 


AF250345 


Xenopus laevis early growth response protein mRNA, complete cds 


0.52 


1850 


BC006858 


Mus musculus, clone IMAGE:3594635, mRNA 


1.5 


1852 


XM 006782 


Homo sapiens cyclin Tl (CCNT1), mRNA 


0.092 


1853 


J03883 


Rabbit zeta-1 globin processed pseudogene, complete cds 


1.5 
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NO 



ACCESSN 



DESCRIP 



P VALUE 



1854 I AF349951 [Homo sapiens HP95 mRNA, complete cds" 



3E-09 



1855 



Candidates Carsonella ruddii natural-host Bactericera cockerelli RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
AF268062 | subunit (rpoC) genes, partial cds 



0.17 



1856 AF1 14508 



I Uncultured gamma proteobacterium SUR-ATT-8 16S ribosomal RNA 
|gene, complete sequence 



1.5 



1857 AE006785 Sulfolobus solfataricus section 144 of 272 of the complete genome 



0.091 



1858 AF078684 [Psoroptes ovis glutathione S-transferase mRNA, complete cds 



1859 BC004671 



Mus musculus, FK506 binding protein la (12 kDa), clone MGC:6253 
|IMAGE:3498682 > mRNA, complete cds 



0.0006 



0.12 



1860 



NM_025083 Homo sapiens hypothetical protein FLJ21 128 (FLJ21 128), mRNA 



e-121 



1861 BC004161 



Homo sapiens, transmembrane 4 superfamily member (tetraspan NET- 
7), clone MGC:2447 IMAGE:2958221, mRNA, complete cds 



0.051 



1862 BC009855 



Homo sapiens, Similar to nucleolin, clone MGC: 16354 
|lMAGE:3926227, mRNA, complete cds 



1863 
1864 



Mus musculus adult male testis cDNA, RIKEN full-length enriched 
AK0 16507 library, clone:4931439C15, full insert sequence 



e-171 



0.019 



AF334756 Homo sapiens interleukin-1 HY2 (IL1HY2) gene, complete cds 



1.5 



Human DNA sequence from cosmid L118G10, Huntington's Disease 
1865 | Z68881 |Region, chromosome 4pl6.3 



0.005 



1866 AJ277737 



Homo sapiens partial POLR2J2 gene for RPB1 lbl protein intron 4 
1(1824 BP) 



0.0000001 



1867 XMJ)51572[Homo sapiens heat shock transcription factor 2 (HSF2), mRNA 



1868 AF256918 |Salmo salar clone BHMS267 microsatellite sequence 



1E-11 



0.11 



1869 
1870 



AE000557 [Helicobacter pylori 26695 section 35 of 134 of the complete genome 
XM_050469 [Homo sapiens ferritin, light polypeptide (FTL), mRNA 



0.17 



0.081 



1871 XM_042818 [Homo sapiens homeo box D4 (HOXD4), mRNA 



0.24 



Petunia x hybrida mRNA for lateral shoot inducing factor complete 
1872 I AB035093 cds 



1.9 



Pasteurella multocida PM70 section 151 of 204 of the complete 
1873 | AE006184 [genome 



5.6 



1874 AE006880 Sulfolobus solfataricus section 239 of 272 of the complete genome 



3.3 



1875 NC_002669 Bacteriophage bIL3 10, complete genome 



4.7 



1876 AK0 12924 



Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810048H02, full insert sequence 



1.8 



1877 XM 032188 Homo sapiens cell division cycle 27 (CDC27), mRNA 



5.5 



1878 M33662 [Hamster G-o protein alpha subunit 2 mRNA, complete cds 



5.8 



1879 Y00478 |L.esculentum LEACQ2 gene 



0.28 



1880 XM_050519 [Homo sapiens annexin A2 (ANXA2), mRNA 



8E-34 



1881 | AF054822 [Drosophila melanogaster activin beta precursor, gene, partial cds 



0.43 
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SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


1882 


AB062967 


Macaca fascicularis brain cDNA c!one:QmoA-10357, full insert 
sequence 


1.7 


1883 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA clnnp-47HQ 
downstream contig 


0.64 


1884 


AF378137 


Plasmodium falciparum clone r2-4-4#5 MB2 eene nartial cd* 


n ns*i 


1885 


XM 011791 


Homo sapiens laminin, gamma 3 (LAMC3), rnRNA 


3E-58 


1886 


XM 007214 


Homo sapiens hypothetical protein FLJ10956 (FLJ10956) mRNA 


e_i iq 

^ 117 


1887 


AF227741 


Rattus norvegicus protein kinase WNK1 (WNK1) mRNA, complete 
cds 


0.96 


1888 


AL359578 


Homo sapiens mRNA; cDNA DKFZp547N163 (from clone 
DKFZp547N163) 


0.063 


1889 


AE002841 


Drosophila melanogaster genomic scaffold 142000013385478, 
complete sequence 


0.19 


1890 


X01121 


Mycoplasma capricolum genes for ribosomal proteins S8 and L6 


0.5 


1892 


AF082924 


Trissolcus basalis cytochrome oxidase II gene, partial cds; and tRNA- 
Asp and tRNA-Lys genes, complete sequence- mitochondrial eenes for 
mitochondrial products 


0.02 


1893 


BC005693 


Mus musculus, Similar to RIKEN cDNA 1 1 10002A15 gene, clone 
MGC: 1 1574 IMAGE: 3 597594, mRNA, complete cds 


1.4 


1894 


U29721 


Entamoeba histolytica p21racB (Eh racB) gene, partial cds 


0.15 


1895 


AK023959 


Homo sapiens cDNA FLJ13897 fis, clone THYRO1001706 


0.17 


1896 


XM_038475 


Homo sapiens DKFZP564J102 protein (DKFZP564J102), mRNA 


6E-33 


1897 


AL010164 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-95, complete sequence 


1.1 


1898 


AF176059 


Helcon sp. 16S ribosomal RNA gene, partial sequence; mitochondrial 
gene for mitochondrial product 


0.009 


1899 


L34640 


Homo sapiens platelet/endothelial cell adhesion molecule- 1 (PECAM- 
1) gene, exons 7, 8 and 9 


0.007 


1901 


AC026061 


Homo sapiens BAC clone RP1 1-223K9 from Y, complete sequence 


1.5 


1902 


AF188516 


Staphylococcus aureus Bmr-like protein SblA (sblA) gene, complete 
cds 


0.08 


1903 


AL360177 


Human DNA sequence from clone RP1 1-86K9 on chromosome 10, 
complete sequence [Homo sapiens] 


0.077 


1904 


L76261 j 


Meloidogyne javanica mitochondrial transfer RNA His, 16S ribosomal 
RNA (16S rRNA) genes, ND3 gene, complete cds, and cytochrome b 
gene, 5' end of CDS 


0.17 


i oris 


3 

XM 042904 ] 


Homo sapiens mitochondrial ribosomal protein S25 (MRPS25), 
mRNA 


2 


1906 


] 
< 

XM 051831 ( 


fiomo sapiens similar to CYTOCHROME P450 51 (CYPL1) 
P450L1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 
1 4- ALPHA DEMETHYLASE) (LDM) (P450-14DM) (H. sapiens) 
LOC93525), mRNA 


0.66 
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SEQIE 
NO 


ACCESSN 


DESCRIP 


P VALUE 


1907 


XM 012289 


Homo sapiens cytoskeleton associated protein 2 (CKAP2) mRNA 


2 


1908 


U44391 


Human Down Syndrome region of chromosome 21, clone A12H1-1B2 


0.008 


1909 


L04961 


Mouse nuclear-localized inactive X-specific transcript (Xist) mRNA 


1.8 


1910 


X61202 


Methanococcus voltae fruA, fruB, fruG genes for subunits of F420- 
reducing hydrogenase and fruD orf 


0.078 


1911 


AP000242 


Homo sapiens genomic DNA, chromosome 21q22.1, clone:f43Dll, 
SOD-AML region, complete sequence 


0.0001 


1912 


AL023777 


S.pombe chromosome III cosmid cl827 


0.66 


1913" 


XM 038450 


Homo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0 


1914 


AF129510 


Mus musculus DACH protein (Dach) mRNA, complete cds 


2.1 


1915 


AE001405 


Plasmodium falciparum chromosome 2, section 42 of 73 of the 
complete sequence 


0.18 


1916 


XM 039566 


Homo sapiens KIAA0948 protein (KIAA0948), mRNA 


6.8 


1917 


AP001431 


Homo sapiens genomic DNA, chromosome 21 q22. 2, clone:T1492, 
LB7T-ERG region, complete sequence 


0.074 


1918 


AF329137 


Camelus bactnanus microsatellite Cmsl sequence 


0.008 


1919 


NM_003816 


Homo sapiens a disintegrin and metalloproteinase domain 9 (meltrin 
gamma) (ADAM9), mRNA 


1E-17 


1920 


AE001414 


Plasmodium falciparum chromosome 2, section 51 of 73 of the 
complete sequence 


0.001 


1921 


U95074 


Caenorhabditis elegans protein disulphide isomerase isoform I (pdi-1) 
and cyclophilin isoform 9 (cyp-9) genes, complete cds 


0.029 


1922 


ABO 19534 


Homo sapiens gene for cathepsin L2, complete cds 


0.07 


1923 


AJ133777 


Arabidopsis thaliana mRNA for gamma-adaptin 2 


6.6 


1924 


XM_001304 


Homo sapiens cyclin-dependent kinase inhibitor 2C (pl8, inhibits 
CDK4) (CDKN2C), mRNA 


0.25 


1925 


AB037166 


Clostridium botulinum genes for ORF-22, HA-70, HA- 17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.2 


1926 


Z48243 


A. thaliana PARP mRNA for PARP protein 


0.081 


1927 


XM_041006 


Homo sapiens choreoacanthocytosis gene; KIAA0986 protein 
(KIAA0986), mRNA 


0.25 


1928 


XM_050592 


Homo sapiens uncharacterized hematopoietic stem/progenitor cells 
protein MDS032 (MDS032), mRNA 


0.083 


1929 


i 

U34610 


Mus musculus alpha- 1 (XVIII) collagen (COL 18 A 1) gene, exon 6 and 
7 


0.008 


1930 


] 

BC007658 < 


Homo sapiens, clone MGC747 IMAGE:3343994, mRNA, complete 
ids 


6E-50 


1931 


] 

AL451135 < 


Human DNA sequence from clone RP1 1-45G20 on chromosome 6, 
:omplete sequence [Homo sapiens] 


2 


1932 


] 

AB004883 < 


Vdarchantia paleacea mRNA for ribulose 1,5-bisphosphate 
carboxylase/oxygenase small subunit, complete cds 


0.26 
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SEQIE 
NO 


> 
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DESCRIP 


P VALUE 


1933 


AFO 19973 


Rattus norvegicus neuron-specific enolase (NSE) mRNA, complete 
cds 


0.22 


1934 


AF231684 


Drosophila melanogaster outstretched (os) gene, complete cds 


0.75 


1935 


AC079136 


Homo sapiens clone RP1 1-153L12, complete sequence 


0.75 


1936 


AE006611 


Streptococcus pyogenes Ml GAS strain SF370, section 140 of 167 of 
the complete genome 


1.5 


1937 


AE007418 


Streptococcus pneumoniae section 101 of 194 of the complete genome 


0.029 


1938 


L26949 


Simian immunodeficiency virus (T5) surface envelope glycoprotein 
proviral gene, 5* end 


0.027 


1939 


AF343914 


Campylobacter jejuni NCTC 11828 LOS biosynthesis cluster, partial 
sequence 


0.58 


1940 


Z69925 


Human DNA sequence from clone LL22NC03-1 16A5 on chromosome 
22 Contains GSSs, complete sequence [Homo sapiens] 


6.2 


1941 


U96876 


Homo sapiens insulin induced protein 1 (INSIG1) gene, complete cds 


0.78 


1942 


XM 030024 


Homo sapiens similar to protein kinase C binding protein 1 (H. 
sapiens) (LOC90218), mRNA 


0.009 


1943 


AF226993 


Rattus norvegicus selective LIM binding factor mRNA, complete cds 


0.7 


1944 


AF033037 


Plasmodium falciparum merozoite surface protein 4, merozoite 
surface protein 5, merozoite surface protein 2, and adenylosuccinate 
lyase genes, complete cds 


0.69 


1945 


AB053087 


Human immunodeficiency virus type 1 pol gene, reverse transcriptase 
amino terminal part, partial cds, 99JP-NH3-II plasma virus 
clone:NH3RT-V3-2-9 


0.028 


1946 


XM 033335 


Homo sapiens hypthetical protein PR02389 (PR02389), mRNA 


0.0001 


1947 


XM_010334 


Homo sapiens transducin (beta)-like 1 (TBL1), mRNA 


2.2 


1948 


AK003943 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1 1 10028F1 1, full insert sequence 


2E-10 


1949 


AF085600 


Drosophila melanogaster inorganic pyrophosphatase NURF-38 (Nurf- 
38) gene, complete cds 


0.78 


1950 


T T7Q7^ 1 

u fy / j l 


Plasmodium berghei extrachromosomal plastid PB-1, ORF470 gene, 
partial cds, tRNA-Thr, large subunit ribosomal RNA, tRNA-Met, 
tRNA-Arg, tRNA-Val, tRNA-Arg, tRNA-Leu, tRNA-Asn, tRNA-Ala, 
and small subunit ribosomal RNA genes, complete sequences 


0.72 


1951 


AK022877 ] 


Homo sapiens cDNA FLJ12815 fis, clone NT2RP2002546 


0.003 


1952 


M62755 ] 


Potato 4-coumarate-CoA ligase (St4Cl-l) gene, complete cds 


0.085 


1953 


] 

AKO 17902 1 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
Library, clone:5830406J20, full insert sequence 


2.3 


1954 


j 

AF164418 


Sorghum bicolor maturase (matK) gene, complete cds; chloroplast 
*ene for chloroplast product 


0.078 


1955 


X54567 


iuman Mr^U22 dinucleotide repeat DNA 


3E-13 
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SEQIC 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


1956 


AB056832 


Macaca fascicixlaris brain cDNA clone:QflA-14079, full insert 
sequence 


0.00009 


1957 


XM 034132 


Homo sapiens ubiquitously transcribed tetratricopeptide repeat gene, 
Y chromosome (UTY), mRNA 


7 


1958 


L09706 


Homo sapiens complement component 2 (C2) gene allele b, exons 1 
through 8 


0.24 


1959 


XM 017231 


Homo sapiens hypothetical protein FLJ23342 (FLJ23342), mRNA 


8E-76 


1960 


X83989 


P.palustris chloroplast rbcL gene, promoter region 


0.003 


1961 


AK025090 


Homo sapiens cDNA: FLJ21437 fis, clone COL04285 


0 


1962 


AK013805 


Mus museums adult male hippocampus cDNA, RDCEN full-length 
enriched library, clone:2900079F10, full insert sequence 


0.14 


1963 


AE006210 


Pasteurella multocida PM70 section 177 of 204 of the complete 
genome 


0.042 


1964 


AF075080 


Homo sapiens full length insert cDNA YQ80D07 


0.000001 


1965 


D42118 


Tobacco gene for par As protein, promoter region and partial cds 


0.009 


1966 


AF269350 


Staphylococcus epidermidis strain SRI clone step.l002dl2 genomic 
sequence 


2.3 


1967 


AF068904 


Staphylococcus aureus cell division protein FtsZ (ftsZ) gene, partial 
cds; YlmD (ylmD), YlmE (ylmE), YlmF (ylmF), YlmG (ylmG), and 
YlmH (ylmH) genes, complete cds; and cell division protein DivIVA 
(divIVA) gene, partial cds 


0.19 


1968 


AL589920 


Human DNA sequence from clone RP1 1-1 17A20 on chromosome 6, 
complete sequence [Homo sapiens] 


0.003 


1969 


AF027688 


Mus musculus survival motor neuron (Sinn) gene, promoter region 
and partial cds 


0.56 


1970 


AL049340 


Homo sapiens mRNA; cDNA DKFZp564P056 (from clone 
DKFZp564P056) 


0.25 


1971 


AJ223385 


Fowlpox virus strain HP-440 DNA, isolate FP9, 14.6 kb fragment 


0.25 


1972 


U25354 


Human mitochondrial DNA control region, Indonesian, sequence 1 of 
60 


3E-65 


1973 


AF269868 


Staphylococcus epidermidis strain SRI clone step.l027g02 genomic 
sequence 


0.083 


1974 


AE003326 


Drosophila melanogaster genomic scaffold 142000013385598, 
complete sequence 


0.087 


1975 


XM_050217 


Homo sapiens RNA binding protein (LOC84549), mRNA 


2.1 


1976 


AX1UUI4UZ 


Plasmodium falciparum chromosome 2, section 39 of 73 of the 
complete sequence 


0.026 


1977 


XM_001935 ] 


Homo sapiens forkhead box D2 (FOXD2), mRNA 


2.1 


1978 


] 

AL590384 , 


Human DNA sequence from clone RP1 1-349A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


0.008 


1979 


] 

AL1 17050 < 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
ieprivation 


2.2 


1980 


XM_037529 ] 


Homo sapiens KIAA0410 gene product (KIAA0410), mRNA 


2.3 


1981 


X56212 < 


C.elegans gene for vitellogenin 


1.8 
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SEQID 
NO 


• 

ACCESSN 




P VALUE 


1982 


AF231684 


Drosophila melanogaster outstretched (os) gene, complete cds 


0.7 


1983 


AB032264 


L/miiu icuu nixvi^rt. j.ur glycogen syninase Kinase j aipna, complete 
cds 


2.2 


1984 


AK022877 


Homo sapiens cDNA FLJ12815 fis, clone NT2RP2002546 


0.009 


1985 


U89140 


\zillC TT111QClllll<5 £1 iHoQP fpHl lCti* C** <J(*Y\f± nrATTlAtpr ve* rri r\ n o*-»/4 nvn*i 1 
mui) jiiuovuiuo <uuv;3b ibUULiaoC gCllC, pi UillUlCI IC^lOH ailQ CXOIl 1 




1986 


M37036 


Rat nucleolar proteins B23. 1 and B23.2 


0.086 


1987 


XM 035498 


aa^/iciia aj.iuiuiiu.c _ aciiMLivc tauon cnannei z, neuronal 
(ACCN2> mRNA 




1988 


U40944 


C^flGTlorhflhH'iti** p1pCJJ}nc f*ncmift PHR 1 pr\mnlAtn c«viiion^A 
v^aviiv/iiiauuiua ^lbgaiio wuoiEiiu iL/Jj 1 ^ VAJIIipiClC oC(JUeilCC 




1989 


AF102762 


Dasylagon n. sp. 'Mardulyn & Whitfield 1 16S large subunit ribosomal 
mi,ri paiuai acLjucucc, iiiiiuC'iionuiiai gene ior miiocnonciriai 
product 


0.24 


1990 


AF247631 


iviciui l^i^ciuci ivJidiiLiic cyiuciironic oxioase suounit i ^v^uij gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.026 


1991 


Z81538 


Caenorhabditis elegans cosmid F45H10, complete sequence 


6.2 


1992 


Z72516 


va^/iiui licm/uiLio ^i^gaiis L/VJollllu. izjuj, CUIIipiCie oCCJ 




1993 


AE002201 


i^iiiciiiiyuupiiiid piieunioiuae /uuy, section jz oi y4 01 tne complete 
genome 


6.6 


1994 


NM_003816 


nuiiiu Exipiciia d. uxaiiLLcgrin ana meuuioproieinase domain y (meitnn 
gamma) (ADAM9), mRNA 


2E-17 


1995 


AL158132 


n uiu dii i^i>/-v sequence rrom cione Kx\3oz J/VZ4 on chromosome 4. 
Contains GSSs and a putative CpG island, complete sequence [Homo 
sapiens] 


U.U /4 


1996 


NM 023122 


Mus musculus fflvconrotein m6h (Cr-nm&\i\ tnPMA 


/ 


1997 


AFO 14502 


CrlvcitlG max <5PpH r*nJlt nPTnviHjicp rxre^nircnr fT^rs\ aanp ^nmnlofD n/lo 
^-»xj^-i*av iiicuv owvu wai pci UAiuao^ piCl/lLLMJl ^.cp^ gCIlC, (JUHipieie COS 


l.o 


1998 


AJ270226 


Entodinium caudatum partial mRNA for putative glycosyltransferase, 
clone L48 


0 1 

Z. 1 


1999 


AF159913 


^upiuita tiaaaua Li ailopvJMJll iCLl ClOIlC 1CC1-Z Oil Z aXlQ OlT j 
nsendoffenes comnlptf* opnupnrrf* 


U.UI 1 


2000 


AF101309 


Caenorhah(iiti<! pIppsitic poqitiiH T-f?4f^n^ ^n-m«l*»t(a cpniipnr>d 


z.z 


2001 


AB021240 


a ij po^it/uu is ^viiipicaous liLiiuciiuiiuxidi gene ior ^yxoenrome o, 
complete cds 


0.022 


2002 


AC078798 


Homo sanierm ^ RAf^ PAf*-S^Pl 1 rRr*e«rt»ll ParL- pon^Ar Tnrtitnto 
Human BAC Libra rv^ pnmnl pfp oviiipnrp 


U.UU1 


2003 


AK021504 


Homo sapiens cDNA FLJ1 1442 fis, clone HEMBA1001327 


0.76 


2004 


AF3 14531 


Chilli leaf curl virus A component, complete sequence 


0.23 


2005 


XM_047794 


Homo sapiens KIAA1641 protein (KIAA1641), mRNA 


e-123 


2006 


AC079136 


Homo sapiens clone RP11-153L12, complete sequence 


0.65 


2007 


AE001422 


Plasmodium falciparum chromosome 2, section 59 of 73 of the 
complete sequence 


0.001 


2009 


AJ306692 


Drosophila virilis ORF1, rpL14 gene and ORF2 


0.26 


2010 


AC084242 


Arabidopsis thaliana chromosome 1 BAC T24P22 genomic sequence, 
complete sequence 


0.72 
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2011 Z16768 



H. sapiens (D12S90) DNA segment containing (CA) repeat; clone 
|AFM172xd8; single read 



0.058 



2012 AF3 11633 



Eimeria nieschulzi ORF470 gene, partial cds; apicoplast gene for 
|apicoplast product 



1.9 



2013 XM_012933 

2014 AB035190 



Homo sapiens hypothetical protein FLJ1 1656 (FLJ1 1656), mRNA 



2.5 



Homo sapiens RHP gene, intron 2, complete sequence 



1.6 



2015 NC 002135 Pleurotus ostreatus mitochondrial plasmid mlpl, complete sequence 



0.22 



2016 
2017 



AF085619 
D85424 



Elgaria paucicarinata NADH dehydrogenase subunit I (ND1) gene, 
partial cds; tRNA-Ile, tRNA-Gln, and tRNA-Met genes, complete ' 
sequence; NADH dehydrogenase subunit II (ND2) gene, complete cds 
tRNA-Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr genes, 
O 



2.3 



Homo sapiens gene for alphaSl -casein, 5 t flanking region 



0.077 



2018 
2019 



AE001138 
BC007114 



Borrelia burgdorferi (section 24 of 70) of the complete genome 



0.026 



Homo sapiens, clone IMAGE:4295422, mRNA 



0.23 



2020 AF080510 



Homo sapiens mannose-binding protein gene, exon 4 and complete 
cds 



0.00001 



Homo sapiens hypothetical protein DKFZp761J139 (DKFZp761J139) 
2022 | XM_027941 mRNA 



H.sapiens mRNA for immunoglobulin kappa light chain VJ region 
2023 | Z68996 |(ID POM021) 



0.003 



0.15 



2024 AB049191 



Mus musculus DNA, clone:lambda4/6, endogenous mouse mammary 
I tumor virus LTR region 



0.15 



2025 U62057 



Mycoplasma capricolum NADH oxidase (naox) gene, partial cds, and 
[lipoate-protein ligase (Ipla), pyruvate dehydrogenase EI alpha subunit 
(odpa), pyruvate dehydrogenase EI beta subunit (odpb), pyruvate 
dehydrogenase EII (odp2), dihydrolipoamide dehydroge> 



2026 AY018527 



Oryza sativa microsatellite MRG0852 containing (AT)X17, genomic 
I sequence 



0.0003 



2027 AF266084 



Phyllopetalia apicalis large subunit ribosomal RNA gene, partial 
sequence; tRNA-Valine gene, complete sequence; and small subunit 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
[mitochondrial products 



0.009 



2028 AF249912 



Cucumis melo galactinol synthase (GAS1) gene, upstream sequence 
|and partial cds 



0.021 



2029 XM 048172 



Homo sapiens hypothetical protein DKFZp434F1017 
|(DKFZP434F1017), mRNA 



6.5 



2030 AF367436 



Erysimum cheiri chloroplast outer envelope membrane protein 
|mRNA, complete cds 



1.2 



2031 I AF135026 
2032 



Homo sapiens kallikrein-like protein 3 (KLK9) gene, complete cds, 
I alternatively spliced 



0.76 



Z69782 | Athermophilum xylR gene, xynA gene and alf A gene 



0.0004 
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> 
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2033 


JNJV1_U1 j fji) 


Mus musculus signaling lymphocyte activation molecule (Slam), 
mRNA 


6.4 


2034 


NC_002811 


Tarsius bancanus mitochondrion, complete genome 


0.69 


2035 


Ar 184822 


Arabidopsis thaliana inhibitor tagged site ITS55 genomic sequence 


0.24 


2036 


AE006626 


Streptococcus pyogenes Ml GAS strain SF370, section 155 of 167 of 
the complete genome 


5.4 


2037 


AF324424 


Ichthyophthirius multifiliis immobilization antigen isoform (IAG52A) 
gene, complete cds 


0.003 


2038 


AT lOI ill 
/VLoV YLZ5 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
325N7 PCR1 of library RPCI-11 from chromosome 14 of Homo 
sapiens (Human), complete sequence 


6.3 


2040- 


AF369348 


Rabies virus V231.RB phosphoprotein (P) gene, complete cds 


1.4 


2041 


AF023666 


Apis melhfera sn-glycerol-3 -phosphate dehydrogenase (Gpdh) gene, 
complete cds 


0.25 


2042 


XM_011553 


Homo sapiens aldo-keto reductase family 1, member Dl (delta 4-3- 
ketosteroid-5-beta-reductase) (AKR1D1), mRNA 


0.001 


2043 


AM UU4530 


Homo sapiens nuclear receptor subfamily 2, group E, member 1 
(NR2E1), mRNA 


5E-41 


2044 


AJ277602 


Gallus gallus partial mRNA for initiation of translation factor 2 (cIF2 
gene) 


0.72 


2045 


XM_041537 


Homo sapiens echinoderm microtubule-associated protein-like 
(EMAPL), mRNA 


1.7 


2046 


AE001429 


Plasmodium falciparum chromosome 2, section 66 of 73 of the 
complete sequence 


0.57 


2047 


T A*7 1 OO 

.L03188 


Saccharomyces cerevisiae integrin analogue gene, complete cds 


0.23 


2048 


XM034810 


Homo sapiens similar to ALU SUBFAMILY SQ SEQUENCE 
CONTAMINATION WARNING ENTRY (H. sapiens) (LOC90908), 
mRNA 


2E-25 


2049 


AJbUUo57U 


Streptococcus pyogenes Ml GAS strain SF370, section 99 of 167 of 
the complete genome 


0.64 


2050 


AiUZJV / / 


Oryza sativa microsatellite MRG6302 containing (ATTA)X6, closest 
to marker R1674, genomic sequence 


0.6 


2051 


AyTQl 70-1 


Dictyostelium discoideum thioredoxin (TRX1) mRNA, complete cds 


0.68 


2052 


XM_032748 


Homo sapiens KIAA1634 protein (KIAA1634), mRNA 


0.066 


2053 


A13UU4514- 


uryza sativa UNA lor NLS receptor, complete cds 


0.009 


2054 




Homo sapiens clone FLB341 1 PRO0852 mRNA, complete cds 


0.078 


2055 


Y19213 


iaviiiu j;mauvc psiiLnurv pacuuugciic ior naix Keratin, exons 2. 
to 7 


0.038 


2056 


AF305146 ] 


Picea abies clone 1PABCA3F hypomethylated genomic sequence 


1.8 


2057 


] 

Z67997 ] 


Human DNA sequence from cosmid L206D7, Huntington's Disease 
Region, chromosome 4pl6.3 


0.0001 


2058 


U51024 1 


iuman Down Syndrome region of chromosome 21 DNA 


0.0008 


2059 


AC092063 ] 


lomo sapiens clone RP1 1-745H7, complete sequence 


0 
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DESCRIP 



2060 AE 002123 Ureaplasma urealyticum section 24 of 59 of the complete genome 

lHIV-1 DNA V3 region (homosexual H1031, seroconversion sample 

2061 I Z68019 1987) 



2062 



XM_006487 Homo sapiens spectrin, beta, non-erythrocytic 2 (SPTBN2), mRNA 



2063 AF329716 



Oncorhynchus mykiss clone OSU9 T cell receptor beta chain variable 
| region gene, partial cds 



2064 
2065 



AC024773 Caenorhabditis elegans cosmid Y40C7 A, complete sequence 



2066 



XM_032748 Homo sapiens KIAA1634 protein (KIAA1634), mRNA 



M58526 Human alpha-5 collagen type IV (CQL4A5) mRNA, 3' end 



2068 X56697 I B.thuringiensis gene for sigma factor 35 



2069 



D00768 



Pig pituitary glycoprotein hormone alpha subunit gene, exons 3 and 
and 3 , flank 



P 

%y 



m 
CO 

U = 



2070 
2071 



AF1 10621 



Boophilus microplus 12S ribosomal RNA gene, partial sequence; 
ItRNA-Ile, tRNA-Gln, and tRNA-Phe genes, complete sequence; and 
NADH dehydrogenase subunit 5 (ND5) gene, partial cds, 
mitochondrial genes for mitochondrial products 



2072 



AC024779 I Caenorhabditis elegans cosmid Y43B11AL, complete sequence 



AK023589 Homo sapiens cDNA FLJ13527 fis, clone PLACE1006076 



2073 
2074 



W.suaveolens mitochondrial DNA intergenic region between ORF1 
X77691 |and 15S rRNA genes 



Z68749 I Caenorhabditis elegans cosmid F56H1 1, complete sequence 



2075 AF302113 



Solanum tuberosum glutamine synthetase GS2 (gin) mRNA, partial 
| cds; nuclear gene for plastid product 



2076 



2077 
2078 



Trichaptum abietinum mitochondrial small subunit ribosomal RNA, 
U27078 mitochondrial gene, partial sequence 



XMJ) 12456 Homo sapiens golgin-67 (KIAA0855), mRNA 



AB032960 Homo sapiens mRNA for KIAA1 134 protein, partial cds 



2079 
2080 



Homo sapiens hypothetical gene supported by AF106046; AK000332 
XM 052376 [(LOC93587), mRNA 



AB021866 Homo sapiens KIP gene, complete cds 



2081 XM 029213 



Homo sapiens hypothetical protein DKFZp564O0523 
|(DKFZP564O0523), mRNA 



2082 AF384143 



Triticum aestivum pathogenesis-related protein 1 mRNA, complete 
cds 



2083 
2084 



I Neisseria meningitidis serogroup B strain MC58 section 10 of 206 of 
AE002368 |the complete genome 



AE006662 I Sulfolobus solfataricus section 2 1 of 272 of the complete genome 



2085 
2086 



Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
AY020760 I marker R2976, genomic sequence 



AF161419 [Homo sapiens HSPC301 mRNA, partial cds 



2087 AJ249838 [Gallus gallus mRNA for radixin (rdx gene) 



2088 



U43536 



ICorynebacterium glutamicum heat shock, ATP-binding protein (clpB) 
I gene, complete cds 



2089 | XM_037588 [Homo sapiens KIAA0874 protein (KIAA0874), mRNA 



P VALUE 



1.8 



0.25 



0.000003 



0.0008 



0.035 



0.079 



2.1 



0.59 



1.6 



0.21 



0.24 



0.24 



1.4 



1.4 



0.64 



2.2 



1E-36 



0.66 



1.3 



0.38 



0.0003 



0.71 



0.63 



6.9 



1.8 



2.1 



0.22 



2E-34 
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2090 


MP 009^07 


PVirvcrtTfivn f^Vilrttwwffo TTiii'ivViriTiH'pirtTi f*f* fin Til t*tf* <y(*tirttnA 

V^lll jroUllljrd 1/lllUlUpygd 11ULU\*111J11U11\J11, \AJ111L/1GLG gCllUUlC 


O 077 


2091 


AF"*'*7907 


i^aenoriiauuius eiegaiis ciouc yK jU!/d / nuciear receptor inxxk-d / gene, 




2092 


U39886 


Bos taurus Y-chromosome specific genomic sequence 


0.2 


2093 


Z19055 


B.aphidicola tryptophan operon 


0.26 


2094 


a t9qo^oq 


OllllClilla llldLLl<x£Z,ll 1*111 UI Up Idol U ILL* LI 111 UHCl gClllL* apdwCl 


0 ^7 


2095 


AK013631 


jvius inuscuius aauii maie mppocampus cljina, kjjve,in iuii-iengin 
enriched library, clone:2900041A09, full insert sequence 


0.18 


2096 


AF01 Sd^R 


PlQcmnHiiifn "fsil^ii^aniTn tni^frtcaf^llitp* T TIO'? cp^fiin^Tirv* 
i IdollHJUllllll Idlt/ipdl lull llllwlUocllGllllC 1 L\JJ sd|Udll*G 


0.11 


2097 


AJ277734 


Sus scrofa microsatellite sequence S0601 


0.4 


2098 


ArZUjj'+i 


Glycine max chloroplast carboxyl transferase alpha subumt (accA-4) 
nuclear pseudogene, partial sequence; and putative steroid reductase 
gene, coinpieie cub 


fi 001 

\J . VJ\s 1 


2099 


J AJV1_U3Z J7 / 


LI nrn « cnr\innc VT A A 1 "3^0 nrntptn fffT A A 1 1 ^0"\ mPMA 

riomo sapiens j\j./\/\ij->u proicin ijjxj), iilx\j.n/v 




2100 




H. sapiens telomeric DNA sequence, clone 6QTEL010, read 


O 007 


2101 




Mus musculus, Harvey rat sarcoma oncogene, subgroup R, clone 
ivioi^ . o i oz iivi/wjxj jhojjIj, iillvtn /\, complete UlS 


± 1 


2102 


A 170/1^1 1 7 


jvius muscuius leuKocyie ceii-suridtve moiecuie \L^yy) gene, exuii i 


0 09 S 


2103 


AL023810 


Caenorhabditis elegans cosmid C16D2, complete sequence 


0.008 


2104 




n. gracilis jlhn/\ lor riuussuiiidi proiein operon 


0 71 


2105 


AF005392 


Homo sapiens alpha tubulin (TUB A2) gene, partial cds 


0.63 


2106 


AF0097QSi 


L/rosopnua meianogasier genomic scanoiu a4zuuuuijjoj4oo, 

/-»<-» T"r"IT"\1 £kfck c F*C\ 1 1 »^ Tl CC± 

VyOIIipiClC aCqUClllyC 


0.025 


2107 


AP009 1 ij. 


TTrponlaciTiQ iif**»iilT7tifiinn cf»r , tioTi 1 S rvF S Q of tVi* 3 rriiTi'nt^t'A (JPTinmp 
U I Cdpictbllld 111 Cdljr LlClllll oCCLLVJll U Ul JJ U1C t»\JllipiClC gCllUlllt/ 


0.074 


2108 




Fibrobacter succinogenes endo-l,4-beta-D-glucanase gene, complete 

/-»/-! c f»nrlr»_ 1 /4 -W^tn — T™^_ol tiOQti q opnp S ' *»tiH nf pHc 
LUS>, ClltlU ~1,*t "UCld™!-/ glUL<dlldoC gCllw, -J C11U Ul tuo 


2 


2109 


D87922 


ixHi uin/\ ior j u iiv oi sKeieuu inuscie bou.1u.111 ciuuiiiei, pdiudi 
sequence 


0.71 


2110 


AF1 S71 "*7 


VJUH&l UllClld UUL1C11 1 OlJ IimJaUlIldl IVLN.rY ^CllC, pdlUdl !MA[uCllWC 


0.24 


2111 


Z92970 


Caenorhabditis elegans cosmid H06O01, complete sequence 


0.24 


2112 


ATVHSJ.QJ. 


/Vllcnidlld di LCI lid Id gCIlC 1UI r\Jfk.\.D"Z ty lAJllipiCIC l^lla 


0.61 


2113 


JVDoj J 1 


Rat embryonic glutamic acid decarboxylase gene, embryonic stop (ES) 
exon 


0.67 


2114 


X16715 


D. melanogaster gene for chorion protein sl6 


0.21 


2115 


U16271 


Homo sapiens AMP deaminase isoform L (AMPD2) gene, exons 1A 
and IB 


0.07 


2116 


AF3 17672 


Helicobacter pylori isolate 4424 vacuolating cytotoxin VacA (vacA) 
gene, partial cds 


0.24 


2117 


NM_0 10250 


Mus musculus gamma-arninobutyric acid (GABA-A) receptor, subunit 
alpha 1 (Gabral), mRNA 


0.76 


2118 


BCO 10786 


Mus musculus, Similar to CREB/ATF family transcription factor, 
clone MGC: 18836 IMAGE:421 1480, mRNA, complete cds 


0.69 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT TIE 

IT V r\J-/ VJ 


2119 


TJ67482 


A^Pthanococnis iannaQcTiii Qpption 94. of 1 SO of trip fomnlptp cpnnmp 

lVA^tAldllV/vV/xv^llO jaAlAia.Ox.'lllA o&VLlVsll ^» ■ Ul IJv VJA LilG IAJIIILJaCIC t^ClMJlilG 


0 94. 


2120 


U73373 


T-TTV-l plnnp 106*3 -9 -VPR vnr apnp rtntrmlptp rrlc 
i aa v i vivsii^ ivuj"a" virxv gene, wvjiai^jicic wua 


o 


2121 


AF007370 


StrPTYtoPfiPPiic nnpii mnn i pf* cfvtinn S'? of 1 QA of th** rnmnlpfp optinmp 

ijUv^IUvV/v^UO LJllWLXlllWl licit/ OWWUUll JJ VJA 1 7^ IU U1G wVilll^JICLC ^CaIVJIIIC 


u.uo 


2122 


AL449163 


T-Ti i m o n DMA cprnipnr'f* from nnp R P 1 1 P 1 f\ on rhrnmncnTtip f\ 

1 Alii 11X111 ±SL^ir\ OwtfUblltsG 11 Will trflUllw IxTl 1"JjjJj1U l/il LflLUJlllUoUlllC \Jj 

comnlete ^ennence ffi-Torno ^anienstl 


2E-11 


2123 


X63382 


Antithamnion sp. rhodoplast genes alpl, atpH, atpG, atpF, atpD, atpA, 
orfl orf2 find orfl 


0.081 


2124 


AF 186797 


Homo saniens RalGDS-like fRGL i eene exon 1 7 


0.009 


2125 


AF1 10796 


Rattus norvegicus GABAB1 receptor (GABABR1) gene, exons 1 
through 9 


4.2 


2126 


XM 038798 


Homo aniens KTAA1457 orotein fiCIAA1457i mRNA 


2E-96 


2127 


U96998 


Homo sapiens MET proto-oncogene, intron 19, 3' end 


0.67 


2128 


/VCUUUt A \J 


T af*fYwwMic lactic cnhcn luetic TT 1 J-O*? cpftfon 179 of 91 ft of thf* 
batlUvUvtuo lairUo ouusp. la^ua iijituj occLiun v i l. vja lio vja iaic 

pomT^lf tr^ ffptinrnp 

WUlli^JiwLW g&llUUlts 


0.084 


2129 


AB050422 


Macaca fascicularis brain cDNA, clone:QnpA-21421 


6.9 


2130 


AF375S97 


ivius iiiu.ocu.iuo iLicuiuiii diiu oiiun uiioJii j^-j"iiyuruxyd.(^i-v^ociiz.yiiic t\. 

HpiivHrnffpTiJicp nVAcflisid i optip pvnnc 9 fhrnii</h R anH rrinnrilptp pHq* 
UwiiyUAUgbiLaoc« ^lVAot/iiciiJ. ) gviKs, wav/ho lAiiuugii o, cxiiu i*uiujji&l& tua, 

nii pi par pptip fnr mitochondrial nrodnct 


2.2 


2131 


AF0097SR 


Drosophila melanogaster genomic scafFold 142000013385931, 

pfYtnnlptp ^ptiiipiipp 


0.26 


2132 


AJWOIV/J 


f^hrv^nli na rimarplioidp*: f^OT pptip rvartial f*OS 

v/lllj OvJllllu lAllluA WllUlUba V^V_/A gvilb, Udl Lid! V^A-' O 


0.031 


2133 


TT67S96 


A/Tf»tn5iTior , or , r*iic lannacphii cpftinn 68 of 1 SO nf thp rnmnlptp <rpnnmp 
iviCLiictiiV/wvfw^uo joAiAiaa^iui Ddsiiuii uo vji i jv ui laiv^ wuiikljiwlvw ^tiivjuiv 


2 


2134 


AJ223966 


Mus musculus NAGA gene 


0.63 


2135 




f^allipphnQ mnloph pammjil-plohin <ypnp pnmnlptp pdQ 

dill wwL/ 113 A11V1V/V11 gaJLlUIldA glUUlll WJlllL/lWLw WU3 


0.68 


2136 


U66872 


Danio rerio enhancer of rudimentary homolog mRNA, complete cds 


0.029 


2137 


A T271220 


Homo viTiipTi*! nartial gene for dv^tronhin intron 49 


0.009 


2138 


AF075602 


funic familicirtc nrrictiitTlfindin F9 Tf*c&vtic\T cnHtvtv* FP9 mRM A 

V^cAlllD AdllllAlCll 1j |J1 VJoLtlgltlAlLlAll AJ>^ ICVvCjJtUl & WJ LyjJw AJ/A 111AV1N/T.J 

complete cds 


0.23 


2139 


XM 045015 


Homo saniens hvoothetical orotein FLJ00052 (ELJ00052 # mRNA 


0.75 


2140 


AY034614 


Danio rerio Sec61 alpha form B mRNA, complete cds 


0.1 


2141 


XM_028150 


Homo aniens similar to MACROPHAGF MANNOSF RFCFPTOR 

JTAU111U !3<tL/lWlli3 dllllAlOA l\J lVl/iVyA.V7l 1 ATVVJij 1V1/\J.i1iv/uJj An 1 <Vv 1 <1T A vylv 

PRECURSOR (H. sapiens) (LOC90010), mRNA 


1.9 


2142 


M24026 


Rat MHC class I RT1 (HT44i mRNA Cu haolorvnei 3' end 

lull iTiiiv^ wiuJu a i\i a yi\ i t~ y iiu\im y u iidi/iv/Lj uv ^ wuu 


0.24 


2143 


AK010194 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310076E16, full insert sequence 


2 


2144 


AF165062 


Hepatitis C virus strain MD9-2 complete genome 


1.2 


2145 


AE007469 


Streptococcus pneumoniae section 152 of 194 of the complete genome 


2.1 


2146 


Z97349 


Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-06, complete sequence 


0.046 


2147 


NM 010161 


Mus musculus ecotropic viral integration site 2 (Evi2), mRNA 


0.02 
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2148 


AE005586 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 205 
of 290 


1.2 


2149 


AF332928 


Psacothea hilaris isolate HA15 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.069 


2150 


D49504 


Borrelia garinii gene for outer surface protein C, complete cds 


0.37 


2151 


L25647 


Homo sapiens fibroblast growth factor receptor gene (located in the 
central MHC) signal peptide and consecutive exon 


0.05 


2152 


AB040746 


Carassius auratus gene for c-MYC, complete cds 


0.063 


2153 


X55978 


T. brucei ESAG 8 gene for a leucine-rich repeat family protein 


0.12 


2154 


ABO 10426 


Phytoplasma sp. gene for AL1 like protein, complete cds 


0.009 


2155 


XM_005841 


Homo sapiens cdk inhibitor p21 binding protein (TOK-1), mRNA 


5.8 


2156 


AK025222 


Homo sapiens cDNA: FLJ21569 fis, clone COL06508 


1E-33 


2157 


AF234652 


Mesembryanthemum crystallinum protein kinase MK5 mRNA, 
complete cds 


0.2 


2158 


AE006513 


Streptococcus pyogenes Ml GAS strain SF370, section 42 of 167 of 
the complete genome 


9.2 


2159 


XM 003660 


Homo sapiens calcium modulating ligand (CAMLG), mRNA 


0.023 


2160 


AF003490 


Mesostoa kerri 16S ribosomal RNA gene, partial sequence 


0.000004 


2161 


XM_048768 


Homo sapiens tetratricopeptide repeat domain 3 (TTC3), mRNA 


6.7 


2162 


AF323442 


Plasmodium yoelii yoelii clone 10 235 kDa rhoptry protein (Ilia. 1) 
gene, partial cds 


0.019 


2163 


AKO 16560 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932441H21, full insert sequence 


4.7 


2164 


Y12332 


E.tenella plastid genes rpsl2, rps7, tuf, ORF45, and tRNA-Phe 


0.003 


2165 


AL1 14081 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.002 


2166 


AF332207 


Caenorhabditis elegans clone yk509b7 nuclear receptor NHR-57 gene, 
partial cds 


0.49 


2167 


AF102263 


Helicoverpa zea clone S211 cytochrome P450 (CYP6B8) mRNA, 
complete cds 


0.074 


2168 


AL133397 


Human DNA sequence from clone RPl-213Jlp on chromosome 
22ql 1.21-12.2, complete sequence [Homo sapiens] 


0.039 


2169 


AL157423 


Homo sapiens mRNA; cDNA DKFZp761O051 1 (from clone 
DKFZp761O0511) 


7E-15 


2170 


AF282578 


Homo sapiens clone 20ptel_cl622_12_3.3f sequence 


9E-17 


2171 


AB022280 


Oryzias latipes gene for soluble guanylyl cyclase alpha subunit, 
complete cds 


3.7 


2172 




Homo sapiens transcriptional repressor p54 gene, promoter region and 
partial cus 


2 


2173 


XM 007685 


Homo sapiens MAGE-like 2 (MAGEL2), mRNA 


0.008 


2174 


AK019539 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4921514G21, full insert sequence 


1.6 


2175 


AE007472 


Streptococcus pneumoniae section 155 of 194 of the complete genome 


6.2 


2176 


AL162381 


Human DNA sequence from clone RP1 1-174B19 on chromosome 6, 
complete sequence [Homo sapiens] 


0.059 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2177 


M21836 


Mouse extra-embryonic endodermal cytokeratin type II (EndoA) 
mRNA, complete cds 


e-138 


2178 


X06292 


Human c-fes/fps proto-oncogene 


1.7 


2179 


AtjOUZ /O 


Arabidopsis thaliana putative MLH1 protein (AT4g09140) mRNA, 
complete cds 


0.0001 


2180 


X78548 


G.max gene encoding epoxide hydrolase 


0.68 


2181 


Z56222 


H.sapiens CpG island DNA genomic Msel fragment, clone 97b6, 
reverse read cpg97b6.rtla 


5E-60 


2182 


AF200412 


Wiebesia brusi cytochrome oxidase subunit I (COI) gene, partial cds; 
tRNA-Leu gene, complete sequence; and cytochrome oxidase subunit 
II (COII) gene gene, partial cds; mitochondrial genes for 
mitochondrial products 


0.084 


2183 




Arabidopsis thaliana putative metal-binding protein 
{ML,nz, lo/Al 5g63330) mRNA, complete cds 


0.014 


2184 


-AJVl uUjoj / 


Homo sapiens CUG triplet repeat, RNA-binding protein 2 (CUGBP2), 

mKJNA 


0.22 


2185 


BC008152 


Mus musculus, caspase 1, clone MGC:6106 IMAGE:3583883, 
mRNA, complete cds 


1.9 


2186 




Dictyostelium discoideum K7 kinesin-like protein mRNA, complete 
cds 


0.66 


2187 


AF163863 


Mustela vison tyrosine aminotransferase gene, complete cds 


0.23 


2188 


XM_017198 


Homo sapiens hypothetical protein FLJ12085 (FLJ12085), mRNA 


4E-36 


2189 


T\ 1 (L 1 Of! 

ULOloK) 


Human PMP2 gene for peripheral myelin protein 2, exon 2 and 3 


0.68 


2190 


AB037920 


Clostridium botulinum genes for ORF-22, HA-70, HA- 17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.083 


2191 


/VtUUO /Uj 


Sulfolobus solfataricus section 62 of 272 of the complete genome 


1.6 


2192 


AB047401 


Periplaneta americana Vg-2 mRNA for vitellogenin-2, complete cds 


0.25 


2193 


U11491 


Human rotavirus M37 non-structural protein NSP1 mRNA, complete 
cds 


0.17 


2194 




Mus musculus flavohemoprotein b5/b5R mRNA, complete cds 


2.3 


2195 




Mus musculus genomic DNA, chromosome 7, clone:Bl 18_5, 
complete sequence 


0.56 


2196 


XM_010940 


Homo sapiens putative GR6 protein (GR6), mRNA 


2E-6I 


2197 


AF1 19554 


Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
complete cds 


0.009 


2198 


AY035995 


Streotococciis nneiimnrnnp Qtrnin ^P*M1 trvr\/-\i c^mo»-o po t\/ n t u,,«^ 
uu^iuw^ua jjiit^uiiujiiiac alioill OriN I _)UO lOpOlSOmeiaSe IV SUDUmt 

A (parC) gene, complete cds 


1.2 


2199 


XMJ)06892 


Homo sapiens cAMP responsive element binding protein-like 2 
(CREBL2), mRNA 


0.049 


2200 


] 

X87127 i 


Borrelia burgdorferi repeated DNA element, 30.5 kb circular plasmid 
copy 


0.46 


2201 


AB010958 < 


Clostridium thermocellum xynA and xynB genes for xylanase, 
complete cds 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbanto 


SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


2202 


Ajb0O3004 


Drosophila melanogaster genomic scaffold 142000013385492, 
complete sequence 


0.23 


2203 


U63418 


Mus musculus uncoupling protein (Ucp) gene, nuclear gene encoding 
mitochondrial protein, complete cds 


0.23 


2204 


AJ243265 


Homo sapiens partial PGM1 gene for phosphoglucomutase 1, exons 5- 
7 


e-135 


2205 




Yersinia pseudotuberculosis virulence plasmid pIBI yopA gene for 
Yopl protein 


2.1 


2206 


XM 011639 


Homo sapiens KIAA0535 gene product (KIAA0535), mRNA 


8E-22 


2207 


Ar098990 


Caenorhabditis elegans cosmid H35N03 


2.2 


2208 


AB037920 


Clostridium botulinum genes for ORF-22, HA-70, HA-17, HA-33, 
NTNHA, neurotoxin, complete cds 


0.2 


2209 


AB053087 


Human immunodeficiency virus type 1 pol gene, reverse transcriptase 
amino terminal part, partial cds, 99JP-NH3-II plasma virus 
clone:NH3RT-V3-2-9 


0.028 


2210 


NM 008989 


Mus musculus purine rich element binding protein A (Pura), mRNA 


0.028 


2211 


U97291 


Equus caballus random genomic clone JH226 


0.008 


2212 


AK001699 


Homo sapiens cDNA FLJ10837 fis, clone NT2RP4001260, highly 
similar to Homo sapiens mRNA for KIAA0875 protein 


0.016 


2213 


U84532 


Human dystrobrevin (DTN) gene, exon 4 


0.23 


2214 


Z54522 


Hsapiens CpG island DNA genomic Msel fragment, clone 12b6, 
forward read cpgl2b6.ftlc 


1.5 


2215 


Z69666 


Human DNA sequence from cosmid 24F8 from a contig from the tip 
of the short arm of chromosome 16, spanning 2Mb of 16pl3.3. 
Contains ESTs, repeat polymorphism and CpG island 


6.5 


2216 


M69205 


Ockelbo virus complete genome 


2.2 


2217 


AF202552 


Homo sapiens DNA methyltransferase (DNMT1) gene, exons 2, 3, 
and 4 


0.24 


2218 


D16541 


Ciucken DNA tor connectin(titin), partial sequence 


0.064 


2220 


AC023351 


Homo sapiens clone RP1 1-63 7 J20, complete sequence 


1E-59 


2221 


AT ci ^inc 

AL5 12305 


Human DNA sequence from clone RP1 1-85E24 on chromosome 6, 
complete sequence [Homo sapiens] 


0.002 


2222 


U67581 


Methanococcus jannaschii section 123 of 150 of the complete genome 


5 


2223 


AE007428 


Streptococcus pneumoniae section 1 1 1 of 194 of the complete genome 


1.9 


2224 


AK023601 


Homo sapiens cDNA FLJ13539 fis, clone PLACE1006640 


1.7 


2225 


AF375468 


riumu sapiens enaomeiiai protein l, receptor (PROCR) gene, complete 
cds 


0.016 


2226 


AF350726 


Symphyglossum sanguineum chloroplast tRNA-Leu (trnL) gene and 
trnL-trnF intergenic spacer, partial sequence 


1.4 


2227 


] 

NM 008308 i 


Mus musculus 5-hydroxytryptamine (serotonin) receptor 1 A (Htrla), 
mRNA 


0.23 


2228 


] 

Z62421 i 


sapiens CpG island DNA genomic Msel fragment, clone 68h2, 
forward read cpg68h2.ftla 


4E-68 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


2257 


XM 037301 


Homo sapiens hypothetical protein FLJ12697 (FLJ12697) mRNA 


2E-61 


2258 


XM 028046 


Homo sapiens KIAA0220 protein (KIAA0220), mRNA 


5E-61 


2259 


U94520 


Lactococcus lactis plasmid pND861 abortive phage resistance proteins 
(abiLi) and (abiLii) genes, complete cds 


0.81 


2260 


U00037 


Caenorhabditis elegans cosmid T20H4 


0.081 


2261 


Z74866 


S.cerevisiae chromosome XV reading frame ORF YOL124c 


0.55 


2262 


Z92779 


Caenorhabditis elegans cosmid C44E1, complete sequence 


0.7 


2263 


V00694 


Saccharomyces cerevisiae mitochondrion COX/OXI3 gene encoding 
for cytochrome oxidase subunit 1 


0.021 


2264 


ACO 10728 


Homo sapiens BAC clone RP1 1-258E22 from Y, complete sequence 


3E-30 


2265 


AJ406392 


Beta vulgaris partial En/Spm-like transposon, clone 44.9 


0.054 


2266 


AE001430 


Plasmodium falciparum chromosome 2, section 67 of 73 of the 
complete sequence 


0.064 


2267 


AF378270 


Pangasms sp. ZZSH-2001 microsatellite PSP-G 456 sequence 


1.1 


2268 


XM_047227 


Homo sapiens hypothetical gene supported by AL1221 12 
(LOC92776), mRNA 


0.38 


2269 


AE002758 


Drosophila melanogaster genomic scaffold 142000013385931, 
complete sequence 


0.2 


2270 


AL115861 


Botrytis cmerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.14 


2271 


AE006407 


Lactococcus lactis subsp. lactis IL1403 section 169 of 218 of the 
complete genome 


1.8 


2272 


XM 049351 


Homo sapiens KIAA1600 protein (KIAA1600), mRNA 


0.0007 


2273 


AE005977 


Caulobacter crescentus section 303 of 359 of the complete genome 


5.2 


2274 


XM 007057 


Homo sapiens chromosome 12 open reading frame 5 (C12orf5), 
mRNA 


0.63 


2275 


XM 031923 


Homo sapiens hypothetical gene supported by AK023761 
(LOC90466), mRNA 


0.0000007 


2276 


AJ005330 


pGAII(-) SK positive selection cloning vector gltS gene 


3E-21 


2277 


XM_029246 


Homo sapiens collagen, type I, alpha 2 (COL1 A2), mRNA 


0.051 


2278 


AF185279 


Homo sapiens polymerase delta small subunit (POLD2) gene, partial 
sequence 


0.002 


2279 


AF080592 


Mus musculus centrin (Cetn2) gene, complete cds 


1.8 


2280 


XM 033626 ] 


Homo sapiens nucleoporin-like protein 1 (NLP 1), mRNA 


3.6 


2281 


] 

ABO 11667 < 


[pomoea purpurea genes for dihydroflavonol 4-reductase, complete 

■"He 


0.25 


2282 


] 

AB011667 ( 


[pomoea purpurea genes for dihydroflavonol 4-reductase, complete 
:ds 


0.25 


2283 


XM 002698 ] 


fiomo sapiens LOC89635 (LOC89635), mRNA 


6.3 


2284 


BC009385 ] 


iomo sapiens, clone IMAGE:4127835, mRNA 


0.61 


2285 


U39718 1 


Mycoplasma genitalium section 40 of 51 of the complete genome 


3.4 


2286 


< 

AB036755 c 


Schizosaccharomyces pombe spol4/stll gene for Secl2-like protein, 
;omplete cds 


0.079 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRTP 


P \/AT TTP 
x VALUC 


2287 


AF369960 


Sepiella maindroni isolate c 16S ribosomal RNA gene, partial 

CPilllPTlPP* 1TliiYV*ii 01x1x1*1 III (TPTIP for m i tnr»li nn H r-i a 1 nrnrlnM 
■SfcAJ Ufcll^t*, llLll\J^XlljXlLUX£iX £dLG lUi ilULVM^llVlllUI ldi UIUUUL<1 


\j.ZL 


2288 


AE005360 


Escherichia coli 0157:H7 EDL933 genome, contig 2 of 3, section 29 
of 55 


u.o 


2289 


U01034 


odbiciii equine encepndiomyeiius virus iNorui American antigenic 
variety nonstructural polyprotein and structural polyprotein genes, 
conrolete cds 


1 7 


2290 


AF355103 


Pleurotus ostreatus linear mitochondrial plasmid mlp2, partial 

■9%sU Uvl IW 


57 


2291 


AC000043 


Homo sapiens Chromosome 22ql3 Cosmid Clone p74a8, complete 
sennetice 


0.085 


2292 


X89398 


H ^aniens lint? <?ene for uracil T3NA-<*1vrinovla<;p 


0.71 


2293 


AY008264 


YprQiniii Piitpror'olii'ifvi nliaof* clioflr r*rofpin Ionic f*oint*i1 pfp cpntiptifp 

x ^lollxld v/HLtl VA/VJll U^/u ^sxudgw OxlUWlV ^slULGlli iUlrUa, lAJlllLJxGLG owUllCxIVG 




2294 




uiuziuyLiiia. iiicicuiugdbicr 011^ diiu diiun- / jjd 1 genes dim pdiiidi qzzj 

o"p»n onH Proc*?^* ctptip 


O.J 


2295 


XM_040290 


Homo sapiens glutamate receptor, ionotropic, AMP A 1 (GRIA1), 
mRNA 


4 


2296 


AJ293578 


Homo sapiens partial MOCS1 gene, exons 2-8 


6.3 


2297 


BC007051 


1-fomn cjmipnc Honp A/ffiP* 1 lAf* S TA/f A OF- ^RIO^l mPM A 
nuiilU awipiCIia, ClUUG IVlVJv^. IZ^OJ lXYlrVVJIl. jOOjUjI, IILlVTN r\ 7 

complete cds 


7E-32 


2298 


M23103 


B.subtilis spoIVCB gene, complete cds 


0.2 


2299 


AF280606 


T'ritinim apcttvuiTi nmpffa crliaHin ncpiiHocypnp r*omTi1 ftf* cpnnpnr*p 

Xlllx^Ulll a&SUVUlll \JLLl^/fjCL L^xxaUxll ^JZM^UVllJgGxxG, lA/lli^JIGLG DCljllCHLC 


0 74 


2300 


AF266095 


Platycnemis pennipes large subunit ribosomal RNA gene, partial 
sequence; tRNA- Valine gene, complete sequence; and small subunit 
i luubunidi xvin/\ gcuc, pdiiidi sequence, iiixiouiiuiiujidi genes 101 
mitochondrial products 


0.077 


2301 


AF1 14927 


Qsif f^Tisn-ofm/r^PC nuctoriijTiiic f^RQI ^^51 email ciihnnit riKr\cnm ?a 1 DMA 
Oavf^llalUllljrU^o ^J<iolAJxltUxU.& v^J-JO 1 JJO oxlldll DU-Ulxllll I ILHJoUillctx XvlN /A 

apnp mitopliniiHrial qptip for rnitopixOfiHrijil R M A r'nmnlptp cpoiiPTir*p 
gGUG, iim.wvxiwiiu.i lax ^^iiw iui liiiiuviiuiiuixax -tvln zv, wiupicic ocl|uciiww 


0.23 


2302 


M6890 1 


"P TiniiltopiHsi aHptivlatp pvpIjiqp fc^isi} cxptip rntnnl pfp rHc 

r ■UluilUviua du^xijr laiv^ ^jvlaob \\syaj K&llw, wuillLllCLw wUa 


0.23 


2303 


AJ235878 


Amomhonhalhis Iciusiiien^i^ chloronla<;t atnR-rhcT. <*riacer isolate PT,1 


0.23 


2304 


NM_016158 


Homo sapiens erythrocyte transmembrane protein (LOC51 145), 
mRNA 


5E-34 


2305 


Y17254 


Punlote*; octocarinatii** pamma-tiibiiliii 9 miprnnnr , lpjir ppup 

JUUUlUlVij VvlVAnU llldlUi] t_) 1XXX1Q LLAl/IXXXXE 1111 vl UlllivlvCU K^XX^ 


2.3 


2306 


X99872 


Sulfolobus acidocaldarius pyrB gene 


0.033 


2307 


X81001 


H saniens HCG II mRNA 


0.001 


2308 


Z80213 


Caenorhabditis elegans cosmid C09E9, complete sequence 


0.025 


2309 


AF257022 


Salmo salar clone BHMS7-009 microsatellite sequence 


0.74 


2310 


AL138748 


Human DNA sequence from clone RP4-732E19 on chromosome 
Xq2 1.1-2 1.33 Contains GSSs, complete sequence [Homo sapiens] 


0.21 


2311 


AJ409108 


Entamoeba histolytica rpl27a-2 gene for ribosomal protein large 
subunit 27a-2, exons 1-2 


0.009 


2312 


U26556 


Human ferritin H (FTHL13) pseudogene 


0.008 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2313 


AF126320 


Protoclythia modesta 12S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.26 


2314 


AL137590 


Homo sapiens mRNA; cDNA DKFZp434K0610 (from clone 
DKFZp434K0610) 


2 


2315 


Z81471 


Caenorhabditis elegans cosmid C14B4, complete sequence 


0.029 


2316 


AE001567 


Helicobacter pylori, strain J99 section 128 of 132 of the complete 
genome 


0.028 


2317 


U47913 


Tribolium freemani Woot retrotransposon, partial sequence and 3' 
insertion junction F3 ( 1 ) 


0.081 


2318 


XM_037521 


Homo sapiens centrosomal P4. 1 -associated protein; uncharacterized 
bone marrow protein BM032 (BM032), mRNA 


2.2 


2319* 


AF269701 


Staphylococcus epidermidis strain SRI clone step.l017g02 genomic 
sequence 


0.7 


2320 


AF378183 


Oryza sativa cultivar Milyang23 cytosolic fructose- 1 mRNA, partial 
cds 


6.9 


2321 


AY019090 


Oryza sativa microsatellite MRG1415 containing (AT)X26, genomic 
sequence 


0.085 


2322 


AL449163 


Human DNA sequence from clone RP11-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00004 


2323 


M19540 


Adenovirus type 41 DNA-binding protein (DBP) gene, exons 2 and 3, 
23K protease, complete cds, hexon and 100K protein genes, last exons 


0.026 


2324 


AJ272029 


Homo sapiens partial CD30 gene for cytokine receptor CD30 and 
promoter region 


0.003 


2325 


AF082966 


Homo sapiens genomic sequence 


0.75 


2326 


AE002940 


Drosophila melanogaster genomic scaffold 142000013385820, 
complete sequence 


0.028 


2327 


Z54147 


Human DNA sequence from cosmid L129H7, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


0.001 


2328 


D 12590 


Rattus norvegicus DNA, RFLP marker at locus A403 


0.077 


2329 


XM 035524 


Homo sapiens KIAA1201 protein (KIAA1201), mRNA 


0.025 


2330 


XM 003575 


Homo sapiens soluble liver antigen/liver pancreas antigen 
(LOC51091), mRNA 


4E-27 


2331 


BC009503 


Homo sapiens, Gl to S phase transition* 1, clone MGC:1735 
IMAGE:2822947, mRNA, complete cds 


0.69 


2332 


BC005572 


Mus musculus, golgi vesicular membrane trafficking protein pi 8, 
clone MGC: 11601 IMAGE:3967968, mRNA, complete cds 


0.78 


2333 


U39726 


Mycoplasma genitalium section 48 of 51 of the complete genome 


ft 08^ 


2334 


XM__039942 


Homo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.72 




X67506 


C.thermocellum ancA gene 


0.084 


2336 


AL1 17553 


Homo sapiens mRNA; cDNA DKFZp564Fl 171 (from clone 
DKFZp564F1171) 


0.22 


2337 


AF1 15574 


Pisum sativum pathogenesis-related protein (DRR206-C) gene, 
complete cds 


0.009 


2338 


Y11874 


M.musculus uPA gene, promoter sequence 


2.3 
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|SEQID 
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DESCRIP 



P VALUE 



2339 BC008277 



Mus musculus, Similar to Endothelin receptor type A, clone 
MGC:6567 IMAGE :28 12426, mRNA, complete cds ' 



2.2 



2340 XM 009543 



Homo sapiens transcription factor AP-2 gamma (activating enhancer 
binding protein 2 gamma) (TFAP2C), mRNA 



0.21 



2341 NM 015809 



Mus musculus keratin-associated protein 5-4 (Krtap5-4), mRNA 



0.42 



2342 AE000813 



Methanobacterium thermoautotrophicum from bases 206991 to 
217929 (section 19 of 148) of the complete genome 



5.2 



2343 XM 034632 



Homo sapiens hypothetical protein PRQ2832 (PRQ2832), mRNA 



0.0000005 



2344 XM 034229 



Homo sapiens similar to hypothetical protein FLJ14260 (H. sapiens) 
(LOC90803), mRNA 



0.069 



2345 XM 033361 



Homo sapiens 62537 (FLJ00024), mRNA 



0.008 



2346 BC005332 



Homo sapiens, Similar to immunoglobulin kappa constant, clone 
MGC: 12418 IMAGE: 3 93 465 8, mRNA, complete cds 



3E-33 



2347 



AF321120 



2348 AB054512 



Mus musculus gap junction membrane channel protein alpha 8 (Gja8) 
gene, complete cds 



Berardius bairdii DNA, SINE flanking sequence Tuti35 locus 



2.1 



0.77 



2349 



Z98549 



Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS 
*** from contig 3-14, complete sequence 



0.029 



2351 XM 033483 



Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; ' 
AF103384; AF103390; AF103466; AF10> 



2352 XM 033483 



Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; ' 
AF103384; AF103390; AF103466; AF10> 



2353 XM 033483 



Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; 
AF103384; AF103390; AF103466; AF10> 



8E-33 



1E-27 



8E-33 



2354 XM 043728 



Homo sapiens hypothetical gene supported by AF074988 
(LOC92227), mRNA 



0.001 



2355 AE001425 



Plasmodium falciparum chromosome 2, section 62 of 73 of the 
complete sequence 



6.4 



2356 XM 033483 



Homo sapiens hypothetical gene supported by L39957; L43086; 
M85256; M99603; S56182; S73911; U07989; U79763; U96290; 
U96294; U96397; X58082; Z18327; AB006847; AB027436; 
AB027438; AF007572; AF035035; AF052535; AF099196; ' 
AF103384; AF103390; AF103466; AF10> 



9E-14 



2357 AF184072 



Homo sapiens target of methylation-induced silencing 1 (TMS1) gene, 
complete cds 



6.6 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VAT TTF 


2358 


XM 032347 


Homo sapiens region containing TLS-associated serine-arginine 
piuL&in x, iLfO daaixadicu bcnne-arginine proiein z, i-Lo-associated 
ovxiiiw aigiiuiiw pxuL&iii z,, 1 j^o aoMJciaicu ocriiic~dxgiiiine protein 
TLS-associated serine-arginine protein 1; TLS-associa> 


0 


2360 


X54807 


Human CYP2C8 gene for cytochrome P-450, 5* flank and exon 1 


0.00003 


2361 


X67506 


C.thermocellum ancA gene 




2362 


AL5 13350 


Human DNA ^pnnpnrp frnm Hnnf* PP1 1 -^fiQFift nn rlirnmncnma 0 
^.j.uAxicixi i^x^jt^ i3vv|u.viicv J.1U111 ^xuiic i\ri i"ju"L/o uii uiixuii 10 sonic 

complete sequence [Homo sapiens] 


0.003 


2363 


AF020663 


Homo sapiens myotubularin (MTM1) gene, promoter and exon 1 


0.068 


2364 


U43626 


Human chromommp 1 Snl 1 -n 1 "X nntativp tjxj a r^r\1ir>otinn nnmn ; n 

the g-aminobutyric acid receptor b3 and a5 gene cluster 


0.059 


2365 


AY019090 


Orvza sativa micros tel lite N/TR 0141 S ^nntainino- ( A T^V)^ a^nnmi^ 
sequence 


0.081 


2366 


U27586 


CvtODhaca heDarina henarinase III fl-IenO gene comnlete pHq 


v/.JO 


2367 


XM 047192 


Homo sapiens similar to MYOSIN HEAVY CHAIN, CARDIAC 
MUSCLE ALPHA ISOFORM (MYTTr-AT PUA^ f\A mncniln^ 
(LOC92771), mRNA 


e-133 


2368 


M20319 


S CerGVlRlflG A* 1 "H 1 ppnp rtonfprri n tr n tn i n r\t t~i a Tr* If* tv»cie+QTi/va ^Amntoto 
Il31U ^ ^vxxxx £,v_-ii\^ vuiu&lllllg cUlLlllUUId^UlG ICblaLdlHsC, UUIIILJICIC 

cds 


1.3 


2369 


XM_051036 


Homo sapiens carbohydrate (keratan sulfate Gal-6) sulfotransferase 1 
(CHST1), mRNA 




2370 


AB062951 


Macaca fascicularis brain cDNA clone :QmoA- 12201, full insert 
sequence 


1 < 

I.J 


2371 


U29335 


PiSUm Sativum Clone MPRTPCR62 wniinH-inrliirihif* rvtnrhrrtnip 

mRNA, partial cds 


0 A\ 

v.*f 1 


2372 


AF272630 


Ureaplasma urealyticum serovar 14 23 S ribosomal RNA and 5S 

nhnQnmJil RMA opiipc f*omn1ptf» c*»nnf»nr'f»* onH HJViirrlTv^-fXI ofa 

1 it/vov/xnciA avi^^-v ^^iito, v^uiii^icic oci^udiW'C, diiu uiiiyurojoiaie 
reductase (folA) gene, partial cds 


0.061 


2373 


AF179375 


MvcoDlasma fermenfanc nrflDI opnt* Tticprtirvn c#»mi<=»Tio*» TQ1 ^ifYT* 

Arijf vv/L/xaoxixa i^i iiiv^xiLOiia uxxj_/x gciig, xxlaClUvfll aCqUCIlUC IoIOjUU^ 

orfD2 gene 


n ^i 


2374 


X16876 


Soybean ENOD2B gene for Ngm-75 


0.96 


2375 


AF263831 


Homo saoiens chromosome 1 1 DNa«;f*T hvnprcpnQii-ivf* r^oirm 


z. / 


2376 


AF157162 


Svzvcites mecalocarous 18S ribosomal RNA Pene nartial c^npnrp 




2377 


AJ308588 


Medicago truncatula partial mRNA for nodulin 25 (nod25 gene), 
clone 3 


4.6 


2378 


X02870 


Bovine gene for cytokeratin VIb 


0.053 


2379 


AL590157 


Human DNA sequence from clone RP13-420K18 on chromosome X, 
complete sequence [Homo sapiens] 


4E-08 


2380 


AF282242 


Plasmodium falciparum strain Dd2 chloroquine resistance marker 
protein gene, complete cds 


4.2 


2381 


Z24745 


A.thaliana 2S albumin gene isoforms 1 and 2, complete CDS's 


0.6 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2382 


AL035418 


Human DNA sequence from clone RP1 -14113 on chromosome 

22a 13 1-13 33 Contains a CiSS eomnl pfp spnnpnrp n-Tnmn csmipncl 


1 fx 


2383 


Y17968 


Gallus sallus mRNA for hiffh mobilitv eroim 1 nrotein 


1 5 


2384 


XM 034735 


Homo sapiens similar to hypothetical protein FLJ21463 (H. sapiens) 
(LOC90883), mRNA 


5.6 


2385 


AF001415 


Arabidopsis thai i an a 14-3-3-like protein GF14 upsilon (GRF5) gene, 
complete cds 


1.3 


2386 


XM_009026 


Homo sapiens retinol dehydrogenase 8 (all-trans) (RDH8), mRNA 


1.9 


2387 


XM 003095 


Homo sapiens zinc finger protein 288 (ZNF288), mRNA 


1.8 


2388 


AE003083 


Drosophila melanogaster genomic scaffold 142000013385449, 
complete sequence 


0.51 


2389 


AE002798 


Drosophila melanogaster genomic scaffold 142000013385466, 
complete sequence 


0.018 


2390 


AL359506 


Human DNA seauence from clone CTD-2007A7 on chrnmn^omp 90 
Contains GSSs, complete sequence [Homo sapiens] 


0.28 


2391 


U32726 


Haemonhilus influenzae Rd section 41 of 161 of the nnmnlpfp aptinmp 


0.3 


2392 


ABO 10468 


Cyprinus carpio mRNA for CXC chemokine receptor- 1, complete cds 


0.9 


2393 


AE006886 


Sulfolobus solfataricus section 245 of 272 of the complete genome 


0.19 


2394 


M19810 


Human apolipoprotein B-100 (apoB) gene, exon 4 


0.43 


2395 


AB005548 


Homo saniens f?ene for souamoiis cell carcinoma antippn 7 narrial pHq 


0.0004 


2396 


AJ3 18337 


Plasmodium falciparum krueppell gene for Krueppel-like protein 


0.62 


2397 


U01102 


Human Inn*? C*lara cell** 10 lcrla <;pprptoTv lYrntpiti ffY^lOA opnp 

satellite and Alu repeat sequences, complete cds 


3.8 


2398 


AK017766 


mii<;cii1ii<; 8 <1av^ pmhrvn rDNA RTTCT^Nf fill 1 -1p»n <xtVi pnripVipH 

library, clone : 573 051 2 J02, full insert sequence 


1.7 


2399 


AE000032 


Mvconlasma nnenmoniae A/1129 section 48 of 6^ of thp romnlptp 
genome 


0.55 


2400 


AF333234 


Homo saniens recentor activator of nuclear factor kanna B li^and 
(RANKL) gene, promoter region 


0.22 


2401 


BCO 10652 


Homo sapiens, clone MGC:9753 IMAGE: 3 855206, mRNA, complete 
cds 


1.8 


2402 


AK007856 


Mus musculus 10 day old male pancreas cDNA, RUCEN full-length 
enriched library, clone:1810054D07, full insert sequence 


2E-94 


2403 


AF172723 


Homo sapiens integrin alpha 10 subunit (ITGA10) gene, exons 7 
through 17 and partial cds 


0.21 


2404 


NM_026570 


Mus musculus glioma-amplified sequence-41 (Gas41 -pending), 
mRNA 


1.1 


2405 


U78770 


Mus musculus spasmolytic polypeptide (mSP) gene, complete cds 


1.8 
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|SEQID| 
NO 
2406 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 
U26729 



DESCRIP 



Human neprilysin gene, non-coding exon 3 



2407 AE006346 



Lactococcus lactis subsp. lactis IL1403 section 108 of 218 of the 
I complete genome 



2408 AF258783 



Felis catus Niemann-Pick type CI disease protein (NPC1) mRNA, 
| complete cds 



2409 
2410 



AC007280 
D 13065 



Homo sapiens BAC clone RP11-332C7 from 7plL2-p21, complete 
I sequence 



O 

iu 

3 s : ; 

o 

CP 



2413 
2414 



2415 



2416 
2417 



2419 
2420 



2421 



2423 
2424 



2425 
2426 



2430 



2435 



I Mycoplasma capricolum mcs4 gene for MCS4 RNA 



2411 AF196058 



Drosophila pseudoobscura strain PSU606 period (per) gene, intron 4, 
I partial sequence 



2412 AF370534 



Arabidopsis thaliana Putative L-ascorbate peroxidase (AT4g09010) 
I mRNA, complete cds 



Pongo pygmaeus gamma-aminobutyric acid receptor A5 subunit 
AF084197 [duplicated gene, 5'UTR region 



Y17045 
AL021177 



| Plasmodium falciparum gltS gene" 



jCaenorhabditis elegans cosmid Y1A5A, complete sequence" 



I Drosophila melanogaster mRNA for drosophila dodeca-satellite 
AJ238847 |protein 1 (DDP-1) 



AB056333 jMacaca fascicularis brain cDNA, clone:QflA-13486 



24 1 8 AF 1 04821 I Danio rerio Hlx2 gene, promoter region 



AF138960 



Neospora caninum ycf24 protein (ycf24) gene, partial cds; DNA 
dependent RNA polymerase beta subunit (rpoB) gene, complete cds; 
and DNA dependent RNA polymerase beta subunit 1 (rpoCl) gene, 
partial cds, plastid genes for plastid products 



AF338712 [Emeus crassus mitochondrion, partial genome 



Z82625 [ R.prowazekii genomic DNA fragment (clone A3 96F) 



AK013530 



Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone:2900012M01, full insert sequence 



D86077 Homo sapiens DNA for cyclin G, partial cds 



2427 X94208 [H.sapiens TPR gene (112 lbp)" 



2428 AF2 11528 



Nicotiana tabacum Avr9/Cf-9 rapidly elicited protein 4 (ACRE4) 
I mRNA, complete cds 



2432 AK023305 [Homo sapiens cDNA FLJ1 3243 fis, clone OVARC100062r 



2434 AF167585 



fHomo sapiens clone C13 immunoglobulin heavy chain variable region 
[gene, partial cds 



AB013796 iCeratopteris richardii mRNA for CRHB6, complete cds 
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P VALUE 



0.92 



0.23 



0.39 



0.062 



1.5 



1.8 



0.00009 



0.063 



0.22 



0.22 



e-126 



0.056 



0.21 



1.3 



0.46 



2422 XM_015945 Homo sapiens hypothetical protein FLJ13842 (FLJ13842), mRNA | 6E-62 



X57010 Human COL2A1 gene for collagen II alpha 1 chain, exons E2-E15 | 0.63 
AF283525 [Homo sapiens FIP2 gene, exons 11, 12 and 13 " 4E . 71 



5.3 



0.02 



0.19 



0.22 



L35688 Homo sapiens (subclone H9 10_dl0 from PI LBL#3) DNA sequence | 5E-08 



Cavia porcellus seminal vesicle secretory protein SVP-1/-3/-4 gene 
2431 I U59711 [complete cds 



0.14 



0.002 



2433 I AB048868 [Macaca fascicularis brain cDNA, clone :Qnp A- 1005 5 0.044 



0.13 



0.024 






Table 3A Nearest Neighbor (BlastN vs. Genbanto 


SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


2436 


AF071481 


Homo sapiens potassium channel (KCNQ3) gene, exon 4 


1.9 


2437 


AF282242 


Plasmodium falciparum strain Dd2 chloroquine resistance marker 
protein gene, complete cds 


0.22 


2438 


AY018307 


Oryza sativa microsatellite MRG0632 containing (AT)X14, genomic 
sequence 


0.073 


2439 


274273 


S.cerevisiae chromosome IV reading frame ORF YDL225w 


0.028 


2440 


AY021257 


Oryza sativa microsatellite MRG3582 containing (TA)X22, genomic 
sequence 


0.018 


2441 


XM 012164 


Homo sapiens S 1 b20-like kinase (JIK), mRNA 


0.27 


2442 


XM 036587 


Homo sapiens hypothetical protein DKFZp564B1023 
PKFZP564B1023), mRNA 


0.008 


2443' 


AL1 12743 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.069 


2444 


AB020575 


Conocephalum conicum psbA gene, complete cds, strain J type 


0.074 


2445 


BC001772 


Homo sapiens, Similar to glutaminyl-tRNA synthetase, clone 
MGC:1525 IMAGE;3543728, mRNA, complete cds 


5.5 


2446 


AF289113 


Drosophila montana clone mon 10 transposon Penelope gene 
sequence 


0.064 


2447 


XM_0 15806 


Homo sapiens hypothetical protein FLJ22621 (FLJ22621), mRNA 


0 


2448 


AY047559 


Drosophila melanogaster GH07383 full length cDNA 


0.92 


2449 


XM_052662 


Homo sapiens amino-terminal enhancer of split (AES), mRNA 


0.59 


2450 


AL138723 


Human DNA sequence from clone RP1 1-446N1 on chromosome 6, 
complete sequence [Homo sapiens] 


0.008 


2451 


AF151389 


Dermatobia hominis strain Ponta Grossa tRNA-Ile gene, partial 
sequence; D-loop, complete sequence; and 12 S ribosomal RNA, 
partial sequence; mitochondrial genes for mitochondrial products 


0.46 


2452 


NM_018123 


Homo sapiens hypothetical protein FLJ10517 (FLJ10517), mRNA 


0.21 


2453 


XM_042803 


Homo sapiens hypothetical gene supported by AL359651 
(LOC92091), mRNA 


1.9 


2454 


AE001397 


Plasmodium falciparum chromosome 2, section 34 of 73 of the 
complete sequence 


5 


2455 


AFO 17996 


Mus musculus lysosomal acid lipase gene, promoter region 


0.022 


2456 


XM_028409 


Homo sapiens KIAA1871 protein (KIAA1871), mRNA 


0.0003 


2457 


XM 051692 


Homo sapiens KIAA1705 protein (KIAA1705), mRNA 


0.0003 


2458 


U00037 


Caenorhabditis elegans cosmid T20H4 


0.073 


2459 


] 

AL138748 . 


Human DNA sequence from clone RP4-732E19 on chromosome 
Xq21. 1-21.33 Contains GSSs, complete sequence [Homo sapiens] 


0.2 


2460 


J 

AF052744 i 


Homo sapiens transcriptional repressor p54 gene, promoter region and 
partial cds 


1.8 


2461 


XM 007006 ] 


Homo sapiens KIAA0985 protein (KIAA0985), mRNA 


1.7 


2462 


AF303046 1 


Homo sapiens prostinogen mRNA, complete cds 


0.44 


2463 


AK023562 ] 


Homo sapiens cDNA FLJ13500 fis, clone PLACE1004693 


e-116 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2464 


AF076692 


Aspergillus fumigatus aureobasidin-resistance protein (aurl) mRNA, 
complete cds 


3.2 


2465 


ArOo4olo 


Glossina morsitans morsitans thrombin inhibitor precursor (TTI) 
mRNA, complete cds 


0.44 


2466 


Z54147 


Human DNA sequence from cosmid L129H7, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


4E-09 


2467 


AM 040290 


Homo sapiens glutamate receptor, ionotropic, AMPA 1 (GRIA1), 
mRNA 


4 


2468 


AY026512 


Rattus norvegicus dynein-associated protein RKM23 (km23) mRNA, 
complete cds 


1.5 


2469 


AY033091 


Plasmodium berghei Pbs36 gene, complete cds 


0.0008 


2470 


XM_05l572 


Homo sapiens heat shock transcription factor 2 (HSF2), mRNA 


2E-42 


2471 


AF2 87003 


Floriceps minacanthus 18S ribosomal RNA gene, complete sequence 


1.8 


2472 


X99872 


Sulfolobus acidocaldarius pyrB gene 


0.2 


2473 


U49451 


Bombina orientalis SAP bombesin preprohormone mRNA, complete 
cds 


0.56 


2474 


AC024819 


Caenorhabditis elegans cosmid Y55B1AL, complete sequence 


6.9 


2475 


AF2 82602 


Litoria subglandulosa strain SAMA_R51051 NADH dehydrogenase 
subunit 4 (ND4) gene, partial cds; mitochondrial gene for 
mitochondrial product 


0.073 


2476 


AL591343 


Human DNA sequence from clone RP1 1-260H5 on chromosome 6, 
complete sequence [Homo sapiens] 


0.24 


2477 


Z69375 


Human DNA sequence from cosmid L174G8, Huntington's Disease 
Region, chromosome 4pl6.3 


3E-16 


2478 


TT"2 O 1 *7 y) 


Dictyostelium discoideum spore lysis A (splA) gene, complete cds 


0.25 


2479 


XM_045093 


Homo sapiens KIAA1320 protein (KIAA1320), mRNA 


0.53 


2480 


AY035224 


Gallus gallus cardiac muscle factor 1 gene, partial cds 


0.78 


2481 


D 14567 


Pemcilhum urticae mitochondnal 1-rRNA (large rRNA) gene and its 
flanking region 


0.25 


2482 


Ar 119218 


HIV-1 strain 96CG38.10 from Republic of the Congo, envelope 
glycoprotein, C2-C4 region (env) gene, partial cds 


0.002 


2483 


Ar3oo9o3 


Arabidopsis thaliana Unknown protein mRNA, complete cds 


0.057 


2484 


Z19595 


C.elegans her-1 gene 


0.19 


2485 


AJb00o341 


Lactococcus lactis subsp. lactis IL1403 section 103 of 218 of the 
complete genome 


6.1 


2486 


AT 


Human DNA sequence from clone RP1 1-469L10 on chromosome 6, 
cumpicic sequence [xioiiio sapiensj 


O.ZJ 


2487 


NC_001617 


Human rhinovirus 89 


0.078 


2488 


X65859 


H. sapiens pseudogene for the low affinity IL-8 receptor 


5.7 


2489 


AE007508 


Streptococcus pneumoniae section 191 of 194 of the complete genome 


0.63 


2490 


AF199367 


Entamoeba histolytica cdc48-like protein gene, complete cds 


0.56 


2491 


AF211160 


Pseudonaja textilis tissue-type liver phospholipase A2 inhibitor beta 
subunit isoform PTI-2B mRNA, complete cds 


2.2 
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Table 3A Nearest Neighbor (BlastN vs. Genbanlrt 


SEQK 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


2492 


AE000813 


Methanobacterium thermoautotrophicum from bases 206991 to 
217929 (section 19 of 148) of the complete genome 


5.5 


2493 


U67604 


Methanococcus jarmaschii section 146 of 150 of the complete genome 


1 


2494 


D16253 


Pichia canadensis mitochondrial gene for NADH dehydrogenase 
subunit 5, complete cds 


0.006 


2495 


NC 001530 


Human papillomavirus type 47, complete genome 


5.4 


2496 


AF286437 


Pseudavenzoaria ochropodis from host Tringa ochropus 16S 
ribosomal RNA gene, partial sequence; mitochondrial gene for 
mitochondrial product 


0.4 


2497 


XM 006069 


Homo sapiens weel+ (S. pombe) homolog (WEE1) mRNA 


3E-26 


- 

2498 


AF314019 


Rana boylii NADH dehydrogenase subunit I (ND1) gene, partial cds; 
tRNA-Ile, tRNA-Gln, and tRNA-Met genes, complete sequence; 
NADH dehydrogenase subunit II (ND2) gene, complete cds; tRNA- 
Trp, tRNA-Ala, tRNA-Asn, tRNA-Cys, and tRNA-Tyr genes, 
complete se> 


1.4 


2499 


AJ3 14911 


Dictyostelium discoideum DrnA gene for putative RNaselll, exons 1-3 


0.51 


2500 


M35228 


Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene, 
complete cds 


0.45 


2501 


AB021290 


Homo sapiens gene for chondromodulin-1, promoter and partial cds 


0.72 


2502 


AJ307807 


Astrammina rara partial 18S rRNA gene, isolate A44 


3.5 


2503 


AB005148 


Bos taurus mRNA for IL-1 receptor antagonist, complete cds 


0.84 


2504 


AE002687 


Drosophila melanogaster genomic scaffold 142000013385743, 
complete sequence 


0.23 


2505 


AF053496 


Caenorhabditis elegans beta chain spectrin homolog Smal (smal) 
mRNA, complete cds 


5.9 


2506 


X59611 


E.coh IncFV plasmid pED208 oriT region with genes traM, traJ and 
traY (part.) 


0.043 


2507 


XM 051897 


Homo sapiens phospholipase A2, group IVA (cytosolic, calcium- 
dependent) (PLA2G4A), mRNA 


3E-66 


2508 


AJ132890 


Bos taurus mRNA for acetyl-CoA-carboxylase 


0.18 


2509 


Y13537 


Homo sapiens LAG-3 gene, 3* region 


1.7 


2510 


AE002149 


CJreaplasma urealyticum section 50 of 59 of the complete genome 


0.027 


2511 


AF170573 j 


Influenza C virus C/Johannesburg/1/66 segment 5 nucleoprotein (NP) 
gene, partial cds 


2.4 


2512 


] 

AKO 16507 1 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
ibrary, clone:4931439C15, full insert sequence 


6.8 


2513 


] 
( 

AL354664 c 


luman DNA sequence from clone RP11-209D8 on chromosome 6. 
Contains ESTs and GSSs. Contains part of the COL12A1 gene for 
;ollagen type XII, alpha 1, complete sequence [Homo sapiens] 


0.082 


2514 


I 

AB042531 c 


5 isum sativum PsEXT mRNA for xyloglucan endotransglycosylase, 
:omplete cds 


2E-12 
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Table 3A Nearest Neighbor (BlastN vs. Genbank} 


SEQID 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


2515 


AL590371 


Human DNA ofviiipitrf* from r\r\n& ddi i tqcui i ^ 

iiunidii i-'ixa sequence irom cione tsJri on chromosome 6, 

conmlete <ienii£*Tir.p n-fnnnn csintpncl 


L5 


2516 


AF334780 


Homo sapiens pregnancy-induced growth inhibitor OKL38 gene, 
nartial cds 


0.8 


2517 


AF362371 


Dictyostelium discoideum histidine kinase Dhkl (dhkl) gene, 
comnlete cd^ 


0.021 


2518 


NM_0 12608 


Rattus norvegicus Membrane metallo-endopeptidase (neutral 
endoDentida^e/enkpnhnlina^^ f\Amc*\ mDWA 


2 


2519 


BC007173 


Ml IS mii^di1ii<s rlnnp lA/fA^TF -'XAQ'X 1 11 ml? XT A n4r ti 0 i ^Ac* 
iUUO hiimviuuo, wiving xivirt.vj£j. j*f y J X £, 1 , IIlXxJN/Y, pdXOai COS 


5E-53 


2520 


Y13870 


Homo saniens mRNA rnnt^ininty rr A CWf*. rpnpot rAr\r\<=>. r*? a n n 

a-ia/aiivf oajji^iia xiixvL^rt. y^yjlllcinim^ \\^rWJ)\j repeat, ClOHC i '*'-\_ - rV*J- 1 Zr 


0.06 


2521- 


AF051769 


Homo CfiniPTl^s tlvnllirAnirtacp^-lilrp* ncmirlrtrK^n** 1 /TTVAT T> i \ t-lnl 

xxvpiiiu Mpiciia irytuuiuiuudbe-iiKc pseuuogene i ^rii AU'i), partial 
sequence 


0.009 


2522 


AF3 09947 


Oictvn^tpliiim HicrrtiHpum R or* 1 A ro^ 1 A\ nana ^AmnlAtia ^j-.. _ _ j 

A - ,riw v ^ ji3Lt ' iiui11 uia^uiucuiii xvdiw i r\ \Tacij\) gene, complete cos, ana 
unknown eene 


u. n 


2523 


AF128392 


Arabidopsis thaliana BAC F15P23 


0.22 


2524 


AK020739 


iviua iiiuauuiua duuil itUUC £>piIUU COiU CUIN/\, K 1 rv cIN HUl-lengtll 

enriched libra rv Hnnp* A^lOlft/iT 17 fiill incpri cMinon^ 


LI 


2525 


Y09472 


B.taurus gene encoding preprododecapeptide 


0.24 


2526 


D87922 


r\iiL iui j uiA.01 bKcieidi inuscie souium c nannei, partial 
sequence 


0.4 


2527 


U70865 


V/madDiua auidiua ^v^i\o jJiCCUI bUI , IIltviNA, COITipiClC COS 


0.67 


2528 


ABO 12624 


Homo sapiens FLU gene for ERGB transcription factor, intron 4 and 
nartial cd<% 

y%JLl~ 1 1 Cl H V^VXi3 


A AC/I 

U.U54 


2529 


AF180492 


Crocuta crocuta microsatellite Ccrl2 sequence 


0.083 


2530 


AL078640 


nuitiaii ac^juciicc irom cione jjOru on enromosome LL, 
Contains an EST, complete sequence [Homo sapiens] 


2E-11 


2531 


Z63184 


H.sapiens CpG island DNA genomic Msel fragment, clone 7al, 
forward read cpg7al.ftld 


6E-22 


2532 


AL162055 


iiuiiiu aapiciia iiu\j.n/\, L,±^lN/\ LJIvFZ^p /O 1IVLZZZJ (ITOm Clone 

DKFZp761M2223) 


0.022 


2533 


AE001392 


ndMiiuuium idicipdrum enromosome z, section Zy ol 73 01 the 
comnlete ^eniienct* 




2535 


AF202903 


Sporopachydermia cereana var. D77-321B internal transcribed spacer 
i, j.oo riuuMjindi rviN/\ gene ana internal transcribed spacer 2, 
complete sequence; and large subunit ribosomal RNA gene, partial 
sequence 


A 1 A 
U. 14 


2536 


U59487 


Borrelia burgdorferi P35 antigen protein gene, and 7.5 kDa 
lipoprotein gene, complete cds 


0.029 


2537 


XM_030999 


Homo sapiens KIAA0444 protein (KIAA0444), mRNA 


0.012 


2538 


AF302838 


Mus musculus putative tumor suppressor LEU2 gene, exons 1 and 2 


0.63 


2539 


AF125423 


Bresiliidae gen. sp. LSP21-1 cytochrome oxidase I gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.66 


2540 


AP001348 


Homo sapiens genomic DNA, chromosome 21q21.1-q21.2 
clone:f39G8, LL56-APP region, complete sequence 


0.42 
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SEQIE 
NO 
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P VALUE 


2541 


XM 028324 


Homo sapiens hypothetical protein FLJ14195; KIAA1714 protein 
(FLJ14195), mRNA 


8E-98 


2542 


AF238860 


Streptococcus mutans mutacin I biosynthesis gene cluster, partial 
sequence 


4.6 


2543 


AE005391 


Escherichia coh 0157:H7 EDL933 genome, contig 3 of 3, section 10 
of 290 


1.6 


2544 


AK024518 


Homo sapiens cDNA: FLJ20865 fis, clone ADKA01850 


0.06 


2545 


AK026595 


Homo sapiens cDNA: FLJ22942 fis, clone KAT08170, highly similar 
to HUMMHBW62 Homo sapiens (clone pMF18) MHC class I HLA- 
Bw62 mRNA 


4E-39 


2546 


AT A AQ 1 


Human DNA sequence from clone RP11-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


2.3 


2547 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 


0.69 


2548 


Arzo3U67 


Homo sapiens clone 3ptel cllbt3 sequence 


1E-41 


2549 


AF286431 


Bychovskiata intermedia from host Charadrius leschenaulti 16S 
ribosomal RNA gene, partial sequence; mitochondrial gene for 
mitochondrial product 


0.94 


2550 


L35232 


Schizosaccharomyces pombe deoxyribonuclease (exo2) gene, exons 1- 
3, complete cds 


0.93 


2551 




Methanosarcina acetivorans strain C2 A MtaF (mtaF) and MtaG 
(mtaG) genes, complete cds 


0.034 


2552 


XM 018334 


Homo sapiens hypothetical protein FLJ22418 (FLJ22418), mRNA 


0.013 


2553 


T TKO<') 
U JJOOZ 


Visna virus clone 678 env3.2 envelope polyprotein (env) gene, partial 
cds 


0.17 


2554 




N.crassa conidation-specific protein genes con- 10 (complete cds.) and 
con- 13 (3 flank) 


0.079 


2555 


Ao050531 


Macaca fascicularis brain cDNA, clone:QnpA-20907 


1.4 


2556 


AB009693 


Mus musculus gene for mafG, complete cds 


5.6 


2557 


U67593 


Methanococcus jannaschii section 135 of 150 of the complete genome 


0.024 


2558 




Homo sapiens x 009 protein (MDS009), mRNA 


4 


2559 


Uo757o 


Methanococcus jannaschii section 120 of 150 of the complete genome 


1.8 


2560 


AE001759 


Thermotoga maritima section 71 of 136 of the complete genome 


1.9 


2561 


] 

AF363230 j 


auuiiiui iiuiiicuuuiiidiii ininscnpuon xactor vjSx (Cjsx) 
gene, partial cds 


0.59 


2562 


NM_019211 ] 


Rattus norvegicus RAS guanyl releasing protein 1 (Rasgrp), mRNA 


2.5 


2563 


U60070 < 


Solanum tuberosum disease resistance homolog (St 121) gene, partial 
:ds 


0.68 


2564 


AE006593 t 


Streptococcus pyogenes Ml GAS strain SF370, section 122 of 167 of 
he complete genome 


0.062 
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NO 
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DESCRIP 


P VALUE 


2565 


AF147231 


Wilkesia gymnoxiphium ASAPETALA1-A (ASAP 1 -A) gene, exons 3 
through 8, partial cds 


0.026 


2566 


AK018656 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130402C12, full insert sequence 


0.74 


2567 


X65859 


H. sapiens pseudogene for the low affinity IL-8 receptor 


5.4 


2568 


XM 030855 


Homo sapiens similar to KJAA1361 protein (H. sapiens) (LOC90315), 
mRNA 


0.65 


2569 


XM 031726 


Homo sapiens hypothetical protein FLJ 12577 (FLJ 12577), mRNA 


2.1 


2570 


Z68749 


Caenorhabditis elegans cosmid F56H1 1, complete sequence 


0.025 


2571 


AE000020 


Mvcor*1a^ma rjneirmoniae Kf 1 29 section 60 of 63 of the comnlete 
genome 


0.17 


2572 


U63091 


Orvctolatni*; ciiniciilus heta-fflohin-like ffene cluster locus control 
region 


0.13 


2573 


BC001339 


Homo sapiens, clone IMAGE: 3 46 1987, mRNA, partial cds 


1E-31 


2574 


U86962 


Dictyostelium discoideum MigA (migA) gene, complete cds 


0.9 


2575 


AY021629 


Orvza <;ativa micro<witellite MRG3954 containing n\A)X30 genomic 
sequence 


6.5 


2576 


AB063077 


Macaca fa^cicnlari^ brain cDNA clone'OtrA- 13349 full insert 
sequence 


0.59 


2577 


XM 044332 


Homo sapiens piwi (Drosophila)-like 1 (PIWIL1), mRNA 


4E-28 


2578 


Z97185 


S.pombe chromosome I cosmid c 16 A 10 


1.5 


2579 


L43603 


Gallus gallus Na+/K+-ATPase alpha- 1 subunit gene, promoter region 


0.7 


2580 


ACO 16645 


Homo sapiens chromosome 5 clone RP1 1-54A24, complete sequence 


0.0001 


2581 


AE001408 


Plasmodium falciparum chromosome 2, section 45 of 73 of the 
complete sequence 


0.25 


2582 


AF067699 


Ischnura prognata cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.45 


2583 


AC026061 


Homo sapiens BAC clone RP1 1-223K9 from Y, complete sequence 


2E-15 


2584 


Y13537 


Homo sapiens LAG-3 gene, 3' region 


0.24 


2585 


AE005977 


Caulobacter crescentus section 303 of 359 of the complete genome 


5.5 


2586 


U13190 


Angiostrongylus cantonensis clone G-2 retrotransposable element 
AcT-1 


0.077 


2587 


AF364550 


Drosophila melanogaster transposon cruiser, complete sequence 


0.07 


2588 


XM_034146 


Homo sapiens CD36 antigen (collagen type I receptor, 
thrombospondin receptor) (CD36), mRNA 


0.62 


2589 


AFO 14580 


Gonioctena viminalis 12S ribosomal RNA gene, mitochondrial gene 
for mitochondrial RNA, partial sequence 


2.1 


2590 


XM_042643 


Homo sapiens hypothetical protein DKFZp434G131 
(DKFZP434G131), mRNA 


0.68 


2591 


BC004324 


Homo sapiens, ribosomal protein S16, clone MGC; 10931 
IMAGE:3628799, mRNA, complete cds 


0.006 
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u_/ 

NO 


ACCESSN 


DESCRIP 


P VALUE 


2592 


U42430 


Mesocricetus auratus CD36 mRNA, complete cds 


0.71 


2593 


U16137 


Saccharomyces cerevisiae Mvplp (MVP1) gene, complete cds 


4.7 


2594 


AB047954 


Macaca fascicularis brain cDNA, clone:QnpA-12630 


0.65 


ZJ7J 


AF261697 


Homo sapiens clone 17ptel_c39t3 sequence 


0.024 


2596 


AF162598 


Drosophila virilis strain wl59 zeste protein (z) gene, partial cds 


0.063 


2597 


U40255 


Peromyscus sejugis ND3 and ND4L genes, complete cds, tRNA(arg) 
gene, complete sequence, tRNA(gly) gene, partial sequence, and ND4 
gene, partial cds, mitochondrial genes encoding mitochondrial 
products 


0.67 


2598 


Z63251 


Hsapiens CpG island DNA genomic Msel fragment, clone 7hl0, 
forward read cpg7hl0.fla 


0.003 


2599 


U68150 


Chelonus sp. 16S mitochondrial ribosomal RNA, mitochondrial gene, 
partial sequence 


0.003 


2600 


AF138281 


Arabidopsis thaliana phospholipase D-gamma-2 mRNA, complete cds 


0.071 


7601 


AF069985 


Mus musculus nitrilase homolog 1 (Nitl) gene, alternatively spliced 
product, complete cds 


0.39 


£.\y\j£. 


M83134 


Human germline IgH chain (hv3005) V3-region 


0.35 




AF275245 


Trama rara 12S small subunit ribosomal RNA gene, partial sequence; 
tRNA-Val gene, complete sequence; and 16S large subunit ribosomal 
RNA gene, partial sequence; mitochondrial genes for mitochondrial 
products 


0.59 


2604 


X93302 


S.cerevisiae mRNA for Msn5 protein 


0.05 


2605 


AK015536 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930470H18, full insert sequence 


0.7 


2606 


AC087735 


Caenorhabditis elegans cosmid Y35H6, complete sequence 


0.022 




AJ271002 


Mus musculus TFFl/pS2 gene for Trefoil Factorl/pS2, exons 1-3 


0.73 




NM_019664 


Mus musculus potassium inwardly-rectifying channel, subfamily J, 
member 15 (Kcnjl5), mRNA 


5.8 




AL450330 


Human DNA sequence from clone RP1 1-162H13 on chromosome 6 
Contains GSSs, complete sequence [Homo sapiens] 


0.25 


7610 


Z24671 


H. sapiens (D1S514) DNA segment containing (CA) repeat; clone 
AFMal51za5; single read 


0.007 


2611 


AE000773 


Aquifex aeolicus section 105 of 109 of the complete genome 


0.34 


2612 


X93508 


E.histolytica mRNA for 170kDa subunit of galactose/N- 
acetylgalactosamine binding lectin 


0.054 




AivUUJ/oU 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone, iliuu 18121, rull insert sequence 


0.15 


2614 


L81687 


Homo sapiens (subclone l_cl2 from PI H54) DNA sequence, 
complete sequence 


5.2 


2615 


M10090 


Human myoglobin gene, exon 1 


0.13 


2616 


NC_002670 


Bacteriophage bIL311, complete genome 


0.061 


2617 


AB016259 


Oryzias latipes gene for cardiac muscle actin, complete cds 


0.19 


2618 


U67578 


Methanococcus jannaschii section 120 of 150 of the complete genome 


2 
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2619 
2620 



2621 
2622 



2623 



2624 
2625 



2626 
2627 



2628 



a 



2629 
2630 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



DESCRIP 



AF186258 
X79777 



Homo sapiens sulfotransferase 1C1 (SULT1C1) gene, exons 2 and 3 



XM 042013 
AB044879 



P.furiosus aor, cmo and ado-hcy genes 



Homo sapiens cell division cycle 34 (CDC34), mRNA 



|Equus caballus DNA, microsatellite TKY379 



AF1 1795 1 Mus musculus lysyl oxidase-like protein 2 mRNA, partial cds 



AF3 10887 
XM 034353 



Dictyostelium discoideum RacB (racB) gene, partial cds; and 
| unknown gene 



[Homo sapiens KIAA1500 protein (KIAA1500), mRNA 



AK021859 [Homo sapiens cDNA FLJ11797 fis, clone HEMBA1006182 



AF280894 Pongo pygmaeus clone 3458 chromosome Xq genomic sequence 



Plasmodium falciparum chromosome 2, section 33 of 73 of the 
AE001396 I complete sequence 



Homo sapiens similar to RIKEN cDNA 2610307121 gene; 
(hypothetical protein, MNCb-4273 (M. musculus) (LOC92606) 
XM 046097 I mRNA 



2631 
2632 



2633 



2634 
2635 



2636 



2637 



2638 



2639 



2640 
2641 



2642 
2643 



2644 



2645 



2646 



X02438 [Leishmania tarentolae maxicircle DNA fragmenT 



Bombus terrestris 16S large subunit ribosomal RNA gene partial 
AF181582 [sequence 



M34149 [S.cerevisiae mitochondrial gap 12 sequence 



AJ400612 



Trioxys angelicae partial mitochondrial ATPase 6 gene for ATPase 6 
subunit 



AB02 1 290 Homo sapiens gene for chondromodulin- 1 , promoter and partial cds 



AJ133756 Drosophila melanogaster mRNA for jim zinc finger protein 



AL121589 



Human DNA sequence from clone RP5-1 140M3 on chromosome 20. 
Contains GSSs, complete sequence [Homo sapiensl 



XM_040430 Homo sapiens hypothetical protein FLJ10599 (FLJ10599), mRNA 



ABO 1 5484 Dugesia japonica mRNA for myosin heavy chain, complete cds 



XM_008161 Homo sapiens hypothetical protein FLJ20345 (FLJ20345), mRNA 



Mus musculus gamma-aminobutyric acid receptor rho2 subunit 
AF024621 | mRNA, complete cds 



L78777 



Homo sapiens (subclone 2_b8 from PI H49) DNA sequence 



AE001028 Archaeoglobus fulgidus section 79 of 172 of the complete genome 



AB013796 Ceratopteris richardii mRNA for CRHB6, complete cds 



Chlorella ellipsoidea chloroplast 16S-23S DNA spacer and ORF1 to 
X03848 ORF4 DNA 



B.aphidicola trpE and trpG genes for anthranilate synthase, 
Z2 1 93 8 1 components I and II 



XM_039805 |Homo sapiens hypothetical protein MGC3040 (MGC3040), mRNA 
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ACCESSN 



DESCRIP 



2647 
2648 



AF267217 
L39786 



Candidate; Carsonella ruddii natural-host Pachypsylla celtidis ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 



P VALUE 



ILupinus angustifolius conglutin gamma gene, complete cds 



Drosophila melanogaster genomic scaflFold 142000013385436, 
| complete sequence 



2652 
2653 



XM_044529 
AB035494 



iHomo sapiens calcium and integrin binding protein (DNA-dependent 
protein kinase interacting protein) (SIP2-28), mRNA 



I Alternaria alternata gene for Akt3-2, complete"cdT 



2654 AE001381 



Plasmodium falciparum chromosome 2, section 18 of 73 of the 
complete sequence 



fu 

o 



m 



2655 
2656 



XM013145 
NM 007976 



Homo sapiens Alport syndrome, mental retardation, midface 
hypoplasia and elliptocytosis chromosomal region, gene 1 
l(AMMECRl), mRNA 



jMus musculus coagulation factor V (F5), mRNA" 



2657 



U47913 



Tribolium freemani Woot retrotransposon, partial sequence and 3' 
insertion junction F3(l) 



2658 



D10331 



Pneumonia virus of mice mRNA for nonstructural proteins and major 
Inucleocapsid protein, complete cds 



2659 
2660 



AL390874 
AF020282 



iHuman DNA sequence from clone RP1 1-238K16 on chromosome 9, 
complete sequence [Homo sapiens] 



Dictyostelium discoideum DG2033 gene, partial cds 



2663 



U12186 



lOryctolagus cuniculus sodium/dicarboxylate cotransporter mRNA, 
[partial cds 



2664 AJ305053 



Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 



2665 AE006628 



I Streptococcus pyogenes Ml GAS strain SF370, section 157 of 167 of 
[the complete genome 



2666 I L76504 

2667 XM 032307 



Cronartium arizonicum isolate FSprP-1 18S ribosomal RNA gene, 
partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA 
gene and internal transcribed spacer 2, complete sequence; and 26S 
ribosomal RNA gene, partial sequence 



2668 | XM_010126 
2670 



Homo sapiens neuronal thread protein (AD7C-NTP), mRNA 



Homo sapiens stromal antigen 2 (STAG2), mRNA 



0.71 



2649 AB 049879 jMacaca fascicularis brain cDNA, clone:QnpA-18812 1 ^74" 

2650 AE002648 



2.1 



Homo sapiens hypothetical gene supported by AL080068 
2651 I XM_032228 |(LOC90510), mRNA 



2.4 



0.76 



0.7 



0.27 



0.65 



0.077 



0.087 



0.084 



0.003 



2.3 



2661 AF393505 jMus musculus potential meiotic DNA repair locus 2T 



2.3 



0.78 



2.4 



0.79 



7.2 



AB018798 I Plasmodium yoelii gene for P-Type ATPase, complete cds 



e-123 



2671 Z24358 



H. sapiens (D2S377) DNA segment containing (CA) repeat; clone 
|AFM319zf9; single read 



0.029 



2672 AE000878 



iMethanobacterium thermoautotrophicum from bases 976801 to 
1 992232 (section 84 of 148) of the complete genome 



0.0004 



6.2 
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> 
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DESCRIP 


P VALUE 


2673 


AL137717 


Homo sapiens mRNA; cDNA DKFZp434J1630 (from clone 
DKFZp434J1630) 


2E-31 


2674 


X61377 


S.cerevisiae XYK gene for xylulokinase 


2.4 


2675 


XM 034353 


Homo sapiens KIAA1500 protein (KIAA1500), mRNA 


4E-89 


2676 


JO 1268 


Slime mold (D.discoideum) actin 2 (sub 2) pseudogene 5* end 


0.085 


2677 


AF113932 


Carios puertoricensis 16S ribosomal RNA, partial sequence 


0.079 


2678 


XM 028752 


Homo sapiens ribophorin II (RPN2), mRNA 


2.1 


2o /y 


BC003254 


Mus musculus, Similar to dentatorubral pallidoluysian atrophy, clone 
MGC5758, mRNA, complete cds 


0.78 


2680 


NC 001871 


Feline foamy virus, complete genome 


0.27 


2681 


AF188115 


Simian immunodeficiency virus Gae protein gene complete cds* Pol 
protein gene, partial cds; Vif protein (Vif), Vpr protein (Vpr), 
truncated Tat protein (Tat), Rev protein (Rev), Env protein (Env), and 
Nef protein (Nef) genes, complete cds; and long te> 


0.77 


2682 


AE007382 


Streptococcus pneumoniae section 65 of 194 of the complete genome 


0.73 


2683 


AF1 90869 


Gorilla gorilla X chromosome clone 1, partial sequence 


0.082 


2685 


AK004792 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200015107, full insert sequence 


1E-15 


2686 


AE001387 


Plasmodium falciparum chromosome 2, section 24 of 73 of the 
complete sequence 


0.063 


2687 


AB051438 


Homo sapiens mRNA for KIAA165 1 protein, partial cds 


e-141 


2688 


AY047509 


Drosophila melanogaster GH01721 full length cDNA 


0.21 


2689 


AK016678 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933405P16, full insert sequence 


0.21 


2690 


Z 19055 


B.aphidicola tryptophan operon 


0.077 


2691 


XM_047737 


Homo sapiens ubiqmtin specific protease 25 (USP25), mRNA 


9E-56 


2692 


U13190 


Angiostrongylus cantonensis clone G-2 retrotransposable element 
AcT-1 


0.26 


2693 


X00858 


H. sapiens DHeK gene, exon 5 


0.27 




AK026620 


Homo sapiens cDNA; FLJ22967 fis, clone KAT10573, highly similar 
to AF151892 Homo sapiens CGI-134 protein mRNA 


0.077 


zoyo 


D87001 


Homo sapiens immunoglobulin lambda gene locus DNA, clone:47H9 
downstream contig 


0.25 


26V7 


U93276 


Mus musculus glucokinase gene, intron 1 and hypersensitive site VII 
region 


0.73 


2698 


AE006705 


Sulfolobus solfatancus section 64 of 272 of the complete genome 


1.2 


2699 


AE001423 


Plasmodium falciparum chromosome 2, section 60 of 73 of the 
complete sequence 


2.2 


2700 


X76799 ] 


H.sapiens genomic DNA (^nucleotide (AC) repeat clone alO 


6.4 


2701 


Z93324 ] 


L. major DNA containing open reading frame 


0.23 


2702 


AJ251163 ] 


Dictyostelium discoideum dymB gene for dynamin B homologue 


6.3 


2703 


] 

AK020562 ( 


VIus musculus adult male urinary bladder cDNA, RIKEN full-length 
snriched library, clone:9530018H14, full insert sequence 


0.76 
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Z /U*f 


AF1 16983 


Stenomesson variegatum ribulose 1,5-bisphosphate carboxylase large 
subunit (rbcL) gene ? partial cds; chloroplast gene for chloroplast 
product 


6.4 




2706 


XM 051524 


Homo sapiens archain 1 (ARCN1), mRNA 


0.66 




2707 


AF351620 


Homo sapiens lipocalin-1 interacting membrane receptor (LIMR) 
gene, complete cds 


0.084 




2708 


AF180472 


Homo sapiens aryl-hydrocarbon interacting protein-like 1 (AIPL1) 
gene, complete cds 


5.9 




z /u? 


AY008848 


Ictalurus punctatus clone Icpu-UA/3 MHC class I antigen gene, 
complete cds 


0.025 




9710 
Z / 1U 


NM_0 16862 


Mus musculus vesicle transport through interaction with t-SNAREs 1 
homolog (Vtil -pending), mRNA 


0.65 


Q 


971 1 
Z / 1 1 


AE004023 


Xylella fastidiosa 9a5c, section 169 of 229 of the complete genome 


3.9 




971 O 
Z /1Z 


XM 037239 


Homo sapiens actin, beta (ACTB), mRNA 


1E-49 




971 "5 
Z / 1 J 


AE001404 


Plasmodium falciparum chromosome 2, section 41 of 73 of the 
complete sequence 


0.7 


C3 


2714 


XM 050114 


Homo sapiens itLRA protein (TERA), mRNA 


5.9 




971 < 
Z / 1 J 


AK001082 


Homo sapiens cDNA FLJ10220 fis, clone HEMBA1007342 


0.56 


*M 


2716 


AF380155 


Ctenopharyngodon idella c-Fos (c-fos) mRNA, complete cds 


3.2 




771 7 
Z /I / 


AB027352 


Cordyceps sp. 97003 mitochondrial DNA for small subunit ribosomal 
RNA 


0.47 


iy 

It 


2718 


AB033220 


Lactococcus lactis gadB gene, untranslated by upstream frameshift, 
strain:01-l 


1.9 


L-a, 


2719 


U01841 


Mus musculus B6/CBAF1 J peroxisome proliferator activated receptor 
gamma (ppar) mRNA, complete cds 


0.009 




z /ZU 


AL023814 


Caenorhabditis elegans cosmid H08J19, complete sequence 


0.021 




2721 


XM_047737 


Homo sapiens ubiquitin specific protease 25 (USP25), mRNA 


5E-56 




2723 


AF389403 


Takifugu rubripes pericentriolar material 1 protein (PCM1) mRNA, 
complete cds 


1.9 




2724 


AE006369 


Lactococcus lactis subsp. lactis IL1403 section 131 of 218 of the 
complete genome 


0.21 




2726 


AF367865 


Mesembryanthemum crystallinum potassium transporter HAK2p 
(HAK2) mRNA, complete cds 


0.39 




2727 


XM_043040 


Homo sapiens hypothetical protein MGC2718 (MGC2718), mRNA 


1E-09 




979R 
Z /Zo 


AE001392 


Plasmodium falciparum chromosome 2, section 29 of 73 of the 
complete sequence 


0.0000004 




9790 

z /zy 


U31371 


Bovine herpesvirus 4 1.7-kb late RNA (LI. 7) mRNA, repeat region 


1.9 




2730 


AF195728 


Libellula flavida 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.025 




2731 


AK024658 ] 


Homo sapiens cDNA: FLJ21005 fis, clone CAE03745 


0.64 




2732 


AK025613 ] 


Homo sapiens cDNA: FLJ21960 fis, clone HEP05517 


1.4 




2733 


L11016 ] 


Homo sapiens lymphotoxin-beta gene, complete cds 


0.0001 




2734 


Y14422 ] 


M.musculus DNA for retinal protein 


L2 
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SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2735 


X83757 


E.histolytica 170 kDa lectin gene, 5' region 


0.64 


2736 


AK017535 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730409K12, full insert sequence 


0.64 


2737 


AY047509 


Drosophila melanogaster GHO 1 72 1 full length cDNA 


0.21 - 


2738 


AB019433 


Homo sapiens gene for T-cell nuclear receptor NOT (Nurrl), complete 
cds 


0.024 


2739 


U58731 


Caenorhabditis elegans cosmid W09B12 


0.61 


2740 


U35620 


Nicotiana tabacum ribulose-1,5 bisphosphate carboxylase/oxygenase 
large subunit N-methyltransferase (rbcmtT) mRNA, complete cds 


1 


2741 


AF181720 


Homo sapiens RU2 AS (RU2) gene, complete cds; and RU2S (RU2) 
gene, partial cds 


5.4 


2742 


AE006560 


Streptococcus pyogenes Ml GAS strain SF370, section 89 of 167 of 
the complete genome 


0.24 


2743 


AF346287 


Gonioctena pallida haplotype 61 mitochondrial control region, partial 
sequence 


5 


2744 


L05617 


Dictyostelium purpureum (Dpp5) DNA sequence, repeat region 


0.021 


2745 


AK021104 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030026M15, full insert sequence 


1 


2746 


L08174 


Romanomermis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


1.6 


2747 


AB006130 


Mesocricetus auratus mRNA for haptoglobin, complete cds 


0.24 


2748 


AK001448 


Homo sapiens cDNA FLJ10586 fis, clone NT2RP2003986 


0.0008 


2749 


M95192 


Drosophila melanogaster hairless protein (H) mRNA, complete cds 


3.6 


2750 


X53426 


Nicotiana sylvestris NySS41 and partial NySS42 genes for RUBISCO 
small subunit (SSU) 


0.63 


2751 


NC_001336 


Methanobacterium thermoautotrophicum plasmid pFVl, complete 
sequence 


0.074 


2752 


NM_0 11595 


Mus musculus tissue inhibitor of metalloproteinase 3 (Timp3), mRNA 


0.012 


2753 


AF299340 


Homo sapiens CD 164 gene, complete cds, alternatively spliced 


0.074 


2754 


AL121765 


S.pombe chromosome I cosmid cl786 


0.025 


2755 


U27078 


Trichaptum abietinum mitochondrial small subunit ribosomal RNA, 
mitochondrial gene, partial sequence 


0.44 


2756 


AF279292 


Mycoplasma hyopneumoniae strain ISU 232 YX1 (yxl) gene, partial 
cds; pi 10 membrane protein precursor, gene, complete cds; and YX2 
(yxl) gene, partial cds 


0.024 


Z / J / 


AL590371 


Human DNA sequence from clone RP1 1-295H23 on chromosome 6, 
complete sequence [Homo sapiens] 


0.61 


2758 


XM_041904 


Homo sapiens cathepsin S (CTSS), mRNA 


6 


2759 


D63170 


Gallus domesticus DNA for the terminal heterochromatic region of 
the Z chromosome 


1.4 


2760 


NM_023197 


Mus musculus RIKEN cDNA 2310008H09 gene (23 10008H09Rik), 
mRNA 


0.025 
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SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2761 


BC007150 


Mus museums, dystroglycan 1, clone MOC:o63l iMAuh. 34969 14, 
mRNA, complete cds 


0.18 


2762 


BC000480 


Homo sapiens, amyloid beta precursor protein-binding protein 1, 
59kD, clone MGC8690 IMAGE:2964638, mRNA, complete cds 


8E-65 


2764 


XM_036683 


Homo sapiens a disintegrin-like and metalloprotease (reprolysin type) 
with thrombospondin type 1 motif, 3 (ADAMTS3), mRNA 


3E-46 


2765 


AJ005581 


Encephahtozoon cumculi complete rDNA umt (16S, 5.8S, 23 S, ITS1 
and IGS) 


0.42 


2766 


AE005227 


Eschenclua con 0157:H7 EDL933 genome, contig 1 ot 3, section 51 
of 155 


ft ftft7 


iifi 

X, IsJ 1 


XM_048172 


Homo sapiens hypothetical protein DKFZp434F1017 
(DKFZP434F1017), mRNA 


1 Q 


97£Q 
z toy 


AF129087 


Medicago sativa mitogen-activated protein kinase homologue (TDY1) 
gene, complete cds 


0.66 


2770 


XM_008176 


Homo sapiens MYB binding protein (P160) la (MYBBP1A), mRNA 


5.2 


9771 

£.111. 


AC024865 


Caenorhabditis elegans cosmid Y74C10AL, complete sequence 




2772 


XM_036164 


Homo sapiens RAN binding protein 2 (RANBP2), mRNA 


1E-59 


9771 


AL023816 


Caenorhabditis elegans cosmid T05G11, complete sequence 


0.64 


9774. 


XM_004980 


Homo sapiens cystic fibrosis transmembrane conductance regulator, 
ATP-binding cassette (sub-family C, member 7) (CFTR), mRNA 


0.016 


2775 


XM_032209 


Homo sapiens hypothetical gene supported by AK023337 
(LOC90509), mRNA 


0.018 


ill ft 


Z69650 


Human DNA sequence from cosmid L69F7B, Huntington's Disease 
Region, chromosome 4pio.J contains xiuntington uisease \t\u) gene 




2777 


AJ305053 


Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
protein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 


0.46 


2778 


AKO 17531 


Mus musculus 8 days embryo cDNA, RIKJbN tull-iengtn enncnea 
library, clone:57304U9rlo, lull insert sequence 


n fki 


9770 


AF028784 


Rattus norvegicus glial fibrillary acidic proteins alpha and delta 
(GFAP) gene, alternatively spliced products, complete cds 


0.11 


2780 


AF101236 


Naja sputatrix neutral, phospholipase A2 (NPLA2) gene, complete cds 


0.054 


2781 


U71280 


Mesocricetus auratus cytochrome P450 aldosterone synthase 
(CYP1 1B2) gene, complete cds 


0.023 


97R9 


VR0007 

AOvvu / 


P. sativum gene for chalcone synthase 


0.15 


2783 


AF171040 


Anopheles funestus clone AF10 microsatellite sequence 


2 


2784 


XM 011464 


Homo sapiens hypothetical protein FLJ20170 (FLJ20170), mRNA 


0.21 


2785 


Y18929 


Homo sapiens wild type AF-4/FEL gene, part of intron 3, individual 
PB 


3 


2786 


L81581 


Homo sapiens (subclone 4__a4 from PI HI 1) DNA sequence, complete 
sequence 


0.024 
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|SEQID| 
NO 



ACCESSN 



DESCRIP 



P VALUE 



2787 
2788 



NM_02438 1 
X72688 



Rattus norvegicus ATP-stimulated glucocorticoid-receptor 
translocaton promoter (Gyk), mRNA 



0.17 



L.stagnalis LS-rabl mRNA 



0.2 



2789 
2790 



XM_033154 
AB017186 



Homo sapiens svntrophin, gamma 1 (SNTG1), mRNA 



0.021 



[Clostridium perfringens genes of hem operon, complete cds 



1.4 



2791 



U47416 



Solanum allophyllum NADH dehydrogenase subunit (ndhF) gene, 
chloroplast gene encoding chloroplast protein, partial cds 



0.2 



2792 



M21258 



Human small nuclear ribonucleoprotein (snRNP) E gene, exon 5 and 
| Alu repeats copy N and O 



0.051 



2793 



L05004 



Staphylococcus aureus dehydroquinate synthase (aroB) gene, 3* end 
cds; 3-phosphoshikimate-l-carboxyvinyltransferase (aroA) gene, 
complete cds; ORF3, complete cds 



0.21 



2794 AF302005 



Uma scoparia isolate ROM3406 cytochrome b gene, partial cds; 
mitochondrial gene for mitochondrial product 



2795 
2796 



AF288209 
AB048897 



Homo sapiens beta galactosyltransferase bGalT7 mRNA, complete cds | 2 
[Macaca fascicularis brain cDNA, clone:QnpA-17253 j~T 



2797 
2798 



AK025054 
U39886 



iHomo sapiens cDNA: FLJ21401 fis, clone CQL03678 



0.071 



Bos taurus Y-chromosome specific genomic sequence 



0.002 



2799 



AJ277649 I Caenorhabditis elegans mRNA for CHE- 14 protein 



0.6 



2800 AF058764 



Xenopus laevis bone morphogenetic protein 4 (BMP-4) gene, 
[complete cds 



0.56 



2801 
2802 



AE001398 
AF079456 



Plasmodium falciparum chromosome 2, section 35 of 73 of the 
I complete sequence 



0.15 



O'nyong-nyong virus strain SG650, complete genome 



0.003 



2803 AF087913 I Human endogenous retrovirus HER V-P-T47D 



2804 AB005053 Anomala cuprea entomopoxvirus gene for spheroidin, complete cds 



1.8 



2805 AF027656 Hom o sapiens cholesteryl ester transfer protein gene, promoter region! 0004 

2806 X M_031478 Homo sapiens vacuolar protein sorting protein 18 (VPS 18), mR NA 1 7 

2807 I XM J)01718 IHomo sapiens KIAA0475 gene product (KIAA0475), mRNA | Q.Q23 

2808 I L 21906 [Human troponin I, slow-twitch isoform (TNNI1) gene , exons 2-4 | IT 

2809 NM 006633 



Homo sapiens IQ motif containing GTPase activating protein 2 
|(IQGAP2), mRNA 



5E-39 



2810 AE001541 



Helicobacter pylori, strain J99 section 102 of 132 of the complete 
I genome 



0.2 



2811 I M63577 [Sxerevisiae SFP1 gene, complete cds 



0.023 



2812 



Homo sapiens cystic fibrosis transmembrane conductance regulator, 
XMJ)04980 ATP-binding cassette (sub-family C, member 7) (CFTR), mRNA 



0.013 



2813 AJ305053 



Caprine arthritis-encephalitis virus proviral partial pol gene for Pol 
Iprotein (reverse transcriptase), isolate CAEV Br/UFRGS-2/C767 



0.5 
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|SEQID 

NO ACCESSN 



DESCRIP 



P VALUE 



2814 
2815 



AK014450 
X99691 



Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, 
clone:3830422A13, full insert sequence 



0.075 



B.taurus DNA for agouti gene 



H.sapiens CpG island DNA genomic Msel fragment, clone 152h5, 
forward read cpg!52h5.ftla 



2816 



Z64630 



0.14 



Ovine pulmonary adenocarcinoma virus, complete sequence* and 
2817 | AF357971 flanking Ovis aries sequence 



4.9 



2818 
2819 



Drosophila melanogaster genomic scaffold 142000013385800, 
AE003336 [complete sequence 



0.16 



U72236 Dictyostelium discoideum ModA (modA) gene, complete cds 



0.61 



2820 XM 018110 



Homo sapiens gap junction protein, beta 3, 31kD (connexin 31) 
|(GJB3), mRNA 



5.1 



2821 
2822 



AJ005588 



Capsicum annuum gene encoding 5-epi-aristolochene synthase, exons 
1 to 7 



0.23 



AK022842 |Homo sapiens cDNA FLJ12780 fis, clone NT2RP2001813 



0.074 



2823 
2824 



AF138281 Arabidopsis thaliana phospholipase D-gamma-2 mRNA, complete cds 



0.071 



AB040882 Homo sapiens mRNA for KIAA1449 protein, partial cds 



0.18 



2825 AB022156 



Mus musculus Cctb gene for chaperonin containing TCP-1 beta 
|subunit, complete cds 



0.5 



2826 BC005516 



|Mus musculus, RIKEN cDNA 1200007D18 gene, clone MGC:7601 
IMAGE:3494155, mRNA, complete cds 



0.51 



2827 
2828 



AK025207 Homo sapiens cDNA: FLJ21554 fis, clone COL06330 



4.4 



NMJ) 18798 Mus musculus ubiquilin 2 (Ubqln2), mRNA 



0.043 



H. sapiens (D17S842) DNA segment containing (CA) repeat* clone 
2829 | Z23770 | AFM240xe5; single read 



0.00004 



2830 BC005762 



Mus musculus, Similar to proteasome (prosome, macropain) subunit, 
alpha type 1, clone MGC:6546 IMAGE: 265 5483, mRNA, complete ' 
cds 



0.67 



283 1 AB047932 Macaca fascicularis brain cDNA, clone:QnpA-10158 



0.68 



2832 U19765 [Human nucleic acid binding protein gene, complete cds 



0.18 



2833 AF200327 [Plasmodium chabaudi unknown genes 



0.028 



2834 



U67544 Methanococcus jannaschii section 86 of 150 of the complete genome 



0.14 



2835 BC010357 



iHomo sapiens, Similar to hypothetical protein, clone MGC: 13367 
|lMAGE:4249317, mRNA, complete cds 



2E-55 



2836 AE007471 Streptococcus pneumoniae section 154 of 194 of the complete genome 



5.2 



2837 1 AJ320185 [Solanum tuberosum snl gene for snakin-1, exons 1-2 



0.44 



2838 AF039954 Homo sapiens CC chemokine LCC-1 precursor, gene, complete cds 



0.002 



2839 Y13631 [Clostridium botulinum P-21, P-47 ntnh, bonT genes 



0.14 



2840 AL390181 



Homo sapiens mRNA; cDNA DKFZp547J125 (from clone 
|DKFZp547J125) 



0.019 
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OHI^ ID 

NO 


ACCESSN 


DESCRIP 


P VALUE 


OQA 1 


AF366200 


Diadema antillarum isolate dmid3 tRNA-Lys gene, partial sequence; 
ATPase 8 gene, complete cds; and ATPase 6 gene, partial cds; 
mitochondrial genes for mitochondrial products 


6.1 


2842 


AJ237688 


Cards familiaris polymorphic marker clone XA07 


0.53 


2843 


AF244979 


Mus musculus galectin-1 1 mRNA, variant b, complete cds, 
alternatively spliced 


0.25 


OQA A 


AL390174 


Homo sapiens mRNA; cDNA DKFZp547J184 (from clone 
DKFZp547J184) 


0.21 


OQA ^ 


U80230 


Human clotting factor VIII gene, junction regions of the deletion of 
exons 15 through 20 and LENE-l-like repeat region 


2E-09 


Zo4u 


U31086 


Gallus gallus neuron-glia adhesion molecule (Ng-CAM) gene, partial 
cds, exons 1 and 2 


0.064 


Z©4 / 


D16685 


Rice gene for lactate dehydrogenase, complete cds 


0.003 


2848 


AB007459 


Chara corallina ccml mRNA for myosin, complete cds 


0.067 


z©4y 


U63328 


Pissodes strobi vitellogenin (Vtg) mRNA, partial cds 


0.68 


2850 


U50727 


Human p40-phox (NCF4) gene, exon 8 


0.023 




NM_017190 


Rattus norvegicus Myelin-associated glycoprotein (Mag), mRNA 


6.5 


2852 


AF361105 


Homo sapiens interleukin 9 (IL9) gene, complete cds 


0.37 


2853 


AP001416 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:D34, 
LB7T-ERG region, complete sequence 


1.5 


ZoD4 


Y16771 


Phylica pubescens chloroplast partial trnL gene & intergenic spacer 


0.6 


2855 


AE005624 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 243 
of 290 


0.075 


2856 


AF159568 


Microbotryum violaceum ammonium transporter MEPa (MEPa) 
mRNA, complete cds 


0.7 


ZoD / 


M34149 


S.cerevisiae mitochondrial gap 12 sequence 


0.51 


Z53o 


D16541 


Chicken DNA for connectin(titin), partial sequence 


0.064 


zojV 


AC007039 


Homo sapiens BAC clone RP1 1-263 A15 from Y, complete sequence 


e-149 


ZooU 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.056 


2861 


XM_051160 


Homo sapiens KIAA1463 protein (KIAA1463), mRNA 


0.22 


ZftOZ 


XM_031527 


Homo sapiens leucine-rich PPR-motif containing (LRPPRC), mRNA 


0.0003 


ZoO-7 


AF323443 


Plasmodium yoelii yoelii clone 11 235 kDa rhoptry protein (IIa.2) 
gene, partial cds 


0.02 


Zoo4 


Y14324 


Staphylococcus aureus partial ORF292 and ORF271, ORF331 and 
ORF314 


0.62 


2865 


BC007568 


Homo sapiens, clone IMAGE: 3028427, mRNA, partial cds 


0.16 


2866 


U67518 


Methanococcus jannaschii section 60 of 150 of the complete genome 


0.67 


2867 


AJ133742 


Sus scrofa ryr3 gene (partial), exon 2 


0.68 


2868 


AF1 16520 


Mus musculus periplakin (ppl) gene, exons 2-14 


0.083 


2869 


D38044 


Human gene for Ah-receptor, exon 7-9 


0.25 
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SEQDD 
NU 


ACCESSN 


DESCRIP 


P VALUE 


2870 


XM 010162 


Homo sapiens methyl CpG binding protein 2 (Rett syndrome) 
(MECP2), mRNA 


1.7 


2871 


AF303030 


Lavatera phoenicea isolate 804 internal transcribed spacer 1, partial 
sequence, 5.8S ribosomal RNA, complete sequence and internal 
transcribed spacer 2, partial sequence 


0.01 


2872 


AE002274 


Chlamydia muridarum, section 6 of 85 of the complete genome 


1.8 


2873 


AB062056 


Homo sapiens p53DINPl gene for p53DINPla, p53DINPlb, complete 
cds, alternative splicing 


8E-26 


2874 


AF150965 


Bacillus anthracis plasmid pXOl resolvase X (resX) and transposase 
X (traX) genes, complete cds 


1.6 


2875 


U90292 


Human chromosome 16 duplicated adrenoleukodystrophy (ALD) 
gene, 5' breakpoint sequence 


3E-41 


2876 


U75361 


Rattus norvegicus Muncl3-3 mRNA, complete cds 


3E-18 


2877 


AF006686 


Plecotus auritus clone Paur06 microsatellite sequence 


5E-10 


2878 


NM_009694 


Mus musculus apolipoprotein B editing complex 2 (Apobec2), mRNA 


0.00009 


2879 


AF315316 


Vigna radiata ACC oxidase gene, complete cds 


0.003 


2880 


AE005714 


Caulobacter crescentus section 40 of 359 of the complete genome 


L5 


2881 


AL451135 


Human DNA sequence from clone RP1 1-45G20 on chromosome 6, 
complete sequence [Homo sapiens] 


1.9 


2882 


AF315316 


Vigna radiata ACC oxidase gene, complete cds 


0.003 


2883 


NM 009694 


Mus musculus apolipoprotein B editing complex 2 (Apobec2), mRNA 


0.00008 


2884 


AB047932 


Macaca fascicularis brain cDNA, clone:QnpA-10158 


0.16 


2885 


U18671 


Human Stat2 gene, complete cds 


0.23 


2886 


AB040670 


Bombyx mori copg2 mRNA for nonclathrin coat protein gamma2- 
COP, partial cds 


0.074 


2887 


AK0 17987 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
library, clone:5830443J22, full insert sequence 


0.035 


2889 


AE001419 


Plasmodium falciparum chromosome 2, section 56 of 73 of the 
complete sequence 


0.12 


2890 


AF275225 


Eriosoma lanigerum 12S small subunit ribosomal RNA gene, partial 
sequence; tRNA-Val gene, complete sequence; and 16S large subunit 
ribosomal RNA gene, partial sequence; mitochondrial genes for 
mitochondrial products 


0.35 


2891 


AF372971 


Arabidopsis thaliana AT3g23580/MDB19 7 mRNA, complete cds 


0.53 


2892 


AJ404851 


Homo sapiens partial ARNT gene for aryl hydrocarbon receptor 
nuclear translocator, exon 4 


3.7 


IRQ'? 


AJ012380 


Fugu rubripes partial rara gene, exons 4-10 


1.4 


2894 


AB064593 


Escherichia coli icdA gene for isocitrate dehydrogenase, complete cds 


0.027 


2895 


] 

AF182444 < 


Drosophila melanogaster disco-interacting protein (dipl) gene, 
alternatively spliced products, complete cds; and TO42 gene, partial 
:ds 


1.6 
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OT7/"fc TT"\ 

SEQ ID 
NU 


ACCESSN 


DESCRIP 


P VALUE 


2896 


AY020760 


Oryza sativa microsatellite MRG3085 containing (TA)X13, closest to 
marker R2976, genomic sequence 


0.007 


2897 


AE006318 


Lactococcus lactis subsp. lactis IL1403 section 80 of 218 of the 
complete genome 


0.53 


2898 


X01973 


Mouse gene for interferon alpha 4 (Mu IFN-alpha 4) 


3.9 


1QOO 

2099 


AF176677 


Candida albicans gamma-glutamylcysteine synthetase (GCS1) gene, 
complete cds 


0.07 


2900 


AF109009 


Prolaupala kukui clone 1092 transposon Laul sequence 


3.6 


2901 


D49739 


Chicken mRNA for leucine zipper protein, complete cds 


2.4 


2902 


U34610 


Mus musculus alpha- 1 (XVIII) collagen (COL18A1) gene, exon 6 and 
7 


0.006 


2903 


AK026763 


Homo sapiens cDNA: FLJ23110 fis, clone LNG07802 


0.34 


2904 


AF343914 


Campylobacter jejuni NCTC 11828 LOS biosynthesis cluster, partial 
sequence 


0.024 


2905 


AE002722 


Drosophila melanogaster genomic scaffold 142000013385665, 
complete sequence 


0.007 


2906 


XM_027242 


Homo sapiens KIAA0805 protein (KIAA0805), mRNA 


1.1 


2907 


Y15082 


Homo sapiens pl4.5 gene, partial exon 1 and promoter; hPOPl gene 
exon 1 and promoter and bidirectional promoter region 


0.67 


2908 


XM_009536 


Homo sapiens GTPase GES; REM protein (REM), mRNA 


5.9 


2909 


XM_036351 


Homo sapiens nuclear transcription factor Y, alpha (NFY A), mRNA 


1E-32 


2910 


AB037851 


Homo sapiens mRNA for KIAA1430 protein, partial cds 


0.58 


2911 


XM__002943 


Homo sapiens similar to PRO0327 protein (H. sapiens) (LOC92570), 
mRNA 


2.4 


2912 


AF069179 


Lysiphlebus cardui NADH dehydrogenase 1 gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.008 


2913 


AJ243545 


Arabidopsis thaliana gene for 67 kD chloroplastic RNA-binding 
protein, P67 


0.54 


2914 


AJ295795 


Isolepis producta chloroplast trnL gene intron 


2.2 


2915 


AJ3 11050 


Medicago sativa prl0.2 gene for class 10 PR protein 


0.26 


2916 


AK021512 


Homo sapiens cDNA FLJ11450 fis, clone HEMBA1001432 


0.25 


2917 


AK002979 


Mus musculus adult male brain cDNA, RJKEN full-length enriched 
library, clone:0710001P07, full insert sequence 


0.023 


TO 1 O 

2vlo 


AY007505 


Streptococcus mitis phage SMI PblA (pblA) and PblB (pblB) genes, 
complete cds; lysin (lys) gene, partial cds; and unknown genes 


0.026 


Id 1 O 

2919 


AK022043 


Homo sapiens cDNA FLJ11981 fis, clone HEMBB1001317 


0.69 


2920 


AF 162969 


Mycoplasma capncolum capncolum strain 960038 fructose 
biphosphate aldolase gene, partial cds 


0.064 


2921 


XM 051364 


Homo sapiens similar to immediate early protein (H. sapiens) 
(LOC93434), mRNA 


0.086 


2922 


XM_027642 


Homo sapiens KIAA0349 protein (KIAA0349), mRNA 


0.24 


2923 


XM 045819 


Homo sapiens cofilin 2 (muscle) (CFL2), mRNA 


1.9 


2924 


AF146362 


Drosophila melanogaster Huntington disease protein homolog 
(Hsap\HD) mRNA, complete cds 


0.23 
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SEQID 
NO 


/w^y^iz, o o in 




"D A/AT TTTh 
r VAL»UH 


2925 


U91993 


Solanum chacoense transposable element Sch2, sequence 


0.027 


2926 


AVfl0 1 780 
AJSAJZI /oZ 


xiomo sapiens cuina fjuj i i /zu ns, cione niiJviDAiuvozyj 


Z.J 


2927 


ATZOHAAAA 
ArZ /4444 


Carsonella ruddii ribosomal protein LI 1 (rpLl 1), ribosomal protein 
LI (rpLl), ribosomal protein L10 (rpLlO), ribosomal protein L7/L12 
(rpL7/L12), beta subunit of RNA polymerase (rpoB), beta-prime 
subunit of RNA polymerase (rpoC), ribosomal protein S12 > 


z 


2928 


A T4f\fl1fiA*7 

ArUUJU4 / 


Simian immunodeficiency virus isolate P2S1 patient P2, gpl20 (env) 
gene, partial cds 


U.U04 


2929 


/VISAJZZIUZ 


riomo sapiens cjjjna ri-j izu4u ns, cione ixtiJVLD£jiuuiy44 


n 
u 


2930 




Homo sapiens BAC clone CTB-34F8 from 7pl5-p21, complete 
sequence 


o noi 


2931 


A T7 1 H/l S1Q 

Ar iU4oiy 


Strumaria truncata trnL gene, partial sequence; chloroplast gene for 
chloroplast product 


0 ?S 


2932 


AF171916 


Trimeresurus flavomaculatus cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.21 


2933 


AJ011596 


Homo sapiens trapped 3' terminal exon, clone B2E8 


0.0003 


2934 


ACO/£Q 1 "57 


Homo sapiens GABAA receptor gamma 3 subunit (GABRG3) gene, 
exon 3 


O R 


2935 


AoUJlOoO 


Arabidopsis thaliana AATL1 gene for amino acid transporter-like 
protein 1, complete cds 


1 Q 


2936 


L, / 0454 


Cronartium appalachianum isolate Ca-1 18S ribosomal RNA gene, 
partial sequence; internal transcribed spacer 1, 5.8S ribosomal RNA 
gene and internal transcribed spacer 2, complete sequence; and 26S 
ribosomal RNA gene, partial sequence 


O 71 


2937 


AF366570 


Trypanosoma congolense calreticulin mRNA, complete cds 


0.084 


2938 


NM_004682 


Homo sapiens PC4 and SFRS1 interacting protein 1 (PSIP1), mRNA 


2 


2939 


AL050219 


Homo sapiens mRNA; cDNA DKFZp5»6J 1623 (irom clone 
DKFZp586J1623) 


1.8 


2940 


AF275687 


Plasmodium falciparum knob-associated histidine-rich protein 
(KAHRP) gene, complete cds 


0.009 


2941 


ATI A*)HC\1 

Ar 14Z /Ul 


Vigna umbellata maturase-like protein (matK) gene, complete cds; 
cmoropiasi gene ior cnioropiasi proauci 


0 Sft 

U.JU 


2942 


ArZZo / /D 


Mitthyridium undulatum small ribosomal protein 4 (rps4) gene, 
pamai cos, cmoropiasi gene ior cmoropiasi proauci 


O 07^ 


2943 


XM 034735 


Homo sapiens similar to hypothetical protein FLJ21463 (H. sapiens) 
(LOC90883), mRNA 


0 


2944 


AKO 17899 


Mus musculus adult male thymus cDNA, RIKEN full-length enriched 
library, clone:5830405M20, full insert sequence 


0.28 


2945 


Y10159 


D.discoideum racGAP gene 


6.2 


2946 


XMJH1562 


Homo sapiens hypothetical gene supported by NM_000181 
(LOC92707), mRNA 


0.025 


2947 


AF191257 


Trichuris trichiura microsatellite ttstrl4 


0.078 


2948 


Y08449 


E.caballus DNA segment containing CA microsatellite VHL137 


0.00003 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 




DtiaCRlP 


P VALUE 


2949 




Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
norary, cione.iiiuuzyuiz, lull insert sequence 


0.077 


2950 


NM_0 12779 


Rattus norvegicus Aquaporin 5 (Aqp5), mRNA 


2.2 


2951 


AJ4U4zVj 


Borelis schlumbergeri 18S rRNA gene, isolate 191 


0.25 


2952 


AE001365 


Plasmodium falciparum chromosome 2, section 2 of 73 of the 
complete sequence 


0.49 


2954 


"VI 1A1A 

I I J4J4 


Ovis aries mRNA for thyroid stimulating hormone receptor 


1.8 


2955 


AB026906 


Homo sapiens SDHD gene for small subunit of cytochrome b of 
succinate dehydrogenase, complete cds 


0.001 


2956 


rJLA/lUjo / 


Homo sapiens, hypothetical protein FLJ20534, clone MGC: 13432 
iMA\j±i.4jJ4l7z, mRNA, complete cds 


4 


2957 


AJ251957 


Ciona intestinalis mRNA for nuclear lamin (lamin L2 gene) 


2 


2958 




Mus musculus adult male testis cDNA, RIKEN full-length enriched 
liDrary, cione.4yj34UoJOo 5 lull insert sequence 


2E-14 


2959 


Z13985 


M.musculus control region sequence DNA 


0.072 


2960 


AT 1 *Qf\ 


Human DNA sequence from clone CTD-2053G21 on chromosome 
xq 13. 3 -2 1.2, complete sequence [Homo sapiens] 


0.007 


2961 


AB002807 


Glycine max DNA for nodulin 35, partial cds 


0.24 


2962 


AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.003 


2963 




Homo sapiens KrCl-ll 2©yfil3 CD21 gene, partial cds 


0.00001 


2964 




Homo sapiens similar to hypothetical protein PR02822 (H. sapiens) 


0.21 


2965 


AF288778 


Homo sapiens erythroleukemia translocation breakpoint region 


0.068 


2966 


ATTfl^TOQ/l 

Aru / jyy4 


Drosophila melanogaster gamma-tubulin (GTUB) gene, partial cds; 
and recombination repair protein 1 (RRP1) and thymidylate synthase 
(TS) genes, complete cds; and RNA-binding protein (RBP9) gene, 
partial sequence 


0.14 


2967 


AL121822 


Human DNA sequence from clone 346P1 1 on chromosome Xq2L2- 
21.33, complete sequence [Homo sapiens] 


0.0000001 


2968 


TT1 1 Ofl/I 
U-? !Zo4 


Mesotaenium caldariorum clone mesphylb phytochrome gene, 
complete cds 


0.59 


2969 


7UVl_UUooUO 


Homo sapiens mitogen-activated protein kinase 4 (MAPK4), mRNA 


1.8 


2970 


XM_027307 


Homo sapiens KIAA1209 protein (KIAA1209), mRNA 


0.001 


2971 


A 171 70*71 1 


HyioDates sp. KH.50 glycoprotein (RHAG) gene, intron 9 


0.56 


2972 


BC007133 


Mus musculus, apoptosis inhibitory protein 5, clone MGC:5830 
IMAGE 3594358 mRNA comnlete cdc 


J. 4 


2973 


AF282018 


Homo sapiens clone 15qtel_ctg0082_c262bt3 c290bt3 sequence 


2E-53 


2974 


AE004790 


Pseudomonas aeruginosa PA01, section 351 of 529 of the complete 
genome 


1.8 


2975 


XM 003050 


Homo sapiens similar to seven transmembrane domain orphan 
receptor (H. sapiens) (LOC63108), mRNA 


0.27 


2976 


AF353728 


Mus musculus myo-inositol monophosphatase 1 (Impal) gene, exons 
1 through 7 


1.2 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


2977 


AK005535 


Mus musculus adult female placenta cDNA, RIKEN full-length 
enriched library, clone: 1600025H15, full insert sequence 


0.55 


2978 


AJ002479 


Medicago truncatula ENBP1 gene, exons 1 to 12 


0.64 


2979 


AF154836 


Homo sapiens Ras-like GTP-binding protein (RAB27A) gene, exons 
lb and 2 


0.000001 


2981 


AF260819 


Plasmodium falciparum CGI protein gene, complete cds 


0.73 


2982 


AE006714 


Sulfolobus solfataricus section 73 of 272 of the complete genome 


0.02 


2984 


NM 004114 


Homo sapiens fibroblast growth factor 13 (FGF13), mRNA 


6.7 


2985 


AF155369 


Buthus martensii putative potassium ion channel blocker TXKsl 
mRNA, complete cds 


0.72 


2986 


XM_041836 


Homo sapiens similar to MHC class I region ORF (H. sapiens) 
(LOC91955), mRNA 


0.18 


2987 


AF153668 


Microcephala discoidea NADH dehydrogenase (ndhF) gene, partial 
cds; chloroplast gene for chloroplast product 


1.7 


2988 


AF216211 


Drosophila buzzatii alpha-esterase 3 (aE3) and alpha-esterase 4b 
(aE4b) genes, complete cds 


0.073 


2989 


AF352244 


Dermatophagoides farinae paramyosin-like allergen mRNA, partial 
cds 


0.025 


2990 


XM 007409 


Homo sapiens alkylation repair; alkB homolog (ABH), mRNA 


0.21 


2991 


AF005779 


Sus scrofa neuropeptide Y Yl receptor mRNA, complete cds 


0.66 


2992 


AF264749 


Arabidopsis thaliana hydrophobic protein RCI2B (RCI2B) and 
hydrophobic protein RCI2A (RCI2A) genes, complete cds 


0.022 


2994 


AL162033 


Homo sapiens mRNA; cDNA DKFZp434F1872 (from clone 
DKFZp434F 1 872) 


0.69 


2995 


AK013593 


Mus musculus adult male hippocampus cDNA, RIKEN full-length 
enriched library, clone: 2900024009, full insert sequence 


0.7 


2996 


AF222716 


Plasmodium falciparum clone 3D7 unconventional myosin PfM-B 
gene, complete cds 


0.027 


2997 


AK006599 


Mus musculus adult male testis cDNA, RIKEN fiill-length enriched 
library, clone: 1700034J06, full insert sequence 


2.2 


2998 


AF010509 


Plasmodium falciparum microsatellite TA39 sequence 


0.001 


2999 


AF070566 


Homo sapiens clones 24611 and 24705 mRNA sequence 


0.000003 


3000 


AF181676 


Homo sapiens clone 204112 polymorphic microsatellite sequence 


0.074 


3001 


M17169 


Chinese hamster glucose-regulated protein GRP78 mRNA, complete 
cds 


0.008 


3002 


NM 03 1236 


Rattus norvegicus alpha 1,2-fucosyltransferase (Futl), mRNA 


0.69 


3003 


AJYL_U 1 0 J z 1 


Homo sapiens region containing TBX3-iso protein; T-box 3 (ulnar 
mammary syndrome ) (iajcozjjjj, mKNA 


0.73 


3004 


BC005379 


Homo sapiens, Similar to plasminogen, clone MGC: 12496 
IMAGE:3934860, mRNA, complete cds 


0.000001 


3005 


AF276176 


Scaphirhynchus platorynchus microsatellite Spl-107 sequence 


0.009 


3006 


XMJ) 10048 


Homo sapiens like mouse brain protein E46 (E46L), mRNA 


0.47 


3007 


NC 001807 


Human mitochondrion, complete genome 


1E-10 


3008 


AE006103 


Pasteurella multocida PM70 section 70 of 204 of the complete genome 


0.4 
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|SEQID 

NO I ACCESSN 



DESCRIP 



P VALUE 



3009 AE006266 



Lactococcus lactis subsp. Iactis IL1403 section 28 of 218 of the 
| complete genome 



0.2 



3010 
3011 



AF022812 
AJ243901 



Dehalospirillum multivorans GTP cyclohydrolase II (ribA) gene, 
partial cds; and 3,4-dihydroxy-2-butanone-4-phosphate synthase 
(ribB), riboflavin synthase (ribH), tetrachloroethene reductive 
dehalogenase catalytically active subunit (pceA), and tetrachl> 



0.21 



I Mycoplasma hominis p75 gene, strain PG21 



1.4 
5E-33 



3013 
3014 



XM_027894 
AJ390525 



iHomo sapiens hypothetical protein DKFZp761C121 
(DKFZp761C121XmRNA 



[Candida albicans ORF DNA, clone 64gB 



3015 
3016 



AJ006873 
L07898 



Buchnera aphidicola plasmid pBUsl, repA2 gene, leuA gene and 
lORFl 



0.013 



[Entamoeba histolytica alpha-1 tubulin (tubl) gene, completTcdT 



0.024 



3017 1 L25107 \ Human LIS mRNA 



1.2 



3018 
3019 



AL122122 
V01 270 



[Homo sapiens mRNA; cDNA DKFZp434L098 (from clone 
|DKFZp434L098) 



0.007 



|Rattus norvegicus genes for 18S, 5.8S, and 28S liposomal RNaT 



2E-10 



3020 
3021 



AK023061 
U80449 



Homo sapiens cDNA FLJ12999 fis, clone NT2RP3000324 



0.12 



[Caenorhabditis elegans cosmid R10A10 



3022 AF355473 



Oryzias latipes germ cell-less protein (gel) mRNA, complete cds 



2.3 



3023 XM 037811 



[Homo sapiens hypothetical gene supported by AL1 17650 
(LOC91349), mRNA 



0.39 



3024 AK0 16492 



Mus museums adult male testis cDNA, RKEN full-length enriched 
library, clone:493 1430N09, full insert sequence 



0.004 



3025 XM 011189 



IHomo sapiens similar to DKFZP564K247 protein (H sapiens) 
|(LOC91537), mRNA 



3026 BC010710 



IMus musculus, Similar to ribosomal protein L19, clone MGC 6500 
IMAGE:2648593, mRNA, complete cds 



5.1 



3027 XM 017923 [Ho mo sapiens hypothetical protein FLJ12660 (FLJ12660) , mRNA | 4.1 

3028 | AC016671 [Homo sap iens BAC clone RP11-12J24 from Y. comp lete sem,^ | 0 .012 
3029 



Campylobacter jejuni major cell-binding factor (pebl) and receptor 
L13662 |genes, complete cds 



0.044 



3030 



iM.musculus pF2 genomic DNA, homology with mouse repetitive 
X77172 |elements Bl, B2 and human Alu 



0.008 



3031 AF074841 



Campylobacter lari tRNA-Ala and tRNA-Ile genes, complete sequence! 



0.016 



3032 AF272846 



Homo sapiens Fanconi anemia complementation group D2 protein 
|(FANCD2) gene, exon 1 



0.065 



3033 D50090 



Drosophila americana texana G-3-P dehydrogenase (alphaGpdh) 
[gene, exon 1-8, complete cds 



0.13 



3034 



Plasmodium falciparum chromosome 2, section 54 of 73 of the 
AE001417 complete sequence 



0.15 



3035 AK002007 



Homo sapiens cDNA FLJ1 1 145 fis, clone PLACE1006626, highly 
| similar to Homo sapiens mRNA for KIAA0928 protein 



0.026 
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SEQID 
NO 






13 \T A T T TC 

r VALUil 


3036 


AT 


Human DNA sequence from clone RP1 1-245H22 on chromosome 6 
^unictiiia vjoijs, uunipicic sequence iriomo sapiens j 


U.UU0U3 


3037 


AB030490 


Glycine max SG-05 gene for thiamin biosynthetic enzyme, complete 
cds 


0.003 


3038 


U05820 


Saccharomyces cerevisiae DA-box protein Smc2p (SMC2) gene, 
complete cds, and assimilatory sulfite reductase (MET 10) gene, 
partial cds 


1.7 


3039 


AFOOR7^1 


uracuana gracilis microsaieiiiie vjgi sequence 


0.07 


3041 


r\jr\j*T j jy j 


Carassius auratus clone gf-40 glutamic acid decarboxylase isoform 67 
1 ) mtviN/v, complete cos 


2 


3042 


A T40410R 
njtut j\jo 


ounies iiuirgiiiciiia 100 iiviN/\ gene, isolate ojo 


u.uuy 


3043 




Streptococcus pyogenes Ml GAS strain SF370, section 24 of 167 of 


o 


3044 


vVlVl UJU / O 1 


riomo sapiens enigma ^Lruvi aomain proieinj (gjniijJVIAJ, ITLKJN A 


JJb-lo 


3045 


AC024748 


Caenorhabditis elegans cosmid Yl 10A2AR, complete sequence 


0.15 


3046 


AL5 12282 


Human DNA sequence from clone XXyac-BA141F18 on chromosome 
X, complete sequence [Homo sapiens] 


0.06 


3047 


AF009980 


vJioiemur garnem coior vision opsin gene, exons j and o, ana 
complete cds 


0.17 


3048 




Carex exsiccata tRNA-Thr, partial sequence; trnT-trnL intergenic 
spacer, tRNA-Leu, and trnL-trnF intergenic spacer, complete 
sequence; and tRNA-Phe, partial sequence; chloroplast genes for 
cmorupiasi pruuucis 




3049 


a Ffifi/in^^ 

AJDULH-UJO 


Ayiena iasuaiosa ya^c, section loz 01 zzy 01 trie complete genome 


4.3 


3050 




T_T ooniono D T51 nana /V7 DD 1 \ 

ri. sapiens ktj gene ^AJ-rKr* gene Z) 


0.002 


3051 


AF279135 


Dictyostelium discoideum spore coat structural protein SP65 (cotE) 
gene, complete cds 


0.003 


3052 


AF7 HA^l 


ixanus norvegicus rxa i itlkjna, partial cos 


0. 12 


3053 


AF327114 


Homo sapiens clone 8qtel_cl9t7 sequence 


8E-17 


3054 




ivuiuiiLrfi xdscicuioiis urain cl/jna, cione.i^npA-zujy4 


n cn 


3055 


AJ235314 


Antilocapra americana mitochondrial DNA for D-loop 


0.17 


3056 


AF361481 


nomo sapiens miiocnonanai \j i K-oinaing protein 1 (OIFBP3) gene, 
complete cds; nuclear gene for mitochondrial product 


0.009 


3057 


AE006879 


Sulfolobus solfataricus section 238 of 272 of the complete genome 


1.6 


3058 


AJ277756 


ivcgionciia pneumupmia urvr j\ ^paniaij, recA gene, recA gene and 
sya gene (partial) 


0.21 


3059 


U73044 


Schizosaccharomvces nombe ARS binding nrotein 0 fahn?^ mRNA 
complete cds 


0.058 


3060 


AF043123 


Homo sapiens interleukin-7 receptor precursor (IL7R) gene, exon 1 


0.2 


3061 


L43603 


Gallus gallus Na+/K+-ATPase alpha- 1 subunit gene, promoter region 


0.48 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3062 


X68976 


N.viridescens mRNA for Hoxd-10 


0.092 


3063 


AF001454 


Helianthus animus Dc3 promoter-binding factor-2 (DPBF-2) mRNA, 
complete cds 


V/. 1 o 


3064 


AF086062 


Homo sapiens full length insert cDNA clone YZ06B1 1 




3065 


U67234 


Human clone HS5.29 Alu-Ya5 sequence 


v.UO / 


3066 


AF030199 


Mus musculus type 1 sigma receptor gene, complete cds 


0.45 


3067 


J05544 


Plasmodium falciDamm i?liico<!erihoQnhjitf* icnmpnicp ( trcri R 1 ^ o-^n/a 

complete cds 


0.2 


3068 


X17455 


Rat Bmyc gene 5* -fragment 




3069 


AF298207 


Dictvostelium discoideum transDoson thiiff-S comnletp ^pnnenrp 




3070 


NM_030238 


Mus musculus dynein, cvtODlasmic heaw chain 1 fDnchcH mRNA 


1.7 


3071 


AB032151 


Homo sapiens BABP gene for bile acid-binding protein [AKR 1C2], 
exon 1 to 7 


0.28 


3072 


NC 001481 


Feline calicivirus, complete genome 


1.9 


3073 


U49266 


Mus musculus growth hormone receptor/binding protein gene, exons 
9-10, and partial cds 


0.13 


3074 


Z24279 


H SaoiertS (D1S489^ DNA segment mntainina {C*A} rpivat- Hnnf» 

AFM309ve9; single read 


0.16 


3075 


AF228724 


xviid^uicub wuiivcisxi iN/vL*xi,-ueiiyurogenase suoumi *r gene, 
oartial seauence* NADH-dehvdroffenaop oiihnnit dT rNmdJ ^ or»np 
complete sequence; and tRNA-Thr gene, partial sequence; 
mitochondrial genes for mitochondrial products 


0.048 


3076 


U72723 


Oryza longistaminata receptor kinase-like protein (Xa21) gene, 
complete cds and family member C, pseudogene 


0.21 


3077 


U55184 


Human G protein Golf alpha gene, exon 12 and complete cds 


1E-16 


3078 


AC006119 


Mus musculus clone UWGC'mbaclOffan from 14D1-D? fT-PpIl 

*«UUVU4UU vaw&av tt ■ till/Civ 1 VbQI/ XI vlll X ft X 1 ^ 1 X \^W«hI 

Receptor Alpha Locus), complete sequence 


0 00004 


3079 


AK015900 


Mus musculus adult male testis cDNA, RUCEN full-length enriched 
library, clone:4930526F13, full insert sequence 


0.76 


3080 


AL133637 


Homo Saoiens mRNA* cDNA DKFZn414T1618 ffrnm Hnnp 

DKFZp434J1618); partial cds 


0.071 


3081 


S74063 


c-raf=raf protein [Xenopus laevis, mRNA, 2736 nt] 


0.56 


3082 


NM_0 14498 


Homo sapiens type II Golgi membrane protein (GPP 130), mRNA 


1.9 


3083 


AF238234 


Entamoeba histolytica diaphanous protein (dia) gene, partial cds 


2.1 


3084 


XM J) 10729 


Homo sapiens SWI/SNF related matrix associated art in rff*nf*nHpnt 
regulator of chromatin, subfamily f, member 1 (SMARCF1), mRNA 


0.65 


3085 


AK007542 


Mus musculus 10 day old male pancreas cDNA, RUCEN full-length 
enriched library, clone: 1810019D05, full insert sequence 


0.62 


3086 


AF228977 


Human T-cell lymphotropic virus type 1 isolate ATLL long terminal 
repeat, partial sequence 


0.0003 


3087 


AC091698 


Homo sapiens clone RP11-596J6, complete sequence 


0.073 


3088 


AF233882 


Homo sapiens plakoglobin (JUP) gene, exons 1 through 7 | 


4E-88 
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Table 3A Nearest Neighbor fBlastN vs. Genhank^ 


SEQ IE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


3089 


U29468 


Drosophila koepferae K04 San Luis, Argentina, Gandalf transposable 
element DNA< partial sequence 


0.2 


3090 


AF279292 


Mycoplasma hyopneumoniae strain ISU 232 YX1 (yxl) gene, partial 
cds; pi 10 membrane protein precursor, gene, complete cds; and YX2 
(yx2) gene, partial cds 


0.2 


3091 


AF391284 


Homo sapiens llpl5.5 clone LOH11A, partial sequence 


0.007 


3092 


AF259737 


Ovis aries microsatellite MNS-64 sequence 


0.66 


3093 


NM 030261 


Mus musculus hypothetical protein MGC7182 (MGC7182), mRNA 


6E-36 


3094 


AB047879 


Macaca fascicularis brain cDNA, clone:QnpA-14270 


0.00003 


3095 


X65607 


H.sapiens MT1X gene for metallothionein IX 


0.023 


3096 


AF231164 


Pseudosymblepharis schimperiana tRNA-Leu, partial sequence; trnL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence; chloroplast genes for chloroplast products 


0.06 


3097 


AF290431 


Mus musculus sequence downstream of Hyall; and Fus2 gene, 5' UTR 
and exon 1 


1.3 


3098 


X96762 


C.elegans mRNA for DNA topoisomerase I 


0.028 


3099 


AB036229 


Salganea nigrita mitochondrial COII gene for cytochrome oxidase II, 
partial cds, collection site:Lake Danao, Leyte, Philippines 


0.53 


3100 


AJ309183 


Staphylococcus aurexis partial coa gene for coagulase, type F, isolate 
rs59 


0.07 


3101 


AF240785 


Homo sapiens glutathione-S-transferase GSTT1 deletion/junction 
region sequence 


0.000004 


3102 


XM 031737 


Homo sapiens synaptosomal-associated protein, 23kD (SNAP23), 
mRNA 


0.0000004 


3103 


AY035097 


Arabidopsis thaliana putative proline-rich protein (F23H24. 15) 
mRNA, complete cds 


0.18 


^ 104. 


AF121069 


Anopheles farauti 3 12S small subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.065 


3105 


NM 016701 


Mus musculus nestin (Nes), mRNA 


0.23 


3106 


L41893 


Homo sapiens retinoblastoma susceptibility protein (RBI) gene, exon 
5, bases 44452-44939 in LI 1910 


0.008 


3107 


Z78623 


Hsapiens flow-sorted chromosome 6 TaqI fragment, SC6pA12B7 


2E-21 


3108 


XM_041367 


Homo sapiens NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 
4 (15kD, B15) (NDUFB4), mRNA 


2E-24 


3 109 


AL1 17617 ] 


Homo sapiens mRNA; cDNA DKFZp564H0764 (from clone 
DKFZp564H0764) 


0.17 


3110 


AK023907 ] 


Homo sapiens cDNA FLJ13845 fis, clone THYRO1000815 


7E-42 


3111 


M95396 1 


i^east putative transcriptional activator (ADA2) gene, complete cds 


3.8 


3112 


NC_002670 1 


Bacteriophage bIL3 11, complete genome 


0.073 


3113 


XM_037870 J 


too sapiens desmoplakin (DPI, DPII) (DSP), mRNA 


2.2 


3114 


1 

NM_032276 ( 


iomo sapiens hypothetical protein DKFZp547E052 
DKFZp547E052), mRNA 


0.43 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 






P VALUE 


3115 


AB048877 


Macaca fascicularis brain cDNA, clone:QnpA-16414 


3E-30 


3116 




ivius muscuius i j aays emoryo neaa cljina, KUsiiN tull-lengtn 
cimciicu iionuy, cione.jiiuu3j.Dio, mil insert sequence 


0.029 


3117 


XMJ)37053 


no i no sapiens rnaoaoia nimor deletion region protein 1 (RTDR1), 
mRNA 


9E-47 


3118 


AF156197 


Cinara pseudotaxifolia cytochrome oxidase II gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.18 


3119 




nomo bctpiens cluna fjlj 1 iy*+*+ ns, Clone iTJiJVl±3DiUUUo<>4 


0.003 


3120 


U23175 


Caenorhabditis elegans cosmid F57F10, complete sequence 


0.022 


3121 


AF267198 


Candidatus Carsonella ruddii natural-host Aiytaina genistae ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
suDunii ^aipvjj gene, complete cos, and Air synthase beta suDunit 
(atpD) gene, partial cds 


0.019 


3122 


AY026461 


nomo sapiens lud aomain containing protein 1 (CDCP1) mRNA, 
complete cds 


0.5 


3123 


MP nfOd7i 


i ypnionecies naians miiocnonanon, complete genome 


0.45 


3125 


A FlOR^'l 1 


nomo sapiens cione yqiei_ciut/ sequence 


0.00001 


3126 


rvT J7 / l_>o 


Homo sapiens clone 1 1 pur alpha-associated ribosomal RNA gene, 
paruai sequence 


2E-37 


3127 


AF1 10Q7H 
/\T i ivy /U 


ruv-i isoiaie i_.-yor>wii.uo country tiotswana, complete genome 


5.3 


3128 


AJ237690 


Canis familiaris polymorphic marker clone XC08 


1.3 


3129 


nEUUl 1 /O 


oorreiia ourgaorxen ^section oz or tv) or tne complete genome 


0.084 


3130 


AD71 111 


^ampyiooacier nyouei v-iivr i, v/Krz, Kjist j ana UKr4 


6.7 


3131 


AF1 1 ?9d7 

AT 1 IZZt / 


Merluccius productus Mpro-1 mitochondrial control region, partial 
sequence 


O.OOOo 


3132 


XM_002378 


Homo sapiens uncharacterized hypothalamus protein HT010 (HT010), 
mRNA 


4E-28 


3133 


rvJZtZ / AO 


^venopus idcvis uuvina ior pzi activated Kinase z (JrAivz genej 


U.ZJ 


3134 


AF049711 


Petunia x hybrida pollen-specific 1-aminocyclopropane-l-carboxylate 
synthase (ACS2) gene, complete cds 


0.24 


3135 


wivi uo j U7 1 


Homo sapiens KIAA0001 gene product; putative G-protein-coupled 
receptor; G protein coupled receptor for UDP-glucose (KIAA0001), 

mPtsJA 


z.z 


3136 


Z33368 


Mcapricolum DNA for CONTIG MCABF 


0.001 


3137 


AE001433 


riasmouium iaiciparum cnromosome 2, section 7U ot 73 oi tne 
complete sequence 


0.001 


3138 


AKO 17235 


ivius muscuius aauu maie pituitaiy gland clJNA, KJJvbN tull-lengui 
enriched library, clone:5330405D10, full insert sequence 


0.7 


3139 


Z57308 


H Rani ens CnO island DNA ffennmir \^cf» 1 fruffmpnt Hnnp 170o-0 
reverse read cpgl70g9.rtla 


0.023 


3140 


AC005485 


Homo sapiens PAC clone RP5-998M2 from 7q33-q35, complete 
sequence 


5.4 


3141 


AF1 18654 


Leishmania donovani ATPase 6 gene; kinetoplast gene for kinetoplast 
product 


0.66 
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Table 3 A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3142 


AJ271879 


Uncultured eubacterium plasmid pIEl 130 DNA for repA gene, repC 
gene, repB gene, mobA gene, mobB gene, mobC gene, cac gene, catlll 
gene, suIII gene, strA gene, strB gene and aph(3*)-I gene 


0.26 


3143 


Y00165 


Sus scrofa mRNA for link protein 


0.079 


3144 


AF283982 


Caenorhabditis elegans transcription enhancer factor- 1 -like protein 
EGL-44 (egl-44) gene, complete cds 


0.0007 


3145 


AE004926 


Pseudomonas aeruginosa PA01, section 487 of 529 of the complete 
genome 


0.44 


3146 


AE004738 


Pseudomonas aeruginosa PA01, section 299 of 529 of the complete 
genome 


1.4 


3147 


AC083824 


Homo sapiens clone RP1 1-91J18, complete sequence 


0.59 


3148 


AY023127 


Oryza sativa microsatellite MRG5452 containing (GCA)X9, closest to 
marker C402, genomic sequence 


0.77 


3149 


XM_002943 


Homo sapiens similar to PRO0327 protein (H. sapiens) (LOC92570), 
mRNA 


0.073 


3150 


D88151 


Clostridium perfringens DNA for D-alanine:D-alanine ligase, cortical 
fragment-lytic enzyme 


0.057 


3151 


XM_009013 


Homo sapiens similar to tumor necrosis factor (ligand) superfamily, 
member 14 (H. sapiens) (LOC63472), mRNA 


1.7 


3152 


NM_0 12702 


Rattus norvegicus Carcinoembryonic antigen gene family (CGM1) 
(Cgml), mRNA 


1.5 


3153 


AJ131041 


Clogmia albipunctata mRNA for Hunchback protein, partial 


0.019 


3154 


AE000720 


Aquifex aeolicus section 52 of 109 of the complete genome 


5.3 


3155 


X74789 


Potato leaf roll virus RNA sequence 


0.47 


3156 


AF064478 


Homo sapiens natural resistance-associated macrophage protein 2 
(NRAMP2) gene, exons 4 and 5 


0.021 


3157 


M84761 


Rat fatty acid synthase gene, complete cds 


0.25 


3158 


AF239566 


Polycycnis ornata tRNA-Leu (trnL) gene, partial sequence; and trnL-F 
intergenic spacer region, complete sequence; chloroplast gene for 
chloroplast product 


0.007 


3159 


AF343914 


Campylobacter jejuni NCTC 1 1828 LOS biosynthesis cluster, partial 
sequence 


0.025 


3160 


AL 13 7443 


Homo sapiens mRNA; cDNA DKFZp566G184 (from clone 
DKFZp566G184) 


0.99 


3161 


X54452 


D.discoideum culmination spiA (Dd31) gene 


0.71 


3162 


X80908 


L.esculentum gene for fruit ripening polygalacturonase 


0.069 


3163 


XM_007838 


Homo sapiens nuclear factor of activated T-cells 5, tonicity-responsive 
(NFAT5), mRNA 


0.5 


3164 


AF1 19385 


Mus museums KvLQTl gene, KvLQTl-AS allele, partial intron 
sequence 


0.17 


3165 


X86449 


E.caballus DNA segment containing microsatellite 


1E-25 


3166 


Z22800 


H.sapiens microsatellite repeat 


0.000004 


3167 


U22842 


Lagos bat virus nucleoprotein gene, partial cds 


0.027 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3168 


AF108897 


Campylobacter jejuni PglB (pglB) gene, partial cds; and PglA (pglA), 
PglC (pglC), PglD (pglD), PglE (pglE), PglF (pglF), and PglG (pglG) 
genes, complete cds 


0.73 


3169 


AE000556 


Helicobacter pylori 26695 section 34 of 134 of the complete genome 


0.016 


3170 


AK024295 


Homo sapiens cDNA FLJ14233 fis, clone NT2RP400O049 


0.46 


3171 


U13835 


Mus musculus c-abl protein (c-abl) gene, exons 1 and IB, 
alternatively spliced 


4.5 


3172 


AJ132887 


Homo sapiens telomere-junction DNA sequence, haplotype B 


6E-13 


3173 


XM 007277 


Homo sapiens hypothetical protein DKFZp761F2014 
(DKFZp761F2014), mRNA 


1.9 


3174 


L40470 


Pisum sativum (clone HW39-4) gene fragment 


0.05 


3175 


AF206594 


Lacerta vivipara 12S ribosomal RNA gene, partial sequence; tRNA- 
Val gene, complete sequence; and 16S ribosomal RNA gene, partial 
sequence; mitochondrial genes for mitochondrial products 


0.69 


3176 


XM_037847 


Homo sapiens neurexophilin 3 (NXPH3), mRNA 


6.7 


3177 


AY026945 


Rattus norvegicus T:G mismatch thymine glycosylase mRNA, 
complete cds 


1.6 


3178 


XM 028009 


Homo sapiens KIAA1563 protein (KIAA1563), mRNA 


2E-71 


3179 


AF250386 


Carsonella ruddii RNA polymerase beta subunit (rpoB) and RNA 
polymerase beta-prime subunit (rpoC) genes, partial cds 


0.65 


3180 


AB038968 


Homo sapiens CFLAR gene, exon 5, exon 6 


0.24 


3182 


AK023569 


Homo sapiens cDNA FLJ13507 fis, clone PLACE 1005085 


2E-64 


3183 


AF240176 


Mus Mus musculus MRP13 mRNA 


2 


3184 


U20230 


Human guanyl cyclase C gene, partial cds 


0.2 


3185 


L18959 


Ovis aries T cell receptor gene V-region, exons 1 (3' end) and 2 (5' 
end) 


0.64 


3186 


U36785 


Rattus norvegicus putative pheromone receptor VN1 mRNA, complete 
cds 


0.73 


3187 


AF270614 


Homo sapiens clone 5qtel_c93t7 sequence 


0.075 


3188 


NM_004114 


Homo sapiens fibroblast growth factor 13 (FGF13), mRNA 


6.7 


3189 


AJ403239 


M. musculus DNA for vimentin-binding fragment VimE7 


0.076 


3190 


BC010716 


Mus musculus, clone MGC:6551 EMAGE:2655861, mRNA, complete 
cds 


4.9 


3191 


AF270162 


Staphylococcus epidermidis strain SRI clone step. 105 le05 genomic 
sequence 


0.56 


3192 


XM_046992 


Homo sapiens hypothetical gene supported by AL137734 
(LOC92742), mRNA 


1.5 


3193 


AE003909 


Xyiella fastidiosa 9a5c, section 55 of 229 of the complete genome 


6.3 


3194 


AF240176 


Mus Mus musculus MRP13 mRNA 


1.5 


3195 


L08174 


Romanomermis culicivorax mitochondrial NADH dehydrogenase 
subunits 6 and 3 (ND6, ND3) cytochrome P450-like protein genes, 
complete CDS 


0.5 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3196 


AK021098 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030021G16, full insert sequence 


6.5 


3197 


AF3 11637 


Eimeria separata ORF470 gene, partial cds; apicoplast gene for 
apicoplast product 


0.079 


3198 


D84549 


Candida tropicalis DNA for carnitine acetyltransferase, complete cds 


0.057 


3199 


XM_045864 


Homo sapiens KIAA1804 protein (KIAA1804), mRNA 


0 


3200 


XM_0 10334 


Homo sapiens transducin (beta)-like 1 (TBL1), mRNA 


1.9 


3201 


AL133154 


S.pombe chromosome I cosmid cl751 


0.15 


3202 


S75898 


aromatase [Poephila guttata=zebra finches, ovary, mRNA, 3188 nt] 


0.49 


3203 


AF047669 


Homo sapiens dystrophin (DMD) gene, deletion junction 8 between 
introns 7 and 20 


0.001 


3204 


AB042652 


Bos taurus mRNA for zona pellucida glycoprotein ZPB, complete cds 


0.58 


3205 


AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


0.028 


3206 


AE006252 


Lactococcus lactis subsp. lactis IL1403 section 14 of 218 of the 
complete genome 


0.07 


3207 


Z99773 


Caenorhabditis elegans cosmid H06A10, complete sequence 


0.7 


3208 


NC_001328 


Caenorhabditis elegans mitochondrion, complete genome 


0.025 


3209 


AY037251 


Arabidopsis thaliana AT4g30750/T10C21_100 mRNA, complete cds 


5.9 


3210 


Z12834 


L.esculentum gene for polyphenol oxidase 


0.75 


3211 


AL391057 


Human DNA sequence from clone RP1 1-533B2 on chromosome 10, 
complete sequence [Homo sapiens] 


0.07 


3212 


AF001454 


Helianthus annuus Dc3 promoter-binding factor-2 (DPBF-2) mRNA, 
complete cds 


0.21 


3213 


AF295925 


Streptococcus pneumoniae transposon Tn5252 Orf28, Orf27, Orf26, 
Orf25, Orf24, Orf23, Orf22, Orf21, and Orf20 genes, complete cds 


0.019 


3214 


AC024821 


Caenorhabditis elegans cosmid Y55B1AR, complete sequence 


1.7 


3215 


AC018655 


Homo sapiens 12q BAC RP1 1-946G22 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


0.0000001 


3216 


XM_043524 


Homo sapiens cadherin 17, LI cadherin (liver-intestine) (CDH17), 
mRNA 


2E-52 


3217 


XM 051566 


Homo sapiens phosphoinositide-3 -kinase, catalytic, beta polypeptide 
(PIK3CB), mRNA 


0.025 


3218 


X84742 


M. musculus GPX3 gene 


0.079 


3219 


AL445527 


Human DNA sequence from clone RP1 1-1 15C2 on chromosome 1, 
complete sequence [Homo sapiens] 


5.4 


3220 


AF254792 


Danio rerio Period3 circadian clock protein (Per3) mRNA, complete 
cds 


3.8 


3221 


AE002225 


Chlamydophila pneumoniae AR39, section 53 of 94 of the complete 
genome 


0.12 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


NO 


i 

ACCESSN 


DESCRIP 


P VALUE 


3222 


XM 048692 


Homo sapiens similar to hypothetical protein (H. sapiens) 
(LOC92998), mRNA 


2E-15 


3223 


AF335456 


Clavulina cinerea 18S ribosomal RNA gene, partial sequence; interna] 
transcribed spacer 1, 5.8S ribosomal RNA gene and internal 
transcribed spacer 2, complete sequence; and 28S ribosomal RNA 
gene, partial sequence 


0.037 


3224 


AF335589 


Glycine max chitinase class I (Chial) and hypothetical protein genes, 
complete cds 


3.3 


3225 


AE005230 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 54 
of 155 


5.2 


3226 


AB010822 


Homo sapiens MASP gene for mannose binding protein-associated 
protease, exon 1 


7E-13 


jzz r 


XM 052139 


Homo sapiens interleukin 13 receptor, alpha 1 (IL13RA1), mRNA 


0.007 


3228 


AF293654 


Toxoplasma gondii TgMICIO precursor (MIC10) mRNA, complete 
cds 


0.15 


jzzv 


AJ388514 


Canis familiaris mRNA for partial Ribosomal protein S14 


0.38 


3230 


U80814 


Caenorhabditis elegans cosmid B0547, complete sequence 


0.22 




NM_0 13890 


Mus musculus f-box and WD -40 domain protein 2 (Fbxw2), mRNA 


0.29 


JZJZ 


AB042411 


Homo sapiens strg gene for striatum-specific G potein-coupled 
receptor, complete cds 


0.000009 


3233 


AF151527 


Hyphantria cunea serpin gene, complete cds 


0.001 


3234 


X78305 


M. musculus (SRP9) signal recognition particle subunit mRNA, 
1116bp 


0.047 


3235 


NM_018830 


Mus musculus N-acylsphingosine amidohydrolase 2 (Asah2), mRNA 


0.18 




AF302060 


Apocryptophagus sp. MSPAR4 cytochrome oxidase I (COI) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.007 


3237 


AB024025 


Arabidopsis thaliana genomic DNA, chromosome 5, TAC 
clone:K16F13 


0.012 


jZjo 


U25810 


Bos taurus lysozyme (LZ) gene, complete cds 


0.004 


J/J? 


X95363 


C.annuum gamma thionin gene 


0.15 


jZ*RI 


AF286897 


Plasmodium berghei carbamoyl phosphate synthetase II gene, partial 
cds 


0.15 


"SOA 1 
JZ41 


AF290920 


Procambarus clarkii microsatellite PcIG-03 sequence 


3.7 


.5Z4z 


AE006380 


Lactococcus lactis subsp. lactis IL1403 section 142 of 218 of the 
complete genome 


2.2 


3243 


U66913 


Dictyostelium discoideum ORF DG1040 gene, partial cds 


0.22 




] 

Y00460 ] 


Mouse genes for transfer RNA-Ala, tRNA-Ile, tRNA-Pro and tRNA- 
Lys 


0.026 


3245 


] 

L08052 < 


Fruitfly myosin alkali light chain (Mlcl) gene 
supercontractile/tubular muscle and indirect flight muscle isoforms'; 
:omplete cds's 


0.67 


3246 


] 

AF076597 ( 


Dictyostelium discoideum ubiquitin-conjugating enzyme protein UbcC 
[ubcC) mRNA, complete cds 


0.58 
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Table 3A Nearest Neighbor (BlastN vs. Genbank^ 


NO 


ACCESSN 


DESCRIP 


P VALUE 


3247 


U67518 


Methanococcus jannaschii section 60 of 150 of the complete genome 


0.003 


3248 


AF362370 


Dictyostelium discoideum histidine kinase DhkH (dhkH) gene, partial 
cds 


6 


3249 


AF146688 


Fugu rubripes sex comb on midleg-like 2 protein (SCML2) gene, 
complete cds 


0.026 




AK009952 


Mus museums adult male tongue cDNA, REKEN full-length enriched 
library, clone:2310057C01, full insert sequence 


0.07 


3251 


AB005803 


Homo sapiens DNA for histidine-rich glycoprotein, complete cds 


0.078 




AY021484 


Oryza sativa microsatellite MRG3809 containing (TA)X27, closest to 
marker G227, genomic sequence 


2.2 




ADO00833 


Homo sapiens DNA from chromosome 19-cosmid fl 93 99 (-17 kb 
EcoRI restriction fragment) 


0.024 




AB048982 


Macaca fascicularis brain cDNA, clone:QnpA- 13067 


0.38 




AL023839 


Caenorhabditis elegans cosmid Y39A1C, complete sequence 


0.026 


3256 


NC 002052 


Tomato spotted wilt virus RNA-L, complete sequence 


0.22 


J237 


AK022137 


Homo sapiens cDNA FLJ12075 fis, clone HEMBB 1002425 


0.64 


3258 


U44047 


Xenopus laevis MCM2 mRNA, complete cds 


0.24 


3259 


Z61385 


Ksapiens CpG island DNA genomic Msel fragment, clone 51d3, 
forward read cpg51d3.ftla 


5E-21 


3260 


AF163151 


Homo sapiens dentin sialophosphoprotein precursor (DSPP) gene, 
complete cds 


0.003 


3261 


LI 1194 


Montima caryophyllacea ribulose 1,5-bisphosphate carboxylase large 
subunit (rbcL) gene, partial cds; chloroplast gene for chloroplast 
product 


6.4 


3262 


BC002928 


Homo sapiens, clone MGC: 11357 IMAGE:3954507, mRNA, 
complete cds 


3E-46 


3263 


XM 002837 


Homo sapiens acid phosphatase, prostate (ACPP), mRNA 


2.3 


3264 


AF189787 


Hordeum vulgare putative c-myb-like transcription factor (MYB3R-1) 
gene, partial cds 


0.61 


3266 


AB048353 


Paramecium caudatum mRNA for hsp60, partial cds 


1.8 


3267 


U75274 


Arabidopsis thaliana acyl-CoA binding protein (ACBP) gene, 
complete cds 


0.69 


3268 


AF034077 


Equus caballus alpha- 1 -antitrypsin (Spi2) gene, complete cds 


0.077 


3269 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 
22ql3.1 Contains two exons of the APOL2 gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence [Homo sapiens] 


0.000001 


3270 


AJ007958 


Thiobacillus sp. plasmid pT3.2I including adh and repA genes, partial 


0.001 


32 / i 


NM 025978 


Mus musculus RIKEN cDNA 2700016E08 gene (27000 16E08Rik), 
mRNA 


1E-17 


3272 


] 

XM_0 18403 ] 


Homo sapiens similar to gene with multiple splice variants near HD 
locus on 4pl6.3 (H. sapiens) (LOC91016), mRNA 


5.5 


3273 


AJ238613 < 


Gallus gallus mRNA for teneurin-1 


0.57 


3274 


XM_033841 3 


Homo sapiens FGFR1 oncogene partner (FOP), mRNA 


0.55 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3275 


NM_014601 


Homo sapiens EH-domain containing 2 (EHD2), mRNA 


0.065 


3276 


AK021659 


Homo sapiens cDNA FLJ11597 fis, clone HEMBA1003856 


9E-13 


3277 


AB054644 


Sarcophaga peregrina gene for lectin, 5' upstream region and partial 
cds 


0.045 


3278 


AKO 10362 


Mus musculus ES cells cDNA, RIKEN full-length enriched library, 
clone:2410003J06, full insert sequence 


5.5 


3279 


AF282974 


Plasmodium falciparum cysteine protease falcipain-3 gene, complete 
cds 


1.8 


3280 


U60135 


Arabidopsis thaliana serine/threonine protein phosphatase 2A-3 
catalytic subunit gene, complete cds 


0.024 


3281 


AF365086 


Martiodendron parviflorum tRNA-Leu (trnL) gene, partial intron 
sequence; chloroplast gene for chloroplast product 


6.5 


3282 


AE00214O 


Ureaplasma urealyticum section 41 of 59 of the complete genome 


0.22 


3283 


Z49124 


S.oleracea chloroplast secA mRNA 


0.074 


3284 


AE002353 


Chlamydia muridarum, section 80 of 85 of the complete genome 


4.4 


3285 


XM_0 11989 


Homo sapiens ELKL motif kinase (EMK1), mRNA 


7E-54 


3286 


AJ252287 


Plasmodium falciparum partial msp3 gene for merozoite surface 
protein 3 


0.68 


3287 


AKO 18224 


Mus musculus adult male medulla oblongata cDNA, RIKEN full- 
length enriched library, clone:6330531I01, full insert sequence 


2 


3288 


X91123 


Homo sapiens partial SLC6A2 gene for norepinephrine transporter, 
exon 5 


7.5 


3289 


AB045007 


Mus musculus GNB2 gene for guanine nucleotide binding protein 
beta2 subunit, complete cds 


0.67 


3290 


AYO 17272 


Peromyscus difficilis isolate TK32541 mys-9 retrotransposon and 
LTR, 5' sequence 


5.4 


3291 


AL590902 


S.pombe chromosome I cosmid pB8E5 


0.077 


3292 


XM 003311 


Homo sapiens surfactant protein A binding protein (SPAR), mRNA 


0.38 


3293 


AK027388 


Homo sapiens cDNA FLJ14482 fis, clone MAMMA1002362 


0.007 


3294 


AB043885 


Homo sapiens BHLHB2 gene for bHLH transcriptional factor DEC1, 
complete cds 


0.068 


3295 


XM__028213 


Homo sapiens up-regulated by BCG-CWS (LOC641 16), mRNA 


0.073 


3296 


XM_039789 


Homo sapiens solute carrier family 2 (facilitated glucose transporter), 
member 2 (SLC2A2), mRNA 


9.2 


3297 


AK021841 


Homo sapiens cDNA FLJ1 1779 fis, clone HEMBA1005921 


0.65 


3298 


M11815 


Plasmid pGKLl from killer yeast (Klactis), complete 


0.008 


3299 


AF166527 


Zea mays flowering-time protein isoforms alpha and beta (ZmLD) 
gene, alternatively spliced products, complete cds 


0.074 


3300 


XM 040556 


Homo sapiens similar to hypothetical protein FLJ20127 (H. sapiens) 
(LOC91775), mRNA 


0.22 


3301 


AB001569 


Carrot DNA for transposon Tdcl 


0.17 


3302 


AF326737 


Mus musculus sclerostin gene, complete cds 


4.7 


3303 


AF272001 


Ebola virus subtype Zaire strain Mayinga complete genome 


1.9 
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Table 3 A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3304 


XM_034863 


Homo sapiens similar to hypothetical protein FLJ22294 (H. sapiens) 
(LOC90918), mRNA 


0.2 


3305 


AC006654 


Caenorhabditis elegans cosmid H09I01, complete sequence 


0,023 


3306 


AK008812 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched library, clone:2210403E17, full insert sequence 


6.1 


3307 


AF151387 


Lucilia eximia tRNA-Ile gene, partial sequence; D-ioop, complete 
sequence; and 12S ribosomal RNA, partial sequence; mitochondrial 
genes for mitochondrial products 


0.077 


3308 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3 1 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


5.5 


3309 


XM.010126 


Homo sapiens stromal antigen 2 (STAG2), mRNA 


e-115 


3310 


D 10500 


Homo sapiens MAT gene for mitochondrial acetoacetyl-CoA thiolase, 
exon 1 


1.9 


3311 


AF293900 


Paraflabellula reniformis small subunit ribosomal RNA gene, 
complete sequence 


0.092 


3312 


NC 001530 


Human papillomavirus type 47, complete genome 


0.67 


3313 


Z85396 


H.sapiens Ig lambda light chain variable region gene (34-34SWIIF32) 
rearranged; Ig-Light-Lambda; VLambda 


0.71 


3314 


NC_001566 


Apis mellifera ligustica mitochondrion, complete genome 


0.015 


3315 


AF103869 


Plasmodium yoelii blood-stage membrane protein Ag-1 mRNA, 
complete cds 


1.5 


3316 


XM 007957 


Homo sapiens hypothetical protein (LOC57019), mRNA 


3E-27 


3317 


XM 012723 


Homo sapiens chromosome 18 open reading frame 1 (C18orfl), 
mRNA 


2.7 


3318 


XM_046956 


Homo sapiens Kallmann syndrome 1 sequence (KALI), mRNA 


4.5 


3319 


AE000614 


Helicobacter pylori 26695 section 92 of 134 of the complete genome 


0.068 


3320 


NM_0 17932 


Homo sapiens hypothetical protein FLJ20700 (FLJ20700), mRNA 


1.7 


3321 


AKO 17541 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730410F24, full insert sequence 


4.8 


3322 


AK016519 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4932411G06, full insert sequence 


1.9 


3323 


U38184 


Trypanosoma cruzi kinetoplast minicircle ATPase subunit 6 mRNA, 
complete cds 


0.0003 


3324 


Z54181 


M.catarrhalis bla gene for BRO-2 


0.069 


3325 


U43961 


Oryza sativa 3-hydroxy-3-methylglutaryl-CoA reductase gene, 
promoter region and complete cds 


• 

0.17 


3326 


AK025166 


Homo sapiens cDNA: FLJ21513 fis, clone COL05778 


0.23 


3327 


AF158727 


Homo sapiens truncated properdin (PFC) gene, exons 1 and 2 


0.73 


3328 


AL1 16536 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


0.57 


3329 


NC_001712 


Locusta migratoria mitochondrion, complete genome 


0.061 
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SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


3330 


AY028779 


Staphylococcus epidermidis olasmid dST6 beta-lactama<K renr^cQnr 
Blal (blal) gene, partial cds; resolvase/integrase BinR (binR), 
recombinase Sin (sin), repressor QacR (qacR), and transport protein 
QacB (qacB) genes, complete cds; and unknown gene 


0 23 


3331 


AF346992 


Homo sapiens mitochondrion, complete genome 


1.7 


3332 


NM_031389 


Mus musculus ribonuclease/angiogenin inhibitor 2 (Rnh2), mRNA 


1.9 


3333 


AC079918 


Homo sapiens BAC clone RP1 1-1 14K13 from 7, complete sequence 


0.0002 


3335 


AF298207 


Dictyostelium discoideum transposon thug-S, complete sequence 


0.15 


3336 


L36903 


Saccharomyces cerevisiae mitochondrion 21S ribosomal RNA (2 IS 
rRNA) gene, 2 IS R2 ribosomal RNA (2 IS R2 rRNA) gene, and ORF 
RI 


0.024 


3337 


AF239663 


Prunus persica ACC synthase gene, partial cds 


0.066 


3338 


AJ223690 


H. sapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 


0.026 


3339 


AC084157 


Caenorhabditis elegans cosmid Y46E12BR, complete sequence 


4.4 


3340 


M58606 


B.subtilis threonine deaminase (ILVA) gene, complete cds 


0.62 


3341 


NC 001648 


Cassava vein mosaic virus, complete genome 


0.008 


3342 


U09277 


Caenorhabditis elegans Bristol N2 synaptic vesicle acetylcholine 
transporter (unc-17) gene, complete cds 


0.024 


3343 


U52821 


Mus musculus prion (PrP) gene, exons 1 and 2 


1.9 


3344 


AKO 14936 


Mus musculus adult male testis cDNA, RUCEN full-length enriched 
library, clone:4921521D15, full insert sequence 


2 


3345 


AE007447 


Streptococcus pneumoniae section 130 of 194 of the complete genome 


5 


3346 


AK005820 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1 7000 10F03, full insert sequence 


0.18 


3347 


ABO 11681 


Danio rerio mRNA for ZOR-2, complete cds 


1.9 


3348 


AB030616 


Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 
alpha, exon 1 and 5'-flanking region 


2E-10 


3349 


AF269359 


Staphylococcus epidermidis strain SRI clone step.l002el2 genomic 
sequence 


0.53 


3350 


AF063004 


Trypanosoma rotatorium trans-spliced leader gene, partial sequence 


1.9 


3351 


AE001411 


Plasmodium falciparum chromosome 2, section 48 of 73 of the 
complete sequence 


0.008 


3352 


AF240496 


Mus musculus clone 148d4 Naip5 gene, partial sequence 


0.56 


3353 


XM 004722 


Homo sapiens hypothetical protein FLB6421 (FLB6421), mRNA 


2.2 


3354 


] 

AF100324 ] 


Mycoplasma fermentans DNA polymerase III subunit (dnaH), YabC 
protein (yabC), orf550, lipoprotein MALP-404 precursor (malP), 
insertion sequence IS 1630 transposase (tnpA) genes, complete cds; 
P78 ATP-binding cassette transport operon, complete sequence;> 


2.1 


3355 


NM 019684 ] 


vlus musculus serine/threonine kinase 23 (Stk23), mRNA 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



|SEQID 
NO 



ACCESSN 
AB004856 



DESCRIP 



P VALUE 



Buchnera aphidicola mRNA for homoserine kinase, partial cds 



1.9 



AF328428 Homo sapiens clone lptel_2 109Bf&Br sequence 



2E-11 



Homo sapiens hypothetical gene supported by AF308287; BC009516 
XM_041166 |(LOC91860), mRNA 



0.024 



Lactococcus lactis subsp. lactis IL1403 section 191 of 218 of the 
AE006429 [complete genome 



0.24 



HIV-1 clone 8 isolate BV from Milan, envelope glycoprotein, V1-V5 
U95455 I region (env) gene, partial cds 



0.026 



jHomo sapiens PEF protein with a long N-terminal hydrophobic 
XM 001775 [domain (peflin) (PEF), mRNA 



0.45 



AF100670 [Caenorhabditis elegans cosmid M4 



0.072 



AB036737 [Homo sapiens mRNA for RERE, complete cdT 



0.21 



AE001030 Archaeoglobus falgidus section 77 of 172 of the complete genome 



1.5 



Homo sapiens hypothetical gene supported by AK027658 
XM 046602 |(LOC92679), mRNA 



XM_005889 Homo sapiens kinesin-like 1 (KNSL1), mRNA 



0.21 



Lycopersicon esculentum MADS-box transcription factor jointless 
AF275345 |gene, complete cds 



1.2 



NCJ)00857 ICeratitis capitate complete mitochondrial genome 



0.022 



Human PCR Fragment of Cosmid gl346a274 from 7q3 1.3, complete 
AC002048 [sequence [Homo sapiens] 



1E-16 



ABO 166 1 1 [Enterobacter cloacae ampC gene, complete cds 



0.62 



Mus musculus RKEN cDNA 2810036K01 gene (2810036K01Rik) 
NM 026040 I mRNA 



e-138 



AJ252860 



Human immunodeficiency virus type 1, partial proviral POL gene, rt 
| region, isolate CL 



0.25 



XM_043434 Homo sapiens villin-like (VILL), mRNA 



1.8 



Drosophila melanogaster genomic scaffold 142000013385952, 
AE002776 | complete sequence 



0.19 



Homo sapiens tumor necrosis factor receptor superfamily member 8 
XM_030198 |(TNFRSF8), mRNA 



0.76 



X65450 1 C.canis mRNA for chloride channel 



6.3 



Homo sapiens hypothetical protein DKFZp434F1719 

3378 NM 032248 |(DKFZp434F1719), mRNA | 1E ^ 7 

Carnobacterium piscicola transposase, bacteriocin, histidine protein 
kinase, ATP dependent transloactor, accessory protein, and 
L47121 |carnobacteriocin B2 genes, complete cds | 0.18 



3379 



3380 



L26287 [Schistosoma mansoni SMDR2 gene, complete cds 



0.66 



3381 



AF125130 [Nemadactylus macropterus clone 15.1 microsatellite sequence 



0.072 



Oryzias latipes olgc4 gene for membrane guanylyl cyclase 01GC4, 5* 
3383 AB040099 [flanking sequence and partial cds 



3384 



AF375398 [ Arabidopsis thaliana Atlg5 1660/F19C24_26 gene, complete cds 



0.22 



3385 



AF391256 



HIV-1 clone TV004G24 from South Africa gag protein (gag) gene, 
| complete cds 



0.23 



3386 



X9 1865 | A. thaliana asp5 gene 



2.1 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3387 


XM_047084 


Homo sapiens human immunodeficiency virus type I enhancer- 
binding protein 2 (HIVEP2), mRNA 


0.21 


3388 


AF1 15469 


Arabidopsis thaliana cultivar Landsberg erecta Arac2 (Arac2) gene, 
complete cds 


2.3 


3389 


<-7 a one i 
Z4805 1 


H. sapiens gene for myelin oligodendrocyte glycoprotein (MOG) 


0.008 


3390 


U49259 


Arabidopsis thaliana isopentenyl diphosphate:dimethylallyl 
diphosphate isomerase (IPP2) mRNA, complete cds 


2.2 


3391 


AF032124 


Homo sapiens RET proto-oncogene (RET) gene, 5* flanking region 
and partial cds 


0.079 


3392 


Ab001420 


Plasmodium falciparum chromosome 2, section 57 of 73 of the 
complete sequence 


0.067 


3393 


XM 005555 


Homo sapiens UDP-glucose ceramide glucosyltransferase (UGCG), 
mRNA 


0.076 


3394 


Z81014 


Human DNA sequence from cosmid U65A4, between markers 
DXS366 and DXS87 on chromosome X * 


2.1 


3395 


XM 002056 


Homo sapiens hypothetical protein FLJ1 1006 (FLJ1 1006), mRNA 


0.26 


3396 


AF339787 


Homo sapiens clone IMAGE:205688, mRNA sequence 


2 


3397 


AP000145 


Homo sapiens genomic DNA, chromosome 21q21.2, LL56-APP 
region, clone B2291C14-R44F3, segment 10/10, complete sequence 


0.026 


3398 


AF261936 


Homo sapiens clone 1 Iptel_c3t7 sequence 


0.00002 


3399 


A T7 1 m/' 1 c 

AF193615 


Homo sapiens G-protein-coupled inward rectifying K+ channel 
KCNJ9 (KCNJ9) gene, complete cds 


0.009 


3400 


AK009952 


Mus musculus adult male tongue cDNA, RUCEN full-length enriched 
library, clone:2310057C01, full insert sequence 


0.079 


3401 


AF091779 


Oncorhynchus mykiss non-classical MHC class I antigen (Onmy- 
UAA) gene, Onmy-UAA*0101 allele, complete cds 


2.3 


3402 


AB001735 


Mus musculus DNA for ADAMTS-1, complete cds 


2.3 


3403 


AK000589 


Homo sapiens cDNA FLJ20582 fis, clone KAT12156 


1E-13 


3404 


U55243 


Dictyostelium discoideum glyceraldehyde-3 -phosphate dehydrogenase 
(GAPDH) gene, partial cds 


0.24 


3405 


Ar llUolz 


Boophilus microplus cytochrome b apoenzyme (Cytb) gene, partial 
cds; tRNA-Ser, tRNA-Leu, tRNA-Cys, and tRNA-Met genes, 
complete sequence; and NADH dehydrogenase subunit 2 (ND2) gene, 
partial cds, mitochondrial genes for mitochondrial products 


0.077 


3406 


AF060542 


Haemophilus influenzae biogroup aegyptius HaelV 
restriction/modification cvstpm fWapTVR \A} tre*nf* mmnidta r>Ac 


n 7^ 
u. / D 


3407 


AF303391 


Homo sapiens meiotic recombination 11 (MRE11A) gene, exon 16 


1.7 


3408 


AF269359 


Staphylococcus cpidermidis strain SRI clone step.l002el2 genomic 
sequence 


0.22 


3409 


- 

AF327975 


Hentzia palmarum 16S ribosomal RNA, partial sequence; 
mitochondrial gene for mitochondrial product 


0.57 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3410 


XM 047898 


Homo ^aniens variable charjyp nrntpin on Y with f*i «Vit rpnAate /\//^ v 
8r), mRNA 


^ o 


3411 


XM 047003 


Homo sapiens TATA box binding protein (TBP)-associated factor, 
RNA nolvmerase II B 1 50kD rrAF7TVk mRMA 


n n 

v). / 


3412 


D86566 


Human DNA for NOTCH4, partial cds 


0.025 


3413 


AF360301 


nicujiwjyoia Liuu.idJ.ta LULKJIUWII pruicin ^ 1 OD 1U /U/ A 1 jEOUt 1U ) 

mRNA, complete cds 


0.64 


3414 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.002 


3415 


XM_037369 


Homo sapiens hypothetical protein FLJ20693 (FLJ20693), mRNA 


9E-14 


3416 


AE005230 


Escherichia coli 0157*H7 EDL.933 Pennmp mnria 1 nf ^ Qprrinn ^4 
of 155 


6.2 


3417 


AL121778 


Human DNA sequence from clone RP5-839B1 1 on chromosome 20. 
domains me iirsi coding exon oi trie gene ior a novel protein witxi a 
Kunitz/Bovine pancreatic trypsin inhibitor domain and WAP-type 

fWhev AcifHc Protein^ 'ffllir-HiciilfiHf* ct\t&* Hnmsiinc anH an <=**> 




3418 


AY020760 


KJiyiAX aduvd iiiiduaai.ciiiic lvixvvj jVJo«? UUIlUllIllIlg ^ 1 AJ.A.i.j, ClOSeSl IO 

marker R2976 genomic <jfinnpnf!p 


\J . o 


3419 


AL122122 


Homo sapiens mRNA; cDNA DKFZp434L098 (from clone 
DKFZo434L0981 


1 K 


3420 


AF250346 


Xenopus laevis early growth response protein 1 (egrl) gene, partial 
cds 


0.077 


3421 


XM_044852 


Homo sapiens KIAA1572 protein (KIAA1572), mRNA 


0.7 


3422 


XM 049057 


Homo saniens O nrotein-coirnleri rerpntnr 4.Q / r fTPR4Q , \ mPWA 


7 


3423 


AJ223690 


H.sapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 




3424 


AJ3 14752 


Anac n1atvrr1ivTir*fioQ mTcrN/f cjf»np» /ri»irfi!il\ mTa A rT*»r>r» /not-tiol'v onH 
rviiaa piaLjriiijriu*iiua uugivi gcilC IIUg/\ gCIlC ^piULiai ) aHU 

Iff A eerie (nartial^ clone 1"? 1 


n 07^ 


3425 


U51739 


Tnomoea trifida secreted f?1vcnnrntein 7 fTSfr9^ mRNA mmnlptp rHc 




3426 


S66752 


coagulation factor IX {3* region} [human, Genomic Mutant, 3130 nt] 


0.026 


3427 


AE001122 


Borrelia burgdorferi (section 8 of 70) of the complete genome 


0.72 


3428 


AJ233850 


Rattus norvefneu*; micro<»atellite oennenri* rlnnp 7TFQ 


7 1 


3429 


L35664 


Homo saniens ( subclone H8 8 f5 from PI IS H5 CXi DMA sennence 


0.079 


3430 


U97029 


Callithrix geotxroyi epsilon-globin gene, partial cds 


2.1 


3431 


Z82181 


Human DNA sequence from clone LL22NC01-86D10 on chromosome 
22 Contains part of the SYN3 gene for synapsin III and ESTs, 
complete sequence [Homo sapiens] 


0.71 


3432 


AB066544 


Macaca fascicularis brain cDNA clone:QtrA- 10780, full insert 
sequence 


0.62 


3433 


L77039 


Homo sapiens (subclone 2_e8 from PI H22) DNA sequence 


0.0000002 


3434 


AL590650 


Human DNA sequence from clone RP1 1-334F20 on chromosome 6, 
complete sequence [Homo sapiens] 


1.9 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3435 


U93519 


jviauoLiirips mgosus cytocixronie oxidase suDuiut i (i^a/ij gene, 
mitochondrial gene encoding mitochondrial protein, partial cds 


0.77 


3437 


U23451 


Caenorhabditis elegans cosmid ZK75 


0.48 


3438 


D^UUj^ id . 


U/vms* canipnc r*\r\ne* TKJS AtfTF-'l/l^ 1 1A1 mDXTA 
JTIOIIIU SapiCIls, C1UI1C llVLnVJJjr. j^f^l lOl, IILtvrNrY 




3439 


/vISAJZ** D ZO 


T-Tnmrk canipnc rDXTA* FT 17 Oft 71 fir r»lnn«> A V\V Afi^AAQ 

fiuiiiu sapiens cl/ina. ri^jzuo/3 ns, cionc al/jv/wjzoo;* 




3440 




Neisseria meningitidis serogroup B strain MC58 section 190 of 206 of 
uic VfUiiipiciv genuine 


0 f\ 

\J.\J 


3441 


AF1 1^87? 


r\nopncica iiuiuiiiua audiii rii wyiuciiroiiic uxioasc suuuiui l 


1.5 


3442 


AB055323 


Macaca fascicularis brain cDNA, clone:QflA-10308 


1.6 


3443 


AK014667 


JVIU5 IllUoCUlUo v Uaj llCUIUtlC IlCclU CL/iN/\ ? i\JJ\J_LrN lUli'lCIigLli CliridlvU 

library, clone:4833408P15, full insert sequence 


7E-25 


3444 


T TRrt7^1 
UoU/jj 


T-jVmyi/i conipnc f^AflT 70 m R M A T\ct t~ti *a 1 /*/lc 
rlUIIlU actpidla K^/WJL^/y IIUViNrV, pdilldl CUo 


0.048 


3445 




valuer dxiCLiiLUiu nir\j.N/\ iui puuiuvc /VD/\~iCopuiibivc pruiciu 


1 4 


3446 


AF336131 


Lumpy skin disease virus strain Neethling, 3* partial sequence 


0.3 


3447 


U11476 


fflV-1 isolate LBLA010 from Liberty City, Florida, envelope 
glycoprotein (env) gene, partial cds 


1.6 


3448 




riomo sapiens nypoineucai gene supported oy ali j /4 /o 


6E-13 


3449 




\ M-w -u o tniicAiilnc forv^o 1 cifiniQ iwontAf nana ^nmnlafa p/Jf" 

jvius IHU5CIL1U5 iypc i bigiiui rcLcpuji gene, complete cqs 


u.UJ 


3450 


AT j'fjyzo 


Myzopoda aunta 12S nbosomal RNA gene, complete sequence*, tRNA- 
Val gene, complete sequence; and 16S ribosomal RNA gene, complete 
sequence, iiuiuciiuiiuridi gciicb iui iiiiiuciioiiuricu pruciuwid 




3451 


INJVl_UUo /1j 


JVIUS mUSwlIluS INULL/llZ 1LKC ^INUIL.IIZ.1 IllTvlN/A. 


0 17 

V/. 1 f 


3452 


Aruo / /v^- 


iNicoudJui ictDdCuiiL iiLcnsiciii-bpcviiic pruiuuicr sequence 


0 1 


3453 


A T7ft71 7^ 


lVlUt/Ol llltllCUo paTLlal aUL**l gCILC IvJI aCllll 


2 


3454 


U89439 


Bos taurus ubiquitin-like protein mRNA, complete cds 


0.000003 


3455 




yvenopus lacvis eariy growui response protein i ^egri ) gene, poixiai 
cds 


0.072 


3456 


Z73587 


S.cerevisiae chromosome XVI reading frame ORF YPL23 lw 


0.19 


3457 




lMjirranii faor'inilflri^ hrain rDNA p1nn^*Ofl A-1A7T^ 


0.21 


3458 


Y16090 


Daucus carota Susy*Dcl gene 


0.13 


3459 


AE003868 


Xylella fastidiosa 9a5c, section 14 of 229 of the complete genome 


0.16 


3460 




Mnmn oarH^nQ plnnf* 1fintf*1 f*8Qht7 ^ftniiPTif^p 


5E-36 


3461 


AF126541 


Ateline herpesvirus 3 H-DNA terminal repeat unit, complete sequence 


0.023 


3462 


AK011178 


Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2600010E01, full insert sequence 


1.5 


3463 


AF370226 


Arabidopsis thaliana unknown protein (F1B16.11) mRNA, complete 
cds 


0.4 


3464 


X94768 


Rsapiens RP3 gene (XLRP gene 3) 


0.21 


3465 


AB029068 


Mus musculus gene for Ser/Thr kinase KKIAMRE, exon 1, 2, 3, 4, 5 


0.57 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRJP 


P VALUE 


3466 


X84742 


M. musculus GPX3 gene 


0.038 


3467 


BC010011 


Homo sapiens, Similar to RIKEN cDNA 493 1428D14 gene, clone 
MGC: 15407 IMAGE: 43096 13, mRNA, complete cds 


0.22 


3468 


AF288536 


Legionella loncbeachae soectinomvcin 3* adenvlvltnin<rferase (and A} 
possible transcriptional regulatory protein (IrpR), and possible sensor 
kinase protein (IskS) genes, complete cds; and unknown genes 


1 


3469 


AF288536 


Legionella longbeachae spectinomycin 3* adenylyltransferase (aadA), 
possible transcriptional regulatory protein (IrpR), and possible sensor 
kinase protein (IskS) genes, complete cds; and unknown genes 


1.8 


3470 


ABO 13 150 


Papilio machaon mitochondrial ND5 gene for NADH dehydrogenase 
subunit 5, partial cds 


5.9 


3471- 


XM 028011 


Homo sapiens CGI-145 protein (LOC51028), mRNA 


0.63 


3472 


XM_052069 


Homo sapiens actin related protein (MGC 15664), mRNA 


5.7 


3473 


AL449163 


Human DNA sequence from clone RP1 1-533E16 on chromosome 6, 
complete sequence [Homo sapiens] 


0.00001 


3474 


X96685 


B.burgdorferi cell division genes 


L9 


3475 


AF179589 


Rana tigrina rugulosa glucagon receptor (GluR) mRNA, complete cds 


0.19 


3476 


AJ297549 


Homo sapiens partial PIK3CB gene for phosphatidylinositol 3-kinase 
catalytic subunit pi lObeta, exons 1-2 and joined CDS 


0.87 


3477 


XM 032707 


Homo sapiens KLAA0630 protein (KIAA0630), mRNA 


0.2 


3478 


U25032 


Caenorhabditis elegans par-3 mRNA, complete cds 


1.3 


3479 


U89283 


Biomphalaria glabrata myoglobin gene, complete cds 


3.2 


3480 


AY029303 


Strongylocentrotus purpuratus ADAM precursor, mRNA, complete 
cds 


5.6 


3481 


AB043223 


Mus musculus V186.2 gene for immunoglobulin heavy chain, 
clone:LlG6001 


7.5 


3482 


AF304130 


Danio rerio transmembrane receptor Roundabout 1 (robol) mRNA, 
complete cds 


1.8 


3483 


AF386077 


Homo sapiens interleukin 17B (IL17B) gene, complete cds 


0.65 


3484 


U39889 


Bos taurus Y-chromosome specific genomic sequence 


0.65 


3485 


XM_032347 


Homo sapiens region containing TLS -associated serine-argini ne 
protein 1; TLS-associated serine-arginine protein 2; TLS-associated 
serine-arginine protein 2; TLS-associated serine-arginine protein 2; 
TLS-associated serine-arginine protein 1; TLS-associa> 


0 


3486 


AF120475 


Mus musculus myelin-associated oligodendrocytic basic proteins 
MOBP170 and MOBP69 (Mobp) gene, exons 3 through 5 and 
complete cds, alternatively spliced 


0.0008 


3487 


U81676 


Unidentified eubacterium clone vadinBC27 16S ribosomal RNA gene, 
partial sequence 


0.31 


3488 


AL596023 


Human DNA sequence from clone RP1 1-142 J18 on chromosome X, 
complete sequence [Homo sapiens] 


0.016 


3489 


U67210 


Human clone HS2, 16 Alu-Ya5 sequence 


0.056 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3490 


XM_007196 


Homo sarjiens similar to ret finder nrotpin 9* r.5inrHHatf» tiimnr 
suppressor involved in B-CLL (H. sapiens) (LOC93519), mRNA 


0.0000008 


3491 


AL049568 


Human DNA sequence from clone SC22CB-23F1 on chromosome 

22fll3 31-13 33 Contain*! a OSS rnrrmlptf* cpnnpnrp ri-Tnmr* canipncl 


O 07d. 

\J.\3 1 f 


3492 


X94755 


A.thaliana mRNA for SNF1 -related ser/thr protein kinase (1852 bp) 


0.15 


3493 


AL392109 


Human DNA sequence from clone RP13-459H7 on chromosome 10, 
complete sequence [Homo sapiens] 


1.8 


3494 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3' 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


3.2 


3495 


X16027 


R.norvegicus gene encoding alkaline phosphatase, exon 2 


0.0009 


- 

3496 


AF213918 


Buchnera aphidicola plasmid pTrpEG isolate Mor26ps anthranilate 
svnthase comnonent I ftroE^ Dseudoeene nartial sennence 


0.004 


3497 


AE006491 


Streptococcus pyogenes Ml GAS strain SF370, section 20 of 167 of 
the complete genome 


0.33 


3498 


Z73587 


S cerevisiae chromosome XVI reading frame ORF YPL23 lw 


0.089 


3499 


AK023040 


Homo sapiens cDNA FLJ12978 fis, clone NT2RP2006321 


2E-62 


3500 


AF062039 


Homo saoiens intecrin aloha-2 subunit fTTGA2 1 i eene TTGA2-1 
allele, partial cds 


0.65 


3501 


Z77974 


H. sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA5D12 


0.036 


3502 


M13934 


Human ribosomal orotein S14 eene comolete cds 


0.54 


3503 


NM 023220 


Mus museums RDCEN cDNA 20 1 0 1 06GO 1 eene (20 1 0 1 06GO 1 RikA 
mRNA 


1.1 


3504 


AF140513 


Homo sapiens clone 254LL2 genomic sequence 


0.0004 


3505 


AL359558 


Homo saoiens mRNA* cDNA DKFZn7620 1615 f from clone 
DKFZp76201615) 


0.019 


3506 


XM 048768 


Homo saoiens tetratricorjentide reoeat domain 3 rTTC3^ mRNA 


0.00009 


3507 


AF074994 


Homo sapiens full length insert cDNA YH91C06 


0.65 


3508 


NC 001338 


Sulfolobus virus 1, complete genome 


0.008 


3509 


U67502 


Methanococcus iannaschii section 44 of 150 of the comnlete ffenome 


0.21 


3510 


AY035821 


Schizosaccharomyces pombe Byrl-binding protein Bobl (bobl) gene, 
complete cds 


0.19 


3511 


AF275842 


Plasmodium falciparum clone b.2 PfEMPl (var) mRNA, partial cds 


2 


3512 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.19 


3513 


AL391240 


Human DNA sequence from clone RP3-397P18 on chromosome 6, 
complete sequence [Homo sapiens] 


3.4 


3514 


AE002304 


Chlamydia muridarurn, section 35 of 85 of the complete genome 


1.8 


3515 


AF083830 


Homo sapiens dihydropyridine receptor alpha 2 subunit 
(CACNA2D1) gene, exon 14 


0.2 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3516 


AE005222 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 46 
of 155 


1.7 


3517 


U04208 


Escherichia coli 9142-88 cytolethal distending toxin (cdtA, cdtB, and 
cdtC) genes, complete cds 




3518 


Y07738 


M.musculus gene for vimentin 


0 65 


3519 


AF304091 


Sotalia fluviatilis alpla lactalbumin gene, partial cds 


2 


3520 


AF270356 


Staphylococcus epidermidis strain SRI clone step.4025c09 genomic 
sequence 


0.52 


3521 


AF017516 


Bombus pascuorum cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


1.8 


3523 


AF098919 


Gallus gallus alpha-globin gene domain 5 1 region 


2 


3524 


XM_047517 


Homo sapiens Kruppel-like factor 4 (gut) (KLF4), mRNA 


1E-39 


3525 


NC_001626 


Petromyzon marinus mitochondrion, complete genome 


0.021 


3526 


AF246647 


Bemisia tabaci biotype Q 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


1.9 


3527 


AL391240 


Human DNA sequence from clone RP3-397P18 on chromosome 6, 
complete sequence [Homo sapiens] 


3.1 


3528 


M32779 


AcNPV with an S.frugiperda insertion element IFP2.2 


1.9 


3529 


L44121 


Homo sapiens (clone pHFC2. ID) CMT1A gene, repeat unit 


6E-44 


3530 


U52350 


Arabidopsis thaliana GTP-binding protein (ARAC5) mRNA, 
complete cds 


0.007 


3531 


X80908 


L.esculentum gene for fruit ripening polygalacturonase 


0.65 


3532 


AF267203 


Candidatus Carsoneila ruddii natural-host Blastopsylla occidentalis 
ATP synthase alnha subunit fatnA^ ?enp nartial rHc* ATP cvnthsicp 
gamma subunit (atoG) gene, comnlete cds* and ATP svnthase beta 
subunit (atpD) gene, partial cds 


0.22 


3533 


AE001379 


Plasmodium falciparum chromosome 2, section 16 of 73 of the 
complete sequence 


0.25 


3534 


XM_034129 


Homo sapiens hypothetical protein DKFZp761J1523 
(DKFZp761J1523), mRNA 


6 


3535 


AE002304 


Chlamydia muridarum, section 35 of 85 of the complete genome 


1.8 


3536 


AK024082 


Homo sapiens cDNA FLJ14020 fis, clone HEMBA1002508 


1.7 


3537 


U67547 


Methanococcus jannaschii section 89 of 150 of the complete genome 


0.22 


3538 


AF2 15629 


Crypteronia paniculata chloroplast rpll6 gene, intron sequence 


0.024 


3539 


X03853 


Pea chloroplast gene for ribulose 1,5-bisphosphate carboxylase (ibcl) 


0.008 


3540 


X85055 


B.taurus cosmid-derived microsatellite DNA (clone IDVGA-40) 


0.0009 


3541 


AK005508 


Mus musculus adult female placenta cDNA, RIKEN full-length 
enriched library, clone: 1600021C16, full insert sequence 


0.63 


3542 


Y16262 


Daucus carota mRNA for neutral invertase 


0.21 


3543 


AL5 13491 


Human DNA sequence from clone RP1 1-79 A21 on chromosome X, 
complete sequence [Homo sapiens] 


0.56 


3544 


XM 036298 


Homo sapiens hypothetical gene supported by AL442095 
(LOC91124), mRNA 


0.02 


3545 


XM_049045 


Homo sapiens KIAA01 18 protein (KIAA01 18), mRNA 


0.46 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3546 


AY020760 


aauTo nuvi uaaiwii i& i vxxx VJ JUOJ tAJllulxillxxli ^ 1 f\ )j\.L -3, CLUSCol LO 

marker R2976, genomic sequence 


5.4 


3547 


AF198037 


xvi^ruupiaaiiui |/uiiiiijiiid x^xjjj_> ynyijj gene, vonipicic ccis, anu vsavj 
^vsaG^ and VsaE fvsaE^ f?ene<; nartial cH« 


v.UV/O 


3548 


X62658 


E.faecalis plasmid pADl seal gene and orfy 


1.7 


3549 


AF083468 


Fmericella nidiilfin^ rmtativp yinc fintrpr nrot**in rflhf^ apnp r*mTmlf*t*» 

x-iiiitx iv^vxxci xmiiiici ixa puuiuiv^ £.1X1^ xxxigvx pxvfLdll ^HUv^ J liCHC, ^\/lllL/xdC 

cds 


0.18 


3550 


AF3 15801 


Homo sapiens chromosome Y AHCP pseudogene, complete sequence 


1.9 


3551 


AC007042 


Homo sapiens clone RP11-399H17, complete sequence 


6 


3552 


AF198037 


Mvconlasma Dulmonis LinB flinR^ eene comnlete cds* and VsaG 
(vsaG) and VsaE (vsaE) genes, partial cds 


0.008 


3553 


AK025090 


Homo sapiens cDNA: FLJ21437 fis, clone COL04285 


0 


3554 


X76027 


K.lactis APA2 gene for tetraphosphatase, QCR7 gene, bcl complex 
<*iibunit VII 


0.4 


3555 


Z78713 


H. sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA15All 


0.61 


3556 


U50357 


Sltrpritococcii** yooenidemiciiQ 7<\c\c\ n A immnnih/ fhptnr nnrt 

zoocin A endopeptidase (zooA) genes, complete cds 


0.13 


3557 


U93721 


Homo ^anient; {rreeTi cone nhotOTecerrtor nifrnient opn** V flanlcinff 

XXU11XV/ OtXtyiWXU glWvll LSXXULVfX VV&LJIAJX LSXglllbfXll gCXl^, — ' XXdXXIVXXlg 

region 


1.4 


3558 


AJ291489 


Paracentrotus lividus mRNA for fibrosurfin (surfin2656 gene) 


3.7 


3559 


M13820 


PlactniH C**r\\ TV>-nQ mlifin Th ctntpfiiral unH imtnnnitv ir*»n*»c rnmnlpt^ 

X^ladXllXU 17X117 I^VJXIVXXX 1U OLX 1X1/ 1 Ul Ctl ClXlU lllllllUllliy c£GllGd, C\JXII|JICLC 

cds 


0.2 


3560 


Z54147 


T-Tumnn F"iM A cpniifnpp fVnm pncmiH T 1 0 0T-T7 Mutitin o+r» ti*c fiicpsicf* 
XXlxxxlcxxx L/iNn o^CJ U t- 1 ll/V> XX \J1X1 lAJollxxU l-» 1^711 / , rxllllLliigLvJll a 1/iaCaoC 

Repion chromosome 4nliS 3 contains PnO island 

xvv^xvjxx, vrxxx vsxxxisovjxtxw Lvj . ~j wxxicixxxa v^iyvj x^xcxx xvx 


2 


3561 


AJ301616 

rw iJ\f i\f xv/ 


Homo saniens DNC! j?ene for deoxvnncleotide carrier evons 1 -9 


0.17 


3562 


U29403 


Human immunodeficiency virus type 1 defective gag gene, clone 3-13, 
truncated cds 


0.63 


3563 


AF090118 


Plasmodium falciparum heat shock protein hspl04 (hspl04) gene, 
partial cds 


0.21 


3564 


Z22707 


S scrofa sol FFN gene 


0.57 


3565 


AF090889 


Homo sapiens clone HQ0092 


0.62 


3566 


XM 050705 


Homo sapiens GalNAc-4-sulfotransferase 2 (GALNAC4ST-2), mRNA 


0.15 


3567 


XM 016611 


Homo ^aniens similar to 8-oxojnianine DNA elvcosvlase ftt saoiens^ 
(LOC93577), mRNA 


0.052 


3568 


Z35914 


S.cerevisiae chromosome II reading frame ORF YBR045c 


0.59 


3569 


S69278 


C YP 1 A2=3 -methy lcholanthrene responsive gene [Oncorhynchus 
mykiss=rainbow trout, liver, Genomic, 5023 nt] 


1.5 


3570 


X56832 


H.sapiens EN03 gene for muscle specific enolase 


0.21 


3571 


XMO 11264 


Homo sapiens KIAA0372 gene product (KIAA0372), mRNA 


0 


3572 


NM_022628 


Rattus norvegicus nephrin (Nphsl), mRNA 


0.017 


3573 


AJ299718 


Homo sapiens partial MASP2 gene, intron 7 


6E-33 


3574 


Z69651 


Human DNA sequence from cosmid L75B9, Huntington's Disease 
Region, chromosome 4pl6.3 


0.022 
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SEQED 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3575 


AF327739 


^Itrpntncoccii^ tViprmnnliiliic TJphl Z' rw*H 1 ^ filnfl fo-lnfYl rpcnnnc^ 

regulator Rrl (rrl), Hpk2 (hpk2), YycJ-like protein (yycJ), and MurM 
fmiii"N/f ■ ffene*; comnlpte cd<; 


0.59 


3576 


AC001656 


Drncnnhilfl mplflnnaactpr fPl H^n01Q7 /Til Ji^ HMA cpnupnrp 

complete sequence 


0.068 


3577 


AF262620 


Periplaneta americana 16S ribosomal RNA gene, partial sequence; 
mitochondrial eene for mitochondrial orodnct 

A- *- 1 A LWl *\Jm. till i{U rAtM* m. W 1V1 A A 11 IWl 4 V Hul 4141 Vr\l H W L 


0.2 


3578 


L08425 


Zea mays auxin-binding protein (abpl) gene, exons 1-5 and complete 
cds 


0.64 


3579 


M97702 


Drosophila melanogaster glutathione S -transferase gene 


1.8 


3580 


AF268059 


f^jinHidjitii*; C'ar^onellfl niddii natnral-ho«it Arvtainn jypniotap RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.063 


3581 


U30149 


Entamoeba histolytica pyruvate: ferredoxin oxidoreductase (pfol) 
gene, complete cds 


1.9 


3582 


X88893 


C.jacchus intron 4 of visual pigment gene (green allele) 


0.00008 


3584 


AK000939 


Homo sapiens cDNA FLJ10077 fis, clone HEMBA1001864 


0.19 


3585 


AL1 14356 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


5.6 


3586 


AF126992 


Oxvtroois bessevi var ventosa chloronlast tRNA-Leu ftrnLi gene 
intron sequence 


0.21 


3587 


Z47046 


Human cosmid OLL2C9 from Xa28 


5.7 


3588 


AE006246 


Lactococcus lactis subsp. lactis IL1403 section 8 of 218 of the 
comnlete genome 


5.3 


3589 


AYO 18921 


nrwa cotiv?! micro^itellitp. 1M"RCt1 946 contfliniiKT f ATYX^^ ffpnnmic 

\yi.y£i€L ottUVa llUVl UOiH.WllllW IVAXWJ 1 ArTU wV/il Kill Lillet ^Al JJ\^* ~J -y KbllWilLlW 
Uvl iwv 


0.71 


3590 


AL5 89705 


Human DNA sequence from clone RP1 1-64M7 on chromosome 6, 
comnlete sentience IHomo saniensl 


0.069 


3591 


NM 010250 


Mus musculus gamma-aminobutyric acid (GABA-A) receptor, subunit 
alnha 1 f Gab raU mRNA 

{Ui/llil y VIUI/4U *- J j iKUu ~ *- *- 


3.8 


3592 


AC024748 


Caenorhabditis elegans cosmid Yl 10A2AR, complete sequence 


0.017 


3593 


AJ002240 


Mus musculus minisatellite tandem repeat (MMS9) 


1.7 


3594 


AJ293574 


Rra<;<;ica nanus t?ene for niitative corticosteroid hinriiri? nrotein and 
partial gene for hypothetical protein, cultivar N-o-9 


0.21 


3596 


AF324889 


Homo saniens mvosin Dhosnhatase target siihnnit 2 fMYPT^ $?ene 
exon 1 


0.22 


3597 


S62623 


IGF2= insulin-like growth factor 2 {clone PPA1, exon 9} [human, 
Genomic, 811 nt] 


5.4 


3598 


Z22707 


S.scrofa spl IFN gene 


0.57 


3600 


AB030490 


Glycine max SG-05 gene for thiamin biosynthetic enzyme, complete 
cds 


0.023 


3602 


S69278 


CYP 1 A2=3 -methylcholanthrene responsive gene [Oncorhynchus 
mykiss=rainbow trout, liver, Genomic, 5023 nt] 


1.7 


3603 


XM_029148 


Homo sapiens hypothetical protein BC004923 (LOC85865), mRNA 


0.002 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3604 


AJ002979 


L. concinnum chloroplast DNA for intergenic spacer between tRNA- 
Thr (UGU) and tRNA-Leu (UAA) 5'exon, collected from W. 
Pakawau, Cape Farewell, South Island, New Zealand 


0.067 


3605 


U36927 


Plasmodium yoelii rhoptry protein gene, complete cds 


0.2 


3606 


AK021502 


Homo sapiens cDNA FLJ1 1440 fis, clone HEMBA10013 19 


0.00001 


3607 


AB033604 


Capra hircus Hsp70-3 gene for 70 kDa heat shock protein, complete 
cds 


0.18 


3608 


M81651 


Human semenogelin II (SEMGII) gene, complete cds 


1.8 


3609 


AB054512 


Berardius bairdii DNA, SINE flanking sequence Tuti35 locus 


0.49 


3610 


AF057740 


Homo sapiens 17-beta-hydroxysteroid dehydrogenase IV (HSD17B4) 
gene, exon 24 and complete cds 


0.018 


3611 


AJ295158 


Homo sapiens gene for thymosin beta-4 


1E-48 


3612 


X79402 


B.taurus mRNA for plasminogen 


0.21 


3613- 


NC_001807 


Human mitochondrion, complete genome 


3E-20 


3614 


X92204 


P.hybrida NAM gene 


0.049 


3616 


AB032016 


Homo sapiens PDNP1 gene for phosphodiesterase I/nucleotide 
pyrophosphatase 1, 5' flanking region and partial cds 


0.009 


3617 


AF339159 


Macropus rufogriseus mannose-6-phosphate/insulin-like growth factor 
II receptor (m6p/igf2r) mRNA, partial cds 


0.47 


3618 


AB025285 


Homo sapiens c-ERBB-2 gene, exons 1', 2*, 3\ 4' 


0.0000001 


3619 


AJ002550 


Homo sapiens MMP-1 gene, promoter region 


3.8 


3620 


AB004856 


Buchnera aphidicola mRNA for homoserine kinase, partial cds 


0.79 


3621 


ABO 10426 


Phytoplasma sp. gene for AL1 like protein, complete cds 


L8 


3622 


L36903 


Saccharomyces cerevisiae mitochondrion 2 IS ribosomal RNA (2 IS 
rRNA) gene, 21S R2 ribosomal RNA (21S R2 rRNA) gene, and ORF 
RI 


0.022 


3623 


AF267642 


Filobasidiella neoformans var. neoformans Myo2p-like protein gene 


0.49 


3624 


XM_044499 


Homo sapiens KIAA0484 protein (KIAA0484), mRNA 


0.0001 


3625 


L04550 


Homo sapiens DNA fragment 


2.6 


3626 


AF2 10054 


Homo sapiens bone morphogenetic protein 7 (BMP7) gene, promoter 
region and partial cds 


2.5 


3627 


AL591505 


Human DNA sequence from clone RP1 1-67112 on chromosome X, 
complete sequence [Homo sapiens] 


0.001 


3628 


AL049969 


Homo sapiens mRNA; cDNA DKFZp564A072 (from clone 
DKFZp564A072) 


0.35 


3629 


AB055284 


Macaca fascicularis brain cDNA, clone:QflA-l 1654 


0.15 


3630 


AF304448 


Homo sapiens vitamin D receptor-interacting protein complex 
component DRIP 1 50 (DRIP 1 50) mRNA, complete cds 


1.3 


3631 


AL133157 


S.pombe chromosome I cosmid clB2 


0.6 


3632 


AC002475 


Homo sapiens Chromosome 22qll.2 PCR Product dgcr-gapl In 
DGCR Region, complete sequence 


0.005 


3633 


AK003943 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1 1 10028F1 1, full insert sequence 


4E-18 


3634 


S78712 


hemolin=48 kda C2-type immunoglobulin-like protein [Hyalophora 
cecropia=giant silkmoths, Genomic, 6622 nt] 


3.5 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 




DESCRIP 


P VALUE 


3635 


AK023897 


Homo sapiens cDNA FLJ13835 fis, clone THYRO1000712 


2E-98 


3636 


X559/0 


T. brucei ESAG 8 gene for a leucine-nch repeat family protein 


0,14 


3637 


AM 031534 


Homo sapiens similar to uterine protein (M. musculus) (LOC90410), 

__^T» XT A 

mRNA 


5E-47 


3638 


Ar 044287 


Drosophila melanogaster delta adaptin subunit of AP-3 (garnet) gene, 
complete cds 


0.004 


3639 


ABOoOyi/ 


Macaca fascicularis brain cDNA clone:QtrA-14684, full insert 
sequence 


1.1 


3640 


AK00o522 


Mus musculus adult male small intestine cDNA, RUCEN full-length 
enriched library, clone. 201 03 05C02, full insert sequence 


0.41 


3641 


XTA A f\ "> 1 "> A f\ 

NM 031340 


Rattus norvegicus timeless (Drosophila) homolog (Timeless), mRNA 


0.18 


3642 


Ar334544 


Homo sapiens chromosome Y landmark: proximal chromosome 1 
translocation boundary distal to AZFc region 


0.19 


3643 


XM_033642 


Homo sapiens nucleophosmin (nucleolar phosphoprotein B23, 
numatnn) (NPM1), mRNA 


1E-66 


3645 


Ab001462 


Helicobacter pylori, strain J99 section 23 of 132 of the complete 
genome 


1.3 


3646 


L81761 


Homo sapiens (subclone 6_cl0 from PI H17) DNA sequence, 
complete sequence 


4E-12 


3647 


AJ271348 


Homo sapiens partial DRD3 gene for dopamine D3 receptor, exon 1 


0.19 


3648 


U58991 


Simian immunodeficiency virus from African green monkey, tantalus 
species (SIVtan) proviral DNA, complete genome 


0.16 


3649 


NM_0 14601 


Homo sapiens EH-domain containing 2 (EHD2), mRNA 


0.087 


3650 


NM_032717 


Homo sapiens hypothetical protein MGC1 1324 (MGC1 1324), mRNA 


0.62 


3651 


XM_004462 


Homo sapiens signal sequence receptor, alpha (translocon-associated 
protein alpha) (SSR1), mRNA 


0.79 


3652 


AY016336 


Phragmites australis rpl36-rps8 intergenic spacer, chloroplast 
sequence 


0.69 


3653 


Z63374 


H. sapiens CpG island DNA genomic Msel fragment, clone 82b3, 
forward read cpg82b3 . ft 1 a 


0.22 


3654 


t ti nmi 
U 10927 


Staphylococcus aureus M type 1 capsular polysaccharide biosynthesis 
(capA, capB, capC, capD, capE, capF, capG, capH, capl, capJ, capK, 
capL, capM) genes, complete cds 


0.026 


3655 


AE007446 


Streptococcus pneumoniae section 129 of 194 of the complete genome 


0i8 


3656 


BC006035 


Mus musculus, clone IMAGE:3492938, mRNA, partial cds 


5.3 


3657 


AF3 52740 


v/tivAJiiijrii^iiua nijfRj.aa wiuiiw v*/ivulviiv»7 j iiuLausiiiciiiic sequence 


Z. 1 


3658 


NM 032141 


Homo sapiens hypothetical protein DKFZp434K1421 
(DKFZP434K1421), mRNA 


0 


3659 


AF275231 


Phloeomyzus passerinii 12S small subunit ribosomal RNA gene, 
partial sequence; tRNA-Val gene, complete sequence; and 16S large 
subunit ribosomal RNA gene, partial sequence; mitochondrial genes 
for mitochondrial products 


0.077 
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SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3660 


AF017646 


Schizosaccharomyces pombe TFIIH subunit p47 (tfh47) gene, 
complete cds 


0.72 


3661 


AF200526 


Zea mays cellulose synthase-2 (CesA-2) mRNA, complete cds 


2.2 


3662 


AJ297654 


Sus scrofa AACT2 gene for alpha- 1-antichymotrypsin 2, exons 1-5 


0.74 


3663 


XM 018189 


Homo sapiens hypothetical protein FLJ13949 (FLJ13949), mRNA 


0.74 


3664 


AE004047 


Xylella fastidiosa 9a5c, section 193 of 229 of the complete genome 


2.2 


3665 


XM 003012 


Homo sapiens EphBl (EPHB1), mRNA 


6.9 


3666 


AF083240 


Shewanella putrefaciens multi -cytochrome gene cluster, complete 
sequence; ferrous iron transporter (feoB), deca-heme c-type 
cytochrome (mtrC), c-type cytochrome precursor (mtrA), and outer 
membrane protein precursor (mtrB) genes, complete cds 


0.085 


3667 


AF100304 


Caenorhabditis elegans cosmid W07B3, complete sequence 


0.74 


3668 


AP001348 


Homo sapiens genomic DNA, chromosome 21q21.1-q21.2 
clone:f39G8, LL56-APP region, complete sequence 


2E-18 


3669 


AB005039 


Alternaria alternata MAT2 gene, complete cds 


6.4 


3670 


NM_009181 


Mus musculus sialyltransferase 8 (alpha-2, 8-sialytransferase) B 
(Siat8b), mRNA 


0.22 


3671 


AB045894 


Nepenthes alata NaAP4 mRNA for aspartic proteinase 4, complete cds 


7 


3672 


M74944 


Simian immunodeficiency virus (32H Re-isolate of SIVmac 251, clone 
W9) env gene, complete cds 


0.25 


3673 


AF288211 


Danio rerio Nkx5-1 mRNA, complete cds 


0.072 


3674 


AC006753 


Caenorhabditis elegans cosmid Y40A1 A, complete sequence 


0.02 


3675 


S53497 


immunoglobulin epsilon chain constant region=secreted form {3* 
region} [human, B cell myeloma U-266, Genomic, 3198 nt] 


6.8 


3676 


AL5 12305 


Human DNA sequence from clone RP1 1-85E24 on chromosome 6, 
complete sequence [Homo sapiens] 


0.009 


3677 


AF207862 


Bos taurus bradykinin receptor B2 (BDKRB2) gene, promoter 


5.8 


3678 


AF269417 


Staphylococcus epidermidis strain SRI clone step.l003e03 genomic 
sequence 


5.1 


3679 


X53232 


Rat mRNA for preoptic regulatory factor-2 (PORF-2) 


2 


3680 


AB005039 


Alternaria alternata MAT2 gene, complete cds 


6.5 


3681 


AE003252 


Drosophila melanogaster genomic scaffold 142000013385369, 
complete sequence 


1.9 


3682 




i^aenornaDuius eiegans lniegnn oeta pat- J gene, complete cds 


O.UoJ 


3683 


XM 010829 


Homo sapiens potassium channel, subfamily K, member 3 (TASK-1) 
(KCNK3), mRNA 


0.25 


3684 


AF065853 


Homo sapiens OR7E1 IP pseudogene, partial sequence 


0.077 


3685 


AF325859 


Choristoneura mmiferana antifreeze protein (AFP-Lul) gene, 
complete cds 


2.3 


3686 


XM_017134 


Homo sapiens breast cancer 2, early onset (BRCA2), mRNA 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3687 


AE002139 


Ureaplasma urealyticum section 40 of 59 of the complete genome 


0.17 


3688 


J00241 


Human Ig germline kappa-L chain, C region (inv3 allele) 


0.22 


3689 


U23855 


Bos taurus von Willebrand factor gene, 5 1 flanking region and exon 1 


0.78 


3690 


AK015465 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 493045 5H04, full insert sequence 


6.2 


3691 


XM_042056 


Homo sapiens LOC86123 (LOC86123), mRNA 


2 


3692 


AL137458 


Homo sapiens mRNA; cDNA DKFZp434E222 1 (from clone 
DKFZp434E2221) 


0,086 


3693 


Z81079 


Caenorhabditis elegans cosmid F39H1 1, complete sequence 


2 


3694 


AF142998 


Pteronotus parnellii NADH dehydrogenase subunit 1 (ND1) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.69 


3695 


AKO 12282 


Mus musculus 1 1 days embryo cDNA, RIKEN full-length enriched 
library, clone:2700023I24, full insert sequence 


0.75 


3696 


AB032937 


Cucumis sativus CS-ACS1 gene for 1 -aminocyclopropane- 1 - 
carboxylate synthase, complete cds 


1.5 


3697 


M92295 


Gorilla gorilla gamma- 1 and gamma-2 globin genes, complete cds 


0.029 


3698 


AF034085 


Caenorhabditis elegans UNC-45 (unc-45) gene, complete cds 


0.0001 


3699 


AB048996 


Macaca fascicularis brain cDNA, clonerQnpA- 14055 


0.25 


3700 


AK024527 


Homo sapiens cDNA: FLJ20874 fis, clone ADKA02818 


0.000001 


3701 


AKO 14489 


Mus musculus 14 days embryo liver cDNA, RIKEN full-length 
enriched library, clone:4432412D04, full insert sequence 


4.3 


3702 


NM 022599 


Rattus norvegicus outer membrane protein (Omp25), mRNA 


1E-14 


3703 


AF177242 


Bodo saltans clone pBME40 mitocondrial minicircle DNA 


0.021 


3704 


AF183329 


Homo sapiens clone NIGMS NA10926D chromosome 10 paralogous 
sequence variant, genomic sequence 


5E-61 


3706 


AF374728 


Saxifragella bicuspidata maturase K (matK) gene, partial cds; 
chloroplast gene for chloroplast product 


0.65 


3707 


AK010113 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310068J16, full insert sequence 


5E-29 


3708 


XM_045657 


Homo sapiens McKusick-Kaufman syndrome (MKKS), mRNA 


1.8 


3709 


AJ223690 


H. sapiens D2-l-K2G7np gene for immunoglobulin kappa chain 
variable region 


0.026 


3710 


XM 008389 


Homo sapiens Clq-related factor (CRF), mRNA 


0.026 


3711 


AF200526 


Zea mays cellulose synthase-2 (CesA-2) mRNA, complete cds 


2.3 


3712 


AE000049 


Mycoplasma pneumoniae M129 section 31 of 63 of the complete 
genome 


0.68 


3713 


XM 002495 


Homo sapiens cyclic nucleotide gated channel alpha 3 (CNGA3), 
mRNA 


2.1 


3714 


AF322106 


Oncorhynchus mykiss lysozyme II precursor, gene, complete cds 


0.7 


3715 


AK002007 


Homo sapiens cDNA FLJ11 145 fis, clone PLACE1006626, highly 
similar to Homo sapiens mRNA for KIAA0928 protein 


0.029 


3716 


Z73978 


Caenorhabditis elegans cosmid ZC302, complete sequence 


0.23 


3717 


NM 023979 


Rattus norvegicus apoptotic protease activating factor 1 (Apafl), 
mRNA 


0.52 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3718 


AF335504 


Oryza sativa hemoglobin 1 (hbl), hemoglobin 3 (hb3), and 
hemoglobin 4 (hb4) genes, complete cds 


0.077 


3719 


BC006943 


Mus musculus, Similar to RIKEN cDNA 323040 1N03 gene, clone 
MGC:6906 IMAGE:2655807, mRNA, complete cds 


2 


3720 


AK007159 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:1700110I01, full insert sequence 


6 


3721 


AF033002 


Cerebratulus lacteus body wall globin gene, complete cds 


0.026 


3722 


X95276 


P.falciparum complete gene map of plastid-like DNA (IR-B) 


0.2 


3723 


XM_012629 


Homo sapiens DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 8 
(RNA helicase) (DDX8), mRNA 


1.4 


3724 


XM 042643 


Homo sapiens hypothetical protein DKFZp434G131 
(DKFZP434G131), mRNA 


0.69 


3725 


AF145640 


Drosophila melanogaster clone GH07346 BcDNA.GH07346 
(BcDNA.GH07346) mRNA, complete cds 


2.3 


3726 


X58823 


Bovine COX7cPl retroposed pseudogene for cytochrome c oxidase 
subunit VIIc 


0.21 


3727 


XM 052301 


Homo sapiens hypothetical protein MGC3199 (MGC3199), mRNA 


6E-25 


3728 


U00176 


Dictyostelium mucoroides DMUC2 clone p288m2 Gl-like and G5- 
like ORFs' proteins, complete cds 


0.00009 


3729 


AJ132901 


Drosophila guanche mitochondrial A+T-rich region 


0.071 


3730 


AL583762 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
1109N18_PCR1 of library RPCI- 11 from chromosome 14 of Homo 
sapiens (Human), complete sequence 


7E-35 


3731 


AK021083 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone:C030015A19, full insert sequence 


1.5 


3732 


XM_001934 


Homo sapiens PRO0529 protein (PRO0529), mRNA 


1.4 


3733 


AJ235836 


Labordia tinifolia chloroplast ndhF gene 


2.2 


3734 


M81855 


Rat mdr mRNA sequence 


1.9 


3735 


NM 031649 


Rattus norvegicus killer cell lectin-like receptor subfamily G, member 
1 (Klrgl), mRNA 


4.9 


3736 


Z78014 


Caenorhabditis elegans cosmid F42E8, complete sequence 


0.002 


3737 


XM 045320 


Homo sapiens chondroitin sulfate proteoglycan 6 (bamacan) (CSPG6), 
mRNA 


0.0005 


3738 


AF288788 


Staphylococcus aureus rot-like protein Rip (rip) gene, complete cds 


0.4 


3739 


D16417 


Dictyostelium discoideum mRNA 


0.008 


3740 


XI 5465 


Hamster SPC14 LINE LI repeat DNA 


0.000003 


3741 


Y14544 


Danio rerio mRNA for Hoxc8 protein 


O Old 




M13897 


Rattus norvegicus thyrotropin beta subunit (TSH-beta) gene, complete 
cds, clones RP100-14 and RP21 


3.5 


3743 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0.018 


3744 


XM 035389 


Homo sapiens ribosomal protein, large, PI (RPLP1), mRNA 


9E-36 


3745 


XM_007409 


Homo sapiens alkylation repair; alkB homolog (ABH), mRNA 


0.057 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3746 


AE007425 


Streptococcus pneumoniae section 108 of 194 of the complete genome 


0.15 


3747 


Z38113 


S.cerevisiae chromosome IX lambda clone 4554 


4.9 


3749 


X80491 


S.latifolia SLM4 mRNA 


0.02 


3750 


XM 035466 


Homo sapiens hBKLF for basic kruppel like factor (LOC51274), 
mRNA 


2E-35 


3751 


Z75957 


Human DNA sequence from cosmid U203H4, between markers 
DXS366 and DXS87 on chromosome X 


0.000001 


3752 


AE000800 


Methanobacterium thermoautotrophicum from bases 58271 to 68710 
(section 6 of 148) of the complete genome 


0.19 


3754 


AJ409501 


Mus musculus RNA binding site for Dazl protein, clone gflO 


0.73 


3755 


AJ243829 


Oryza sativa RPA gene for protein phosphatase 2 A A subunit, exons 1 
12 


3E-25 


3756 


AK026692 


Homo sapiens cDNA: FLJ23039 fis, clone LNG02242 


0.022 


3757 


S81000 


{microsatellite regions} [Cryptococcus neoformans^pathogenic yeast, 
var. gattii, serotype B, UCLA 371-B, human isolate, Genomic, 283 nt] 


0.94 


3758 


AF252420 


Bos taurus clone MNB-5 microsatellite sequence 


0.074 


3759 


AF338734 


Homo sapiens hypothetical PHD zinc finger protein XAP135 
pseudogene, complete sequence 


2.1 


3760 


AJ401038 


Human immunodeficiency virus type 1 proviral gpl60 gene for 
envelope protein, strain 97DC.KFE267 


0.79 


3761 


NC_002087 


Plasmid pRL765, complete sequence 


0.71 


3762 


AE004161 


Vibrio cholerae chromosome I, section 69 of 251 of the complete 
chromosome 


1.9 


3763 


Z54349 


H. sapiens MN/CA9 GENE 


0.56 


3764 


AF372396 


HIV-1 isolate CM53658 from Cameroon envelope glycoprotein (env) 
gene, partial cds 


1.8 


3765 


AY021127 


Oryza sativa microsatellite MRG3452 containing (TA)X19, genomic 
sequence 


1.9 


3766 


AK021485 


Homo sapiens cDNA FLJ11423 fis, clone HEMBA1001024 


3.4 


3767 


AF269468 


Staphylococcus epidermidis strain SRI clone step.l005a02 genomic 
sequence 


0.69 


3768 


AF229080 


Rattus norvegicus glucagon receptor gene, promoter and partial cds 


0.71 


3769 


X04384 


Sea urchin (L.pictus) gene for testis specific histone H2B-2 


0.23 


3770 


Z36061 


S.cerevisiae chromosome II reading frame ORF YBR192w 


0.087 


3771 


AB038488 


Halimeda discoidea chloroplast rbcL gene for ribulose-1,5- 
bisphosphate carboxylase/oxygenase large subunit, partial cds 


0.25 


3772 


XM_003759 


Homo sapiens leukocyte cell-derived chemotaxin 2 (LECT2), mRNA 


0.000001 


3773 


AF122979 


Tridacna maxima 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.076 


3774 


L44120 


Homo sapiens (clone pHK1.5D) CMT1A gene, primer binding site 


5E-39 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3775 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


0.73 


3776 


AF267225 


Candidatus Carsonella ruddii natural-host Tainarys sordida ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.024 


3777 


U67231 


Human clone HS4.74 Alu-Ya5 sequence 


2E-11 


3778 


XM_038985 


Homo sapiens similar to HYPOTHETICAL PROTEIN KIAA0379 (H. 
sapiens) (LOC91526), mRNA 


0.13 


3779 


X71625 


H.sapiens dinucleotide repeat polymorphism cA14 


0.057 


3780 


BC007480 


Mus musculus, clone MGC:7570 IMAGE: 3493227, mRNA, complete 
cds 


0.59 


378t 


U60153 


Human immunodeficiency virus type 1 isolate ID4-77, clone 12, from 
USA, envelope glycoprotein (env) gene, partial cds 


0.013 


3782 


XM_050963 


Homo sapiens hypothetical protein DKFZp56401664 
(DKFZP56401664), mRNA 


0 


3783 


AC024236 


Homo sapiens BAC clone RP1 1-400117 from Y, complete sequence 


0.0001 


3784 


M17627 


Chicken alpha-A-crystallin gene, complete cds and 5' flank 


0.077 


3785 


NM_031557 


Rattus norvegicus Prostaglandin 12 (prostacyclin) synthase (Ptgis), 
mRNA 


5.8 


3786 


Z79647 


B.thayeri mitochondrial 16S rRNA gene 


0.23 


3787 


AF032386 


Nicotiana tabacum aldose- 1 -epimerase-like protein (GP40) mRNA, 
complete cds 


0.077 


3788 


X89568 


P. sativum mRNA for HMGI/Y protein 


5.8 


3789 


Z71180 


Caenorhabditis elegans cosmid F22E12, complete sequence 


0.24 


3790 


AJ239079 


Tetrahymena pyriformis partial gc4 gene for guanylyl cyclase, isoform 
4 


0.79 


1791 


AF3 10890 


Dictyostelium discoideum PcmA (pcmA) and RacE (racE) genes, 
complete cds; tRNA-Phe gene, complete sequence; and unknown gene 


0.00003 


3792 


L22300 


Chilo iridescent virus type 6 zinc finger protein and non-histone 
chromosomal high mobility group protein homolog genes, complete 
cds 


0.021 


17Q1 


AF155054 


Rickettsia slovaca protein PS 120 (D) gene, partial cds 


0.47 




X59799 


M.musculus S-antigen gene promoter region 


0.2 


3795 


AF330221 


Oncorhynchus keta clone Oke4 microsatellite sequence 


0.56 


3796 




Homo sapiens similar to proteasome (prosome, macropain) 26S 
suDunii, non-Airase, ij yti. sapiens j jj), iiikjna 


o 


3797 


D17668 


Clostridium septicum gene for alpha-toxin, complete cds 


0.39 


3798 


AF165135 


Mycoplasma mycoides mycoides SC strain Afade hypothetical proline 
rich glycoprotein, hypothetical surface-located membrane protein, 
lipoprotein B precursor (ippB), and hypothetical ABC transporter 
protein genes, complete cds; hypothetical ABC transport> 


1.5 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQED 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3799 


AF379892 


Rhagigaster sp. M222 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.068 


3800 


NM 024379 


Rattus norvegicus glutamate receptor delta-2 subunit (Grid2), mRNA 


1.5 


3801 


AF364550 


Drosophila melanogaster transposon cruiser, complete sequence 


5.4 


3803 


AJ3 15160 


Xenopus tropicalis BMP-2 gene for bone morphogenetic protein 2 


0.24 


3804 


AJ293015 


Pisum sativum loxl:Ps:7 gene for lipoxygenase, exons 1-8 


0.079 


3805 


Z82768 


R.prowazekii genomic DNA fragment (clone A810F) 


1.5 


3806 


NM_013780 


Mus musculus neuronal PAS domain protein 3 (Npas3), mRNA 


0.72 


3807 


U56088 


Human periodic tryptophan protein 2 (PWP2) gene, exons 3 to 14 


0.25 


3808 


D88987 


Mus musculus Ampd3 gene, exon 4 


0.18 


380? 


Z72680 


S.cerevisiae chromosome VII reading frame ORF YGL158w 


0.73 


3810 


M17988 


Spiroplasma virus 4 (SpV4) replicative form, complete genome 


5.3 


3811 


XM 039684 


Homo sapiens putative G-protein coupled receptor (SH120), mRNA 


0.17 


3812 


XM_039684 


Homo sapiens putative G-protein coupled receptor (SH120), mRNA 


0.19 


3813 


X60735 


S.murinus Dq52 gene and immunoglobulin heavy chain S region gene 
loci 


1.4 


3814 


XM_045862 


Homo sapiens ring finger protein 10 (RNF10), mRNA 


5E-14 


3815 


AB005521 


Homo sapiens ppar gamma gene for peroxisome proliferator activated- 
receptor gamma, exon 1 


1.6 


3816 


AF271964 


Bos taurus microsatellite MNB-162 sequence 


0.019 


3817 


AF274307 


Zaocys dhumnades creatine kinase mRNA, complete cds 


6.1 


3818 


BC010716 


Mus musculus, clone MGC:6551 IMAGE:2655861, mRNA, complete 
cds 


5.5 


3819 


AE004219 


Vibrio cholerae chromosome I, section 127 of 251 of the complete 
chromosome 


5.7 


3820 


AB060193 


Macaca fascicularis brain cDNA clone:QccE-22277, full insert 
sequence 


1.9 


3821 


AY007106 


Homo sapiens clone TCCCIA00427 mRNA sequence 


3E-45 


3822 


X02801 


Mouse gene for glial fibrillary acidic protein (GFAP) 


0.08 


3824 


AJ234506 


Hordeum vulgare genomic DNA fragment; clone MWG0561.uni 


2.1 


3825 


AKO 19864 


Mus musculus 1 1 days pregnant adult female ovary and uterus cDNA, 
RIKEN full-length enriched library, clone:5031405K23, full insert 
sequence 


1.8 


3826 


AK023304 


Homo sapiens cDNA FLJ13242 fis, clone OVARC1000578 


6.5 


3827 


AB062056 


Homo sapiens p53DINPl gene for p53DINPla, p53DINPlb, complete 
cds, alternative splicing 


0.055 


3828 


AF379858 


Dendrocerus carpenteri 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.07 


3829 


X05621 


Chironomus pallidivittatus BR1 gene for giant secretory protein 


0.077 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ IE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


3830 


AF108841 


Homo sapiens human endogenous retrovirus HERV-H10 pol protein 
(pol) gene, partial cds; env pseudogene and 3' LTR, complete 
sequence 


0.009 


3831 


AF309030 


Ixodes scapularis haplotype SSCP NC2_29 16S ribosomal RNA gene, 
partial sequence; mitochondrial gene for mitochondrial product 


0.21 


3832 


AF268055 


Candidatus Carsonella ruddii natural-host Glycaspis brimblecombei 
RNA polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
subunit (rpoC) genes, partial cds 


0.0007 


3833 


AY021307 


Oryza sativa microsatellite MRG3632 containing (TA)X23, genomic 
sequence 


0.009 


3834 


XM 034551 


Homo sapiens dual-specificity tyrosine-(Y)-phosphorylation regulated 
kinase 4 (DYRK4), mRNA 


0.00003 


3835 


AK001647 


Homo sapiens cDNA FLJ10785 fis, clone NT2RP4000457, weakly 
similar to UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 15 
(EC 3.1.2.15) 


0 


3836 


XM_043492 


Homo sapiens 42259 (KIAA1728), mRNA 


5E-19 


3837 


U22345 


Human chromosome 20ql2 locus-specific repeat 


0.003 


3838 


J05535 


Bovine vascular smooth muscle connexin43 mRNA, complete cds 


0.18 


3839 


AF290221 


Cucumis sativus clone B80F mitochondrial genomic sequence 


0.065 


3840 


NM_p 10059 


Mus musculus disrupted meiotic cDNA 1 homolog (Dmclh), mRNA 


5.3 


3841 


NM_0 17681 


Homo sapiens hypothetical protein FLJ20130 (FLJ20130), mRNA 


2 


3842 


AE002114 


Ureaplasma urealyticum section 15 of 59 of the complete genome 


1.9 


3843 


AK022372 


Homo sapiens cDNA FLJ12310 fis, clone MAMMA1001970 


0.019 


3844 


U33633 


Rice ragged stunt virus minor structural protein gene, complete cds 


0.21 


3845 


AB062993 


Macaca fascicularis brain cDNA clone:QmoA-10825, full insert 
sequence 


4E-40 


3846 


AJ251485 


Gorilla gorilla partial lmp7 gene for large multifunctional protease 7, 
exon 7 and tap2 gene for antigen peptide transporter 2, exons 1-6 


0.71 


3847 


AP000145 


Homo sapiens genomic DNA, chromosome 21 q2 1.2, LL56-APP 
region, clone B2291C14-R44F3, segment 10/10, complete sequence 


0.68 


3848 




Homo sapiens similar to ribosomal protein S3 A (H. sapiens) 


0.017 


3849 


XM_0 12036 


Homo sapiens potassium inwardly-rectifying channel, subfamily J, 
member 5 (KCNJ5), mRNA 


5 


3850 


] 

AB035883 . 


Pseudoregma bambucicola mitochondrial genes for small subunit 
rRNA, tRNA-Val, large subunit rRNA, partial and complete 
sequences, country :Taiwan: Sun Moon Lake 


0.077 


3851 


AB037688 ; 


Xenopus laevis Xptch-2 mRNA for patched-2, complete cds 


1.8 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


seo m 
NO 


ACCESSN 


DESCRIP 


P VALUE 


3852 


AB041264 


Mus musculus pseudogene for GTP-binding protein betal subunit 


0.53 


3853 


AF288170 


Plasmodium falciparum isolate MM cytoadherence linked asexual 
protein (clag9) mRNA, partial cds 


0.083 


JOJt 


AF304321 


Neospora caninum RNA polymerase CI (rpoCl) gene, partial 
sequence, plastid gene for plastid product 


0.002 


JOJJ 


AF133181 


Peziza varia 18S ribosomal RNA gene, and internal transcribed spacer 
1, partial sequence 


2.1 


3856 


NM_012694 


Rattus norvegicus Solute carrier family 6 (neurotransmitter 
transporter, dopamine), member 3 (Slc6a3), mRNA 


0.56 


JO J / 


NM 030261 


Mus musculus hypothetical protein MGC7182 (MGC7182), mRNA 


6E-36 


JOJO 


AF231164 


Pseudosymblepharis schimperiana tRNA-Leu, partial sequence; trnL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence; chloroplast genes for chloroplast products 


0.059 


3859 


Z36070 


S.cerevisiae chromosome II reading frame ORF YBR201w 


0.064 




Z24279 


H. sapiens (D1S489) DNA segment containing (CA) repeat; clone 
AFM309ve9; single read 


0.2 


3861 


XM_036937 


Homo sapiens KIAA1020 protein (KIAA1020), mRNA 


2.1 




AB033019 


Homo sapiens mRNA for KIAA1 193 protein, partial cds 


1.6 




XM 030075 


Homo sapiens hypothetical protein DKFZp434D0513 
(DKFZp434D0513), mRNA 


2E-74 


JOOJ 


NM_023979 


Rattus norvegicus apoptotic protease activating factor 1 (Apafl), 
mRNA 


0.59 


3866 


U50715 


Mus musculus alpha-galactosidase A gene, complete cds 


0.008 


3R67 
J ou / 


AF078780 


Caenorhabditis elegans cosmid C04F2 


1.7 


3869 


L07305 


Histoplasma capsulatum (clone pMS3) H-ATPase gene, complete cds 


0.46 


3870 


XM_035796 


Homo sapiens hypothetical gene supported by AF001893; AH005586 
(LOC91063), mRNA 


1.7 


3871 


NM_025788 


Mus musculus RIKEN cDNA 49305 11N1 3 gene (49305 HN13Rik), 
mRNA 


5.4 


3872 


AF339787 


Homo sapiens clone IMAGE:205688, mRNA sequence 


2 


3873 


AF028784 


Rattus norvegicus glial fibrillary acidic proteins alpha and delta 
(GFAP) gene, alternatively spliced products, complete cds 


0.005 


JO /** 


AF328904 


Mus musculus lysyl-tRNA synthetase (Kars) gene, exon 14, complete 
cds; adenosine deaminase tRNA-speciflc 1 (Adatl) gene, exons EA, 1 
and partial cds 


0.22 


3875 


AF033647 


Rhytidiadelphus triquetrus 23 S ribosomal RNA gene, partial 
sequence; internal transcribed spacer 2, 4.5S ribosomal RNA gene, 
internal transcribed spacer 3, and 5S ribosomal RNA gene, complete 
sequence, chloroplast genes for chloroplast RNAs 


2 


3876 


BC006035 


Mus musculus, clone IMAGE:3492938, mRNA, partial cds 


5.6 


3877 


AF3 11856 


Homo sapiens spectrin beta IV (SPTBN3) mRNA, complete cds, 
alternatively spliced 


0.5 


3878 


AB041327 


Anguilla japonica GnRH receptor mRNA for gonadotropin-releasing 
lormone receptor, complete cds 


0.24 
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SEQIE 
NO 


> 

ACCESSN 


DESCRTP 


T> \/ A T T TTT 


3879 


AF220175 


sapi&iia a^tu i^iaiiuudac yi\tjj\ri) gene, exons j tnrougn 14, anc 
complete cds 


0.0009 


3880 


X89398 


Rsapiens ung gene for uracil DNA-glycosylase 


0.002 


3881 


L14017 


ouipiijiuw^A^wud aiucua uicuuiauin-resisiance protein (mecK) gene and 
unknown ORF, complete cds 


0.56 


3882 


M97008 


XenODUS laevis/eilli hvbrid TpM chain ppnp rlnn** T ^Hrnii 


u.uu / 


3883 


Z78414 


Caenorhabditis elecans co^mid WOQD 1 7 mmniptp cpnnpnrp 




3884 


AL5 12549 


S.pombe chromosome I BAC pB2B4 


1.9 


3885 


AB030616 


MuS mUSCuluS Chetk-alnha apnp ff»r r*hn1irif»/£»tHarirk1aTTiirif» l-inoco 

alpha, exon 1 and 5'-flanking region 


6.4 


3886 


AF069535 


BradVDUS tlidaCtvlUS 16S rihoSOmal RNA ctptip mitnr'finnHrial apnp 

for mitochondrial RNA, complete sequence 




3887 


AK023304 


Homo sapiens cDNA FLJ13242 fis, clone OVARC1000578 


0.056 


3888 


AB039756 


Brassica rapa SP1 1-32 mRNA for S locus protein 1 1-32, partial cds, 
strain: S32(50-l It) 


1 0 

i.y 


3889 


AL5 12405 


Human DNA sequence from clone RP1 1-370N21 on chromosome 6, 
complete sequence [Homo sapiens] 


l.O 


3890 


AF147813 


Protrama radicis elongation factor 1 alpha gene, partial cds 


0.18 


3891 


XM 046967 


xTuiiiu o<i{jiciij> ic^iuii (AriiLdining vanaDie cnarge protein on A. witn 
eight repeats; variable charge protein on X with two repeats; variable 
charge X chromosome (LOC9271JM mRNA 


U.Z 1 


3892 


XM 037664 


Homo saDiens hvoothetical nrotein MOP4.0QO f\ACif*dCiQf\} mt?Tsj a 


i <i 

1.3 


3893 


AJ296142 


Silene foptirfa nartial rnh2 (tptip finr T?XT A nnKmipncp tt ovnnr OT1y -i 
vjix^ii^ xu^utux paiuoi lyu*. gcuc r\j.N/v poiyiucnise ii, exons anu 

24 


0.16 


3894 


AF360340 


ruauiuujjaia uiaixaiia |siiutUVG pcwUIlCMcniSe ^Jv.lVCl.1 //AlJgjjj / 

mRNA, complete cds 


? 7 


3895 


AF327975 


Hentzia Dalmanim 16S riho^nrrml RNA nartial cpntipnn»' 
mitochondrial gene for mitochondrial product 


U.JJ 


3896 


Z15140 


L.esculentum mRNA for chitinase 


2.1 


3897 


U57326 


nhlamvdomonas reinhardtii rhlnrnrtlnct rvtmnlptA Tim 1 n fm <rmpnt 
rpoC2 gene, partial cds 


0.001 


3898 


AK024908 


Homo sapiens cDNA: FLJ21255 lis, clone COLO 1321 




3899 


NC 002087 


Plasmid pRL765, complete sequence 




3900 


AF063220 


Papaya ringspot vims isolate P polyprotein gene, partial cds 


0.25 


3901 


AE002173 


Chlamvdoohila nnenmoniae AR^Q w^inn Q nfQA nftht* mmniptp 
genome 


0.25 


3902 


BC001152 


Homo sapiens erowth arrest-soecific 7 clone MOP - 1 14.7 
IMAGE:3353809, mRNA, complete cds 


5.9 


3903 


AB011800 


Agrobacterium tumefaciens plasmid pRi A4 gene, virB operon 


5.4 


3904 


] 

BCO 10389 < 


Homo sapiens, clone MGC: 13615 IMAGE:4283814, mRNA, 
^>mplete cds 


0.084 


3905 


i 

AF091779 1 


Oncorhynchus mykiss non-classical MHC class I antigen (Onmy- 
JAA) gene, Onmy-UAA*0101 allele, complete cds 


2.3 


3906 


] 

AF222716 j 


Plasmodium falciparum clone 3D7 unconventional myosin PfM-B 
gene, complete cds 


6.2 
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I SEQ ID] 
NO 



ACCESSN 
248951 



DESCRIP 



P VALUE 



3907 
3908 



|S.cerevisiae chromosome XVI cosmid 9723 



2.5 



XM_040326 
AF271964 



Homo sapiens absent in melanoma 1 (AIM1), mRNA 



0.056 



3909 



Bos taurus microsatellite MNB-162 sequence 



0.019 



3910 
3911 



BC003261 
AJ011641 



Mus musculus, serine/threonine kinase 5, clone MGC:5803 
IMAGE:3501444, mRNA, complete cds 



0.056 



Arabidopsis thaliana (ecotype Columbia) sp!8 gene, exons 1-3 



1.7 



3912 
3913 



AK000864 Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 



0.53 



XM_045747 jHomo sapiens midline 1 (Opitz/BBB syndrome) (MIDI), mRNA 



2E-33 



3914 AF269581 



Staphylococcus epidermidis strain SRI clone step. 101 lg07 genomic 
I sequence 



3916 AF269810 



Staphylococcus epidermidis strain SRI clone step.l023h06 genomic 
1 sequence 



0.057 



3915 XM 042333 Homo sapiens casein kinase 2, beta polypeptide (CSNK2B), mRNA 1.9 



0.022 



3917 | AJ404228 TSaccharomyces douglasii mitochondrial tRNA gene cluster 
iHomo sapiens hypothetical gene supported by AK021954 

3918 I XMJ)27223 (LOC89916), mRNA - 

3919 I AJO 1 1641 J Arabidopsis thaliana (ecotype Columbia) sp!8 gene, exons 1-3 



3920 AF020725 



Gallus gallus unoccupied integration site for endogenous ALV-type 
retroviral element ev-B2 



0.21 



0.61 



3921 
3922 



AL157825 



Human DNA sequence from clone RP1 1-518C21 on chromosome 
Xql3.2-21. 1, complete sequence [Homo sapiens] 



1.2 



AF203676 Myotis myotis microsatellite H23 sequence 



0.2 



3923 



XM_0 10734 Homo sapiens smg GDS-ASSOCIATED PROTEIN (Kifap3), mRNA 
iHeterodera glycines beta-l,4-endoglucanase-3 precursor (eng-3) gene, 
3924 I AF056048 [promoter and complete cds 



0.66 



3926 AL359750 



Human DNA sequence from clone RP11-24A10 on chromosome 13, 
I complete sequence [Homo sapiens] 



3928 AL023845 [Caenorhabditis elegans cosmid Y51B9 A, complete sequence 



3929 BC001781 



Homo sapiens, ribosomal protein L44, clone MGC2064 
|lMAGE:3353669, mRNA, complete cds 



0.026 



3925 XMJ)42676 Homo sapiens hypothetical protein MGC4618 (MGC4618), mRNA 2.2 



0.024 



3927 NMJH9054 Homo sapiens hypothetical protein MGC5 560 (MGC5 560), mRNA 2E-31 



0.25 



7E-43 



3930 AK004553 



Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200002022, full insert sequence 



0.51 



3932 [ X17339 |Dengue-2 virus NS1 gene for nonstructural protein (patient M2) | 0.23 
jVibrio cholerae chromosome I, section 186 of 251 of the complete 
| chromosome I q 35 



3933 AE004278 



Drosophila melanogaster putative extracellular ligand trunk gene, 
3934 I U41064 | complete cds 



0.53 



3935 AF005900 



Homo sapiens alpha2B-adrenergic receptor (alpha2C2AR) gene, 
I complete cds 



5.2 



3936 I AK024599 jHomo sapiens cDNA: FLJ20946 fis, clone ADSE01819 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



3941 



3944 



3948 



3949 



ACCESSN 
K00489 



DESCRIP 



[human enkephalin gene, intron c (3' end) and 3* flank 



U82516 



iDictyostelium discoideum random slug CDNA22 protein (rsc22) 
|mRNA, complete cds 



3943 AF241235 



Homo sapiens Na+,K+-ATPase gamma-subunit (FXYD2) gene, 
alternatively spliced isoforms 1 and 2, exons 1 through 6 and 
[complete cds 



[Human 3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4- 
M38180 isomerase (3-beta-HSD) gene, complete cds 



3945 AJ275020 



Homo sapiens partial FAH gene for fumarylacatoacetate hydrolase, 
intron 10 



3946 AB042 198 Mus museums mac25 gene promoter and exon 1, partial cds 



3947 AF082520 | Cryptosporidium parvum Hsp60 gene, partial cds 



Z4805 1 H.sapiens gene for myelin oligodendrocyte glycoprotein (MQG) 



AF269768 



[Staphylococcus epidermidis strain SRI clone step.l023b09 genomic 
| sequence 



3950 f AK022181 [Homo sapiens cDNA FLJ121 19 fis, clone MAMMA1000092 



3951 AF186889 



[Chlosyne janais 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 



3956 AB038979 Homo sapiens gene for caspase-10, exon 1 1 and complete cds 



3958 AF148987 



[Legionella fallonii macrophage infectivity potentiator (mip) gene, 
[partial cds 



3961 AF072439 [Rattus norvegicus zinc-finger protein-37 mRNA, complete cds 



P VALUE 



0.24 



3938 [ U92545 I Gallus gallus tubby-like protein (TULP1) mRNA, complete cds | L8~ 
[Human DNA sequence from cosmid L60G9B, Huntington's Disease 

3939 | Z69363 [Region, chromosome 4pl6.3 contains ES Ts | o 55 



1.8 



Mus musculus RIKEN cDNA 3 1 10040N1 1 gene (3 1 10040N1 IRik) 
3942 | NM 026077 mRNA 



e-107 



0.00002 



0.64 



0.71 



1.9 



0.68 
0.71 



0.59 



0.62 



0.056 



3952 AB026544 Hyoscyamus niger trl gene for tropinone reductase-I, complete cds | 0.17 



3954 M32666 [Homo sapiens ITGB3 gene, intron 1, partial sequence 

3955 Z72888 [ Sxerevisiae chromosome VII reading frame ORF YGR103w 



0.009 



0.001 



0.075 



3959 j AF159173 I Gallus gallus structural muscle protein titin mRNA, partial cds 
[Mycoplasma pulmonis FtsZ (ftsZ) gene, complete cds, methionyl- 

3960 | U34931 [tRNA synthetase (metG) gene, partial cds 



1.2 



0.21 



0.58 



3962 AF079877 [Mus musculus cyclin G2 (Ccng2) gene, complete cds 

3963 X87947 |S.cerevisiae ALG2 gene 



0.049 
0.024 



3964 BC004730 



Mus musculus, proteasome (prosome, macropain) subunit, beta type 
10, clone MGC:5837 IMAGE:3583052, mRNA, complete cds 



6.8 



3965 | BC010610 [Homo sapiens, clone IMAGE: 42 145 15, mRNA, partial cds 



3966 AK012050 



Mus musculus 10 days embryo cDNA, RIKEN full-length enriched 
library, clone:2610319A01, full insert sequence 



2E-11 



e-144 



3967 BC000240 



Homo sapiens, chromosome 4 open reading frame 1, clone 
|IMAGE:3352004, mRNA 



1.9 



3969 | AE007346 | Streptococcus pneumoniae section 29 of 194 of the complete genome 4.4 
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SEQID 
NO 






n 1 T A T T I Li' 

P VALUE 


3970 


anuz i y\yy 


Untnn canipnc r* T^XJ A T7T T 1 1 Qf\*7 €a c /*Ism-ia /TDD 1 AA Art C n 

nonio oiipiciib cj-/in/\ fjl»j 1 i.y\j / us, Clone m r, tvi rs p ImUUUj" 


0.005 


3971 


U05765 


Human annexin V (ANX5) gene, exon 7 


4.9 


3972 


S68117 


rrLf-A— proiacun-nKe protein a (3 region, exon 1, intron 1} [rats, 
Genomic, 1187 nt] 


0.11 


3973 


AL358852 


riuman jl/jna sequence rrom cione Kri-J i /iNy on chromosome o, 
complete sequence [Homo sapiens] 


0.18 


3974 


XV1ZOZZ 1 


/Airiwiii green monKcy origin oi replication ^v/tvoo j region 


U.U1 


3975 




Mouse MHC class I Lyt-2-a gene encoding lyt-2. 1 T-cell surface 
anoanugen, compieie cos 


/ 1 


3976 




Arabidopsis thaliana putative wall-associated kinase 1 (F16F4.6) 

«riD XT A n-% rxliiff* /"vie 

iiiiviN/\, cumpicic wjo 


1 5 

D.J 


3977 


U67488 


Methanococcus jannaschii section 30 of 150 of the complete genome 


0.021 


3979 


XM 027089 


Homo sapiens N-acetylated alpha-linked acidic dipeptidase 2 
(NAALAD2), mRNA 


1E-09 


3980 


AF062751 


Manduca sexta soluble guanylyl cyclase beta- 1 subunit mRNA, 
complete cds 


0.53 


3981 


AP000214 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone f43Dl 1-119B8, segment 12/12, complete sequence 


0.0003 


3982 


AtjOUZ /o 


/vraDiaopsis inanana puiauve rviLrii protein (Ai4guyi4U j ; iTikjna, 
compieie cos 


U.UUU 1 


3983 


AF187968 


z xiomo sapiens concenirauve nucieosiae transporter {\,rn 1 1 ) gene, 
exon 2 


0.085 


3984 


rV I XJjZ, /H- 1 


Drosophila melanogaster H.M.S. Beagle transposon long terminal 
repeat, complete sequence; and heat shock protein Hsp70Ab gene, 

TvtwfnfYf"<»f Qfi/I nci fti q 1 /vie 
piUlllUlCI allU pcLTLldl \AXS 


v/.o 


3985 


IN XVI UUoOy / 


\A|ic mi t cm line n i Ti**i n /"Win^ mDMA 




3986 


AB055369 


Macaca fascicularis brain cDNA, clone:QflA- 12661 


0 


3987 


A1J777 


Kluyveromyces lactis mitochondrial tRNA-Val, COII and COI 

^Pdllld-l ) 




3988 




Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 




3989 


X79482 


I.punctatus Immunoglobulin mu heavy chain gene 


0.00007 


3990 


AF051289 


uaiouius cnariesi iN/vL/ri oenyarogenase suoumt x gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.072 


3991 


DL-UU'frJoZ 


riUIIlU ZMipiCIla, LrlUIlC ULYl^WJCr. DO'rVzfoJLy ITLivlN/A, pdlTiai CuS 


c-ih / 


3992 


YU fid 14** 9 


Homo sapiens hypothetical gene supported by AL1 17603 




3993 


NM_027142 


Mus musculus RIKEN cDNA 2310012P17 cene r2310012P17Rik^ 
mRNA 


0.027 


3994 


AB022673 


Oryza sativa rpll2-l gene for chloroplast ribosomal protein L12, 
complete cds 


0.69 


3995 


AB009987 


Bombyx mori nuclear polyhedrosis virus gene for putative DN A- 
directed RNA polymerase component lefB, complete cds 


0.2 


3996 


NM_000144 


Homo sapiens Friedreich ataxia (FRDA), mRNA 


O.O08 
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NO 
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3997 


XM_002395 


Homo sapiens LBP protein; likely ortholog of mouse CRTR-1 (LBP- 
9), mRNA 


0.58 


3998 


AF056532 


Brassica napus cultivar Darmor AP3 1 gene, AP3 1-DN allele, partial 
cds 


2 


3999 


Z72888 


Sxerevisiae chromosome VII reading frame ORF YGR103w 


0.008 


4000 


AL5 12660 


Human DNA sequence from clone RP1 1-353M9 on chromosome 10, 
complete sequence [Homo sapiens] 


5.3 


4001 


AF190131 


Cloning vector pVO205 hygromycin-B-phosphotransferase (hph) 
gene, complete cds 


1.7 


4002 


AJ132557 


Oryctolagus cuniculus CYP19 gene, ovarian promoter region 


0.069 


4003 


AB001901 


Homo sapiens PACE4 gene, exon 4-7 


4 


4004 


AE006784 


Sulfolobus solfataricus section 143 of 272 of the complete genome 


0.19 


4005 


XM_049237 


Homo sapiens KIAA0841 protein (KIAA0841), mRNA 


1.7 


4006 


AF107676 


Aedes aegypti clone 416 Feilai family of SINES 


0.54 


4007 


AB053857 


TT virus gene for ORF1, partial cds, clone:Pd-10-3 


0.24 


4008 


AC078798 


Homo sapiens 3 BAC PAC-56F11 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


5 


4009 


NM_026040 


Mus musculus RUCEN cDNA 2810036K01 gene (2810036K01Rik), 
mRNA 


2E-33 


4011 


XM_027300 


Homo sapiens DKFZP434K114 protein (DKFZP434K1 14), mRNA 


4E-08 


4012 


AY021973 


Oryza sativa microsatellite MRG4298 containing (TA)X104, closest 
to marker S 10620, genomic sequence 


0.0009 


4013 


AB003097 


Fruitfly strain g20 mitochondrial DNA, A+T-rich region, partial 
sequence 


0.000003 


4014 


AJ000682 


Bos taurus PPARG gene, intron 1, 5' 


0.1 


4015 


AE001504 


Helicobacter pylori, strain J99 section 65 of 132 of the complete 
genome 


0.64 


4016 


NM_031442 


Homo sapiens hypothetical protein DKFZp761J17121 
(DKFZP761J17121), mRNA 


0.007 


4017 


U10698 


Rattus norvegicus liver microsomal carboxylesterase mRNA, complete 
cds 


0.21 


4018 


AF009255 


Homo sapiens putative chloride channel gene (CLCN6), exons 14, 15, 
16, and 17 


1.7 


4019 


NM_010657 


Mus musculus kappa B and Rss recognition component (Krc), mRNA 


0.17 


4020 


AY021100 


Oryza sativa microsatellite MRG3425 containing (TA)X19, closest to 
marker L246, genomic sequence 


4.7 


4021 


U43092 


Rattus norvegicus surfactant protein-A (SP-A) gene, complete cds 


0.9 


4022 


XM 050127 


Homo sapiens oligophrenin 1 (OPHN1), mRNA 


0.64 


4023 


AE005576 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 195 
of 290 


0.45 


4024 


AF288693 


Mus musculus Ubell (Ubell) gene, partial cds 


0.0008 


4025 


AF203748 


Taphozous sp. Brcal (Brcal) gene, partial cds 


5.6 
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NO 
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P VALUE 


4026 


AF179234 


Homo sapiens short-chain dehydrogenase/reductase 1 (SDR1) gene, 
exon 1 


0.56 


4027 


AF163735 


Illicium parviflorum internal transcribed spacer 1, 5.8S ribosomal 
RNA gene, and internal transcribed spacer 2, complete sequence 


0.62 


4028 


NM 026233 


Mus musculus RIKEN cDNA 4933434120 gene (4933434I20Rik), 
mRNA 


6.5 


4029 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 
22ql3. 1 Contains two exons of the APOL2 gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence [Homo sapiens] 


0.000001 


4030 


XM 008148 


Homo sapiens myeloid/lymphoid or mixed-lineage leukemia (trithorax 
(Drosophila) homolog); translocated to, 6 (MLLT6), mRNA 


0.63 


4031 


AF007904 


Chaetopterus variopedatus histone H4 and histone H2 A genes, 
complete cds 


0.009 


4032 


XM 051353 


Homo sapiens C-type (calcium dependent, carbohydrate-recognition 
domain) lectin, superfamily member 5 (CLECSF5), mRNA 


0.69 


4033 


U88154 


Homo sapiens proline and glutamic acid rich nuclear protein isoform 
mRNA, partial cds 


0.074 


4034 


AC023838 


Arabidopsis thaliana chromosome III PI MJM20 genomic sequence, 
complete sequence 


0.2 


4035 


X15894 


Sinapsis alba cab-1 gene for chlorophyll a/b-binding polypeptide 


0.1 


4036 


X54742 


Nicotiana plumbaginifolia beta-(l,3)-glucanase gene for a vacuolar 
isoform 


1.8 


4037 


NM 017590 


Homo sapiens hypothetical protein DKFZp434K0920 
(DKFZp434K0920), mRNA 


0.13 


4038 


U92482 


Equus caballus interleukin-1 receptor antagonist (EqlL-IRA) mRNA, 
complete cds 


0.38 


4039 


U47540 


Aspergillus nidulans nitrogen regulatory protein (tamA) gene, 
complete cds 


0.18 


4040 


AK001076 


Homo sapiens cDNA FLJ10214 fis, clone HEMBA1006530 


0.0000009 


4041 


AC002049 


Homo sapiens Chromosome 22qlL2 Cosmid Clone 107d7 In BCRL2- 
GGT Region, complete sequence 


1.3 


4042 


U55243 


Dictyostelium discoideum glyceraldehyde-3 -phosphate dehydrogenase 
(GAPDH) gene, partial cds 


0.23 


4043 


Z16701 


H. sapiens (D1S218) DNA segment containing (CA) repeat; clone 
AFM157xe7; single read 


0.002 


4044 


] 


Rhododendron semibarbatum chloroplast matK gene for ribosomal 
maiurase, complete cos 


0.024 


4045 


AF034950 3 


Bos taurus gonadotropin hormone receptor (GnRH) gene, partial cds 


0.22 


4046 


1 

AJ307017 


VIus musculus mRNA for putative ubiquitin-specific protease (Usp9y 
gene) 


1.7 


4047 


AE001159 


Borrelia burgdorferi (section 45 of 70) of the complete genome 


0.89 


4048 


AF239215 1 


7 orficula auricularia microsatellite FA2 sequence 


0.0002 
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4049 AP000249 



iHomo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:B762Q15, complete sequence 



P VALUE 



4050 
4051 



AP000249 
AF355470 



[Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:B762Q15, complete sequence 



iMus musculus cysteine dioxygenase gene, exon 2 



4052 BC010739 



IHomo sapiens, hypothetical protein FLJ12612 similar to COP9 
(constitutive photomorphogenic), summit 7b (Arabidopsis), clone 
MGC: 16836 IMAGE: 3 893 022, mRNA, complete cds 



4053 
4054 



Mus musculus adult male testis cDNA, RUCEN full-length enriched 
AK016525 library, clone:4932415G12, full insert sequence 



4055 



AJ237609 [Borrelia recurrentis vmpAl gene, silent copy 



UO0038 



*s5 



4056 AF268053 



[Caenorhabditis elegans cosmid T21D1 1, complete sequence 
Candidatus Carsonella ruddii natural-host Tainarys sordida RNA 
polymerase beta subunit (rpoB) and RNA polymerase beta-prime 
[subunit (rpoC) genes, partial cds 



4058 AKO 19557 



Mus musculus adult male testis cDNA, RDCEN full-length enriched 
library, clone:4930402H05, full insert sequence 



4059 AF233898 [Escherichia coli strain KI12 18 Bfp A (bfpA) gene, partial cds 



£0 



4064 



AF276 169 I Scaphirhynchus platorynchus microsatellite Spl- 100 sequence 



4065 AF1 56797 



Mus musculus ATP synthase coupling factor 6 gene, 5* UTR; and 
GABP-alpha subunit gene, promoter and 5' UTR, partial sequence 



4066 I AF1 10420 [Mus musculus hoTgN37INRA locus sequence 



4067 AK021659 I Homo sapiens cDN A FLJ1 1597 fis, clone HEMB A 1003856" 



4068 AL5 12724 



[Homo sapiens mRNA; cDNA DKFZp547M202 (from clone 
|DKFZp547M202) 



4069 S54531 |L-plastin {exon 1, promoter} [human, Genomic, 3231 nt] 



4070 



L02110 



Mus musculus proviral retroviral insertion in the cGMP- 
phosphodiesterase (rd beta PDE) gene, intron 1, with the proviral 
insert encompassing the env pseudogene (3' end) and 3' LTR 



Homo sapiens ring finger protein 21, interferon-responsive (RNF21) 
4071 I XM 015244 mRNA 



4072 AK015420 



I Mus musculus adult male testis cDNA, RDCEN full-length enriched 
library, clone:4930448I18, full insert sequence 



1E-18 



2E-18 



0.61 



4.5 



1.5 



1.8 



4.3 



0.067 



4057 AC016694 Homo sapiens BAC clone RP1 1-123 Gl from Y, complete sequence 0.2 



0.15 



0.063 



5.7 
1.7 



4060 AK026269 [Homo sapiens cPN A: FLJ226 16 fis, clone HSI05 164 

4061 [ Z80362 H. sapiens HLA-PRB pseudogene, exon 1; 
I Homo sapiens hypothetical gene supported by AK026722 

4062 | XM_046282 (LOC92635), mRNA | 0 007 

Human line-1 reverse transcriptase gene, partial cds, and granulocyte 

4063 | U83303 chemotactic protein-2 (GCP-2) gene, complete cds 



5E-12 
0.074 



1.2 



4E-10 



6E-10 



0.2 



0.023 



2.1 



1.5 



0.003 



4073 U67583 Methanococcus jannaschii section 125 of 150 of the complete genome I 0.021 

4074 I X05640 |Mouse NF-M gene for middle-molecular-mass neurofilament protein | 0.072 
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SEQ IE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


4075 


D01021 


Anthocidaris crassispina mRNA for dynein beta-heavy chain, 
complete cds 


0.61 


4076 


XM 029072 


Homo sapiens LOC89256 (LOC89256), mRNA 


0.006 


4077 


NM 017757 


Homo sapiens hypothetical protein FLJ20307 (FLJ20307), mRNA 


0.26 


4078 


AF145592 


Nicotiana tabacum calcium/calmodulin-dependent protein kinase 
(CCaMK) gene, exons 


0.002 


4079 


AF3 84970 


Arabidopsis thaliana somatic embryogenesis receptor-like kinase 3 
(SERK3) mRNA, complete cds 


5.3 


4080 


AF238313 


Dictyostelium discoideum developmental protein DG1037 (DG1037) 
gene, partial cds 


0.2 


4081 


NM 012519 


Rattus norvegicus Ca-H-/calmodulin-dependent protein kinase II, delta 
subunit (Camk2d), mRNA 


0.071 


4082 


AL590384 


Human DNA sequence from clone RP1 1-349A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


0.11 


4083 


AF391284 


Homo sapiens llpl5.5 clone LOH11A, partial sequence 


0.17 


4086 


XM 034197 


Homo sapiens KIAA1505 protein (KIAA1505), mRNA 


e-153 


4087 


AE007255 


Sinorhizobium meliloti plasmid pSymA section 61 of 121 of the 
complete plasmid sequence 


5.4 


4088 


AE006586 


Streptococcus pyogenes Ml GAS strain SF370, section 115 of 167 of 
the complete genome 


1.6 


4089 


AF391284 


Homo sapiens llpl5.5 clone LOH11A, partial sequence 


0.14 


4090 


U41549 


Caenorhabditis elegans cosmid F22F1 


0.065 


4091 


AF231164 


Pseudosymblepharis schimperiana tRNA-Leu, partial sequence; trnL- 
trnF intergenic spacer, complete sequence; and tRNA-Phe, partial 
sequence; chloroplast genes for chloroplast products 


0.066 


4092 


XM 005131 


Homo sapiens chromatin accessibility complex 1 (CHRAC1), mRNA 


0.39 


4093 


NM_025929 


Mus musculus RIKEN cDNA 2010109103 gene (2010109I03Rik), 
mRNA 


5.1 


4094 


X82784 


D.discoideum mRNA for calcium binding protein 


0.00001 


409 S 


U04524 


Caedibacter taeniospiralis 47 R body synthesis and assembly (rebA 
rebB, rebC, rebD) genes, complete cds 


0.58 


4096 


XM_050063 


Homo sapiens B-cell CLL/lymphoma 9 (BCL9), mRNA 


0.075 


4097 


XM_037650 


Homo sapiens syndecan 1 (SDC1), mRNA 


2 


4098 


AF286472 


Homo sapiens retinitis pigmentosa GTPase regulator (RPGR) gene, 
exon ORF1 5 partial cds 


0.076 


4099 


L05466 


Pneumocystis carinii beta-tubulin gene, complete cds 


0.009 


4100 


Zo9723 


Human DNA sequence from cosmid U238E5, between markers 
DXS6791 and DXS8038 on chromosome X 


0.18 


4101 


] 

D83657 


Human DNA for CAAF1 (calcium-binding protein in amniotic fluid 
1), complete cds 


5E-30 


4102 


] 

AL1 17050 < 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


2.1 


4103 


AE004168 < 


Vibrio cholerae chromosome I, section 76 of 251 of the complete 
chromosome 


2.3 
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SEQ ID 
NO 


» 
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DESCRIP 


P VALUE 


4104 


BC007114 


Homo sapiens, clone IMAGE:4295422, mRNA 


0.22 


4105 


D13859 


Penicillium chrysogenum mitochondrion genes for rRNA, tRNA and 
S5 ribosomal protein, complete cds 


0.75 


4106 


AF 137068 


Canis familiaris cubilin (CUBN) mRNA, complete cds 


4.3 


4107 


AK012387 


Mus musculus 1 1 days embryo cDNA, RIKEN full-length enriched 
library, clone:2700047H13, full insert sequence 


0.38 


4108 


NC_001942 


Mycoplasma arthritidis bacteriophage MA VI, complete genome 


0.12 


4109 


AF333431 


Homo sapiens pseudo attP site psiD, partial sequence 


0.45 


4110 


AF305628 


Cubitermes subarquatus microsatellite P19 sequence 


0.68 


4111 


D 10997 


C.ellipsoidea rbcL, rpsl4, trnM, trnG, trnD, trnS, rps4, atpE, atpB 
genes, complete cds 


1.8 


4112 


Z99704 


Human DNA sequence from clone LL22NC01-75B8 on chromosome 
22 Contains GSSs, complete sequence [Homo sapiens] 


0.005 


4113 


XM 016579 


Homo sapiens androgen-induced prostate proliferative shutofT 
associated protein (KIAA0979), mRNA 


1.6 


4114 


NM_008149 


Mus musculus glycerol-3 -phosphate acyltransferase, mitochondrial 
(Gpam), mRNA 


5.5 


4115 


XM_036116 


Homo sapiens KIAA0594 protein (KIAA0594), mRNA 


e-144 


4116 


XM_030510 


Homo sapiens microfibrillar-associated protein 2 (MFAP2), mRNA 


2.4 


4117 


ABO 10203 


Leptospira interrogans gene, 19kb region containing 5S rRNA gene 


1.4 


4118 


XM_007094 


Homo sapiens collagen, type IV, alpha 1 (COL4A1), mRNA 


1.9 


4120 


AF124049 


Fugu rubripes double stranded RNA adenosine deaminase RED IB 
gene, complete cds 


0.066 


4121 


AY007236 


Pneumocystis carinii pheromone receptor a mRNA, complete cds 


0.53 


4122 


AF3 07966 


Aptostichus simus clone PD1 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


2 


4123 


AJ001022 


Marmata monax mRNA for tumor suppressor, p53 


1.3 


4124 


AF239469 


Polycycnis gratiosa maturase (matK) gene, partial cds; chloroplast 
gene for chloroplast product 


0.65 


4125 


X75675 


C.parapsilosis mitochondrial ND6 and ND1 genes 


0.00001 


4126 


AF335589 


Glycine max chitinase class I (Chial) and hypothetical protein genes, 
complete cds 


0.0003 


4127 


U33175 


Oryza sativa sucrose phosphate synthase gene, complete cds 


6.4 


4128 


AB028198 


Arabidopsis thaliana ZM4 mRNA, complete cds 


0.008 


4129 




tieteropneustes iossnis prolactin Hormone (PRL) mRNA, complete cds 


1.3 


4130 


M63955 


S.oedipus MHC class I SO-NI gene 


0.6 


4131 


] 

BC003238 < 


Mus musculus, protein kinase, cAMP dependent, catalytic, alpha, 
:lone MGC6169 IMAGE:3497908, mRNA, complete cds 


0.22 


4132 


AJ001688 ] 


Homo sapiens NKG2D gene, exons 6-9 


0.67 


4133 


D10872 1 


rlumam h NAT allele 3-2 gene for arylamine N-acetyltransferase 


0.016 
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bbQ ID 
IN LI 


ACCESSN 


DESCRIP 


P VALUE 


4134 


AF049877 


Cotesia congregata EP1 locus with complete integrated Cotesia 
congregata vims EP1 circle and partial integrated A circle 


0.19 


A 1 1 C 

41 j5 


Y15783 


Rickettsia prowazekii groESL gene 


0.3 


4136 


Y00354 


Xenopus laevis gene encoding vitellogenin A2 


0.077 


4137 


NC_001950 


Scyliorhinus canicula mitochondrion, complete genome 


0.019 


4138 


AF266048 


Macromia splendens small subunit ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0.42 


4ijy 


AL035426 


Human DNA sequence from clone 370N13 on chromosome Xq25- 
26.3. Contains an exon of the GRIA3 gene for glutamate receptor, 
ionotrophic, AMPA 3. Contains ESTs, complete sequence [Homo 
sapiens] 


0.0006 


a i /in 


AFO 18434 


Mus musculus long-chain acyl-CoA dehydrogenase (Acadl) gene, 
intron 1 


6.4 


/I 1 A 1 

4141 


AF309654 


Mus musculus U2AF small subunit-related protein (U2afl-rsl) gene, 
complete cds 


0.64 


A 1 /IT 

414/ 


AJ009913 


Bos taurus pip gene 


2.1 


4143 


XM_028213 


Homo sapiens up-regulated by BCG-CWS (LOC641 16), mRNA 


0.078 


4144 


U67550 


Methanococcus jannaschii section 92 of 150 of the complete genome 


0.081 


4143 


AB037771 


Homo sapiens mRNA for KIAA1350 protein, partial cds 


0 


4146 


AP001438 


Homo sapiens genomic DNA, chromosome 2 lq22.2, clone:T695, 
LB7T-ERG region, complete sequence 


0.009 


4147 


U88995 


Callithrix geoffroyi clone CGE85 mitochondrial D-loop, partial 
sequence 


0.57 


4148 


AE007343 


Streptococcus pneumoniae section 26 of 194 of the complete genome 


1.9 


4149 


Z83832 


Avena sativa mRNA for UDP-glucose: sterol glucosyltransferase 


0.072 


4150 


BC007664 


Homo sapiens, hepatocellular carcinoma-associated antigen 59, clone 
MGC:866 IMAGE: 3 140172, mRNA, complete cds 


0 


4151 


NM_002577 


Homo sapiens p21 (CDKNlA)-activated kinase 2 (PAK2), mRNA 


5E-39 


4152 


XM_040120 


Homo sapiens ADP-ribosylation factor GTPase activating protein 1 
(ARFGAP1), mRNA 


2.1 


4153 


NM031511 


Rattus norvegicus Insulin-like growth factor II (somatomedin A) 
(Igf2), mRNA 


0.024 


A \ CA 

4154 


XM_049817 


Homo sapiens Sec23-interacting protein pl25 (P125), mRNA 


1.8 


4155 


BCO 10892 


Homo sapiens, clone MGC: 13350 IMAGE:4333615, mRNA, 
complete cds 


0.000004 


4156 


AJ222787 


Hordeum vulgare mRNA for alpha-keto acid dehydrogenase-like 
protein, clone RG136 


0.075 


4157 


AL021306 


Human DNA sequence from clone CTB-1 109B5 on chromosome 22 
Contains a GSS, complete sequence [Homo sapiens] 


0.000001 


4158 


AF315813 


Norwalk virus (Hu/NLV/OC96065/1996/JP) polymerase gene, partial 
cds 


0.71 
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SEQID 
NO 
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DESCRIP 


P VALUE 


4159 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
complete sequence 


0.0008 


4160 


U32706 


Haemophilus influenzae Rd section 21 of 163 of the complete genome 


0.6 


4161 


AF293663 


Anopheles culicifacies 12S ribosomal RNA gene, partial sequence; 
mitochondrial control region, complete sequence; and tRNA-Ile gene, 
partial sequence; mitochondrial genes for mitochondrial products 


0.066 


4162 


XM037617 


Homo sapiens acetyl-Coenzyme A carboxylase alpha (ACACA), 
mRNA 


3E-35 


4163 


AF305712 


Bos taurus insulin-like growth factor binding protein-3 (IGFBP3) 
gene, complete cds 


0.23 


4164 


AK018618 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130014E20, full insert sequence 


0.68 


4165 


Y00354 


Xenopus laevis gene encoding vitellogenin A2 


0.079 


4166 


M61827 


Human leukosialin (CD43) gene, complete cds 


2.2 


4167 


M64861 


D.discoideum TFIID mRNA, complete cds 


0.066 


4169 


AB028894 


Mus musculus Rpsl 1, U35 genes for ribosomal protein S 1 1 and U35 
snoRNA, complete cds and sequence 


0.6 


4170 


XM_003974 


Homo sapiens hypothetical protein FLJ1 1159 (FLJ1 1 159), mRNA 


0.68 


4171 


AF100658 


Caenorhabditis elegans cosmid H08G01 


0.75 


4172 


AF 11 5645 


Sedum multiceps maturase K (matK) gene, partial cds; chloroplast 
gene for chloroplast product 


0.0001 


4173 


AF064029 


Helianthus tuberosus lectin 1 mRNA, complete cds 


0.11 


4175 


AF093754 


Arabidopsis thaliana trans-membrane nitrate transporter protein 
AtNRT2 : 1 mRNA, complete cds 


0.22 


4176 


M14292 


Human LIHeg repetitive element from the intergenic region of the 
epsilon and G-gamma globin genes 


2E-08 


4177 


NM_0 13564 


Mus musculus insulin-like 3 (Insl3), mRNA 


0.067 


4178 


AF218815 


Coturnix japonica early response to neural induction ERNI (ERNI) 
mRNA, partial cds 


0.21 




U67616 


Human 28S ribosomal RNA pseudogenes and alu repeat region 
sequence 


6E-20 


4180 


AJ297945 


Frankia sp. ArI3 cryptic plasmid pFQ3 1 ORF AF, korS AF gene, ORF 
CrF, ORF DrF, repF gene, ORF FrF, ORF GF, ORF HF, ORF JF, 
ORF KrF, ORF LrF, ORF MF, ORF NrF, ORF OF, ORF PrF, parAF 
gene and ORF SrF 


0.68 


4181 




no mo sapiens avxl# gene ior giycogen-ueorancmng enzyme, exon i*f 


ft f*D 


4182 


AE006774 


Sulfolobus solfataricus section 133 of 272 of the complete genome 


0.063 


4183 


U38803 


Dictyostelium discoideum vacuolar proton ATPase 100-kDa subunit 
(vatM) mRNA, complete cds 


0.025 


4184 


U78096 


Human macrophage colony stimulating factor receptor (c-fms) gene, 
exon 1 A, 2 and partial cds 


0.0001 
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SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4185 


AJ I\JJ 1 


T th*»rmnnhi1a OR-^eRNA DNA 

1 . 11 LCI lllUJJlllla VJO OtlxiirV Ulrlt\. 


S 7 


4186 


AJ001005 


Picea abies chloroplast DNA for ibcL pseudogene 


0.67 


4187 




noino sdpiens /vpuucvj- i conipieineniauon iactor, j\±kjdi1\^- l 

cti mill Qti n <j T>vr\tf*\ n ( lk L C y \*\ mP M A 
alllllUIdllllg piULCill ^/W^F J, IIirvTN.r\. 


*fC-lZ 


4188 


I U /OJ'r 


Jr.cnspum paii gene 


u.z^ 


4189 


£u /o /Uo 


n.sapienb iiuw-surteci ciuoniusuine o riiiiuiii iragnient, □Lopni'tniz 


ICr-Zo 


4190 




ureapidsiim urediyiicuiii accuuii *t*t ui j> ui me cuinpieie genuine 


7 1 


4191 


AF247729 


Oncorhynchus mykiss glucose transporter IB mRNA, partial cds 


1.8 


4192 


z> / zyzy 


O.VJCi CVIbldC wlUUIUUaUIUC Vll ICdUIIlg Ildine v-/xVT I vJxv l^r^r W 




4193 


JVlo / / J*t 


nuniaii biinpic repcdi puiymuipiiiaiii 


0.081 


4194 


AT ft7 IRftQ 


O.UU111UC IflllUlllUaUlUC l WUaltllU mJAo 


0.026 


4195 


ni 7ji7n 


Iviil IIUVLN/A. IUI VJUO 1 Z / piUlClIl, lA/lIipiCIC vUb 


3.7 


4196 


JVlOUJ .jZ 


jviuusc icaiuoiciuiic lod^iyuxuxyidsc type u ^lodon-tij, exon i 


A 1 


4197 


vv/f nfi7i^n 
yuvi_uu / jou 


riomo sdpienb nepdiucyie nuciear iacior j, dipim i^riiNr js\), iiu\±sr\. 


0 9S 


4198 


Z36064 


S.cerevisiae chromosome II reading frame ORF YBR195c 


0.52 


4199 




nonio bapieiiD cl/in/\ sti^j xjoy/ iio, cione in i ruiuui /uo 


0 49 


4200 


U48298 


Dictyostelium discoideum rtoA gene, complete cds 


0.68 


4201 




Fteicrouonius iiaiicibci iwr ueia gene 




4202 


AT3flA1 n7** 


xiuiiiiuus iiipuiub uiN/\, loo-zoo liiiergem^ apdeer, complete aeu^uciiLoC 


0 000R 


4203 


AF1 18226 


Hordeum vulgare high pi alpha-glucosidase ( AGL97) gene, complete 
cds 


L8 


4204 


APT 1 SROJ.O 


Platybrachys decemmacula 16S ribosomal RNA gene, partial 

aeUUCIlCC, lIUlUClHJlLUlIdl gCllC IVJi llilLUVrllUIlUIldl piuuuwi 


0.077 


4205 


UOjojj 


urosopmia viniis aecapeniapiegic protein (oppj gene, complete cos, 

IldpiUllloUlliCieilL IC^lUil 


0.2 


4206 




Maoricicada cassiope isolate TB-MI-004 tRNA-Asp gene, complete 
seQuence, a. i r^ase suDunit o gene, complete cos, ana /v i x ase suDimn 
o gene, paiiidi tua, iiuiubiiuiiuiidi gciicb iui iiuiuviiuiiuridi pruutiLio 




4207 


XM_037529 


Homo sapiens KIAA0410 gene product (KIAA0410), mRNA 


0.78 


4208 


AF330636 


Plant DNA excision vector pX6-GFP, complete sequence 


0.76 


4209 


AL080235 


Homo sapiens mRNA; cDNA DKFZp586E1621 (from clone 
DKFZp586E1621) 


0.007 


4210 


NM 021262 


Rattii^ norveeiens Acid nhosohatase 1 soluble TAcnl^ mRNA 


0.16 


4211 


Ml 1449 


Yeast mitochondrial ori2-ori7 region DNA with putative peptide 


0.25 


4212 


AF211134 


Carsonella ruddii natural-host Ctenarytaina longicauda 16S ribosomal 
RNA gene, partial sequence; 23S ribosomal RNA and 5S ribosomal 
RNA genes, complete sequence; transaldolase (tal) and alkyl 
hydroperoxide reductase small subunit (ahpC) genes, complete> 


0.074 


4213 


AF157293 


Sporodiniella umbellata translation elongation factor 1 -alpha (EF- 
1 alpha) gene, partial cds 


0.77 
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CTT/~\ TT"\ 


ACCESSN 


DESCRIP 


P VALUE 


4214 


AL390767 


Human DNA sequence from clone RP1-68P15 on chromosome llpl3- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


0.004 


4215 


XMJ)04579 


Homo sapiens HGC6.1.1 protein (HGC6.1.1), mRNA 


4E-08 


4216 


XMJ)43964 


Homo sapiens similar to KIAA0220 protein (H. sapiens) (LOC92263), 
mRNA 


5E-40 




AL390767 


Human DNA sequence from clone RP1-68P15 on chromosome llpl3- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


0.004 




AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


0.088 




AY023560 


Oryza sativa microsatellite MRG5885 containing (TTA)X8, closest to 
marker R2561, genomic sequence 


0.026 




XM 050088 


Homo sapiens hepatocellular carcinoma-associated antigen 127 
(HCA127), mRNA 


0.007 


A 11 1 

4z21 


AF131999 


Plasmodium falciparum putative erythrocyte binding protein EBL-1 
(ebl-1) gene, partial cds 


0.68 


4222 


AF008434 


Solanum fendleri farnesyl-protein transferase beta subunit gene, 
intron 10 


0.21 


4223 


AF367267 


Arabidopsis thaliana AT3g26020/MPEl 1_17 mRNA, complete cds 


0.53 


4224 


XM_006706 


Homo sapiens primase, polypeptide 1 (49kD) (PRIM1), mRNA 


7E-12 


4225 


AK004675 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200009120, full insert sequence 


0.017 


4226 


AB026660 


Arabidopsis thaliana genomic DNA, chromosome 5, BAC 
clone:T20O7 


0.23 


4227 


AF272387 


Homo sapiens leucine zipper nuclear factor (BLZF1) gene, exon 6 


5E-09 


4228 


L08897 


Mycoplasma gallisepticum (strain A5969) 16S-, 23S-, 5S ribosomal 
RNA (rrs A, rrlA, rrfA) genes 


0.073 


422V 


Z26314 


P.falciparum gene for STARP antigen 


0.008 


42JU 


NC_001942 


Mycoplasma arthritidis bacteriophage MAV1, complete genome 


0.18 


4231 


Y18277 


Gallus gailus mRNA for neurobeachin, partial 


3 


4232 


XM_047009 


Homo sapiens pleomorphic adenoma gene-like 2 (PLAGL2), mRNA 


1.8 


42Jj 


X84729 


S.cereale repeated DNA sequence 


0.21 


4234 


AL513132 


Human DNA sequence from clone RP1 1-460C6 on chromosome 9, 
complete sequence [Homo sapiens] 


0 


/I OK 

42J j 


Z71694 


E.herklotsi mitochondrial genes for NADH Dehydroenase subunit 5 
and 6 


0.6 


4236 


Z33215 


M.capricolum DNA for CONTIG MC294 


0.54 


4237 


X86012 


Human DNA sequence from intron 22 of the factor VIII gene, Xq28. 
Contains the end of a 9.5kb repeated region, int22h-l, involved in 
many cases of haemophilia 


0.0000005 


4238 


AF174355 


Praon barbatum 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.026 
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NO 
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4239 


AY039607 


Arabidopsis thaliana AT3g52380/F22O6_240 mRNA, complete cds 


6.1 


4240 


AJ223385 


Fowlpox virus strain HP-440 DNA, isolate FP9, 14.6 kb fragment 


0.000004 


4241 


AF277282 


Daphnia cristata country Lithuania 12S ribosomal RNA gene, partial 
sequence 


1.9 


4242 


NM 023850 


Mus musculus carbohydrate (keratan sulfate Gal-6) sulfotransferase 1 
(Chstl), mRNA 


0.59 


4243 


AJ238599 


Lepilemur septentrionalis genomic fragment, RAPD with primer 
operon OPH4, clone Al 


0.2 


4244 


XM_051747 


Homo sapiens eukaryotic translation initiation factor 4 gamma, 2 
(EIF4G2), mRNA 


3E-58 


4245 


M63376 


Human TRPM-2 protein gene, exons 1,2 and 3 


0.7 


4246 


AF367267 


Arabidopsis thaliana AT3g26020/MPEl 117 mRNA, complete cds 


0.59 


4247 


AJ294714 


Pichia anomala leu2 gene for beta-isopropylmalate dehydrogenase 


0.17 


4248 


AF320616 


Mus musculus amnionless precursor protein (Amn) gene, partial cds 


0.57 


4249 


AL049609 


S.pombe chromosome III cosmid c297 


1.8 


4250 


AJ252313 


Homo sapiens genomic hybrid Rhesus box 


3.8 


4251 


XM_034693 


Homo sapiens ectodermal dysplasia 1, anhidrotic (EDI), mRNA 


0.2 


4252 


NC_001463 


Caprine arthritis-encephalitis virus, complete genome 


1.3 


4253 


AC018655 


Homo sapiens 12q BAC RP11-946G22 (Roswell Park Cancer Institute 
Human BAC Library) complete sequence 


5.9 


4254 


AF003519 


Paraligoneurus sp. 16S ribosomal RNA gene, partial sequence 


0.024 


4255 


AL049434 


Homo sapiens mRNA; cDNA DKFZp586M151 (from clone 
DKFZp586M151) 


0.0008 


4256 


U39321 


Triticum aestivum acetyl-CoA carboxylase gene, exons 1-30, complete 
cds 


0.22 


4257 


AB055106 


Oryza sativa V-ATPase B gene for vacuolar ATPase B subunit, 
complete cds 


1.7 


4258 


AF150091 


Rattus norvegicus small zinc finger-like protein (TIM 10) mRNA, 
complete cds 


0.24 


4259 


AF047677 


Homo sapiens dystrophin (DMD) gene, deletion breakpoints 1-3 in 
intron 5 


0.025 


4260 


Z81504 


Caenorhabditis elegans cosmid F15H9, complete sequence 


0.24 


4261 


AF226147 


HIV-1 isolate 630469c from USA protease (pol) gene, partial cds 


2 


4zo2 


AF270072 


Staphylococcus epidermidis strain SRI clone step. 1047b05 genomic 
sequence 


0.68 


4263 


U56088 


Human periodic tryptophan protein 2 (PWP2) gene, exons 3 to 14 


0.076 


4264 


X83692 


Brassica napus (tournefortii) mitochondrial genes for atp6, and 
ORF263 


2.1 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


496S 


AF243425 


Penicillium oxalicum vacuolar serine protease mRNA, complete cds 


0.63 




NM_024381 


Rattus norvegicus ATP-stimulated glucocorticoid-receptor 
tianslocaton promoter (Gyk), mRNA 


0.68 




AF098919 


Gallus gallus alpha-globin gene domain 5* region 


2.3 


4268 


AJ297909 


Hypocrea jecorina rho3 gene, exons 1-5 


0.23 


4269 


U67528 


Methanococcus jannaschii section 70 of 150 of the complete genome 


0.029 


IK) 


AF104920 


Arabidopsis thaliana BAC T9E19 


0.026 


*r£, 1 1 


AF137394 


Euperipatoides rowelli elongation factor- 1 alpha mRNA, partial cds 


0.23 


*tZ / Z 


AF243087 


Homo sapiens intrinsic factor-vitamin B12 receptor (CUBN) gene, 
exons 11, 12, and 13 


0 


4273 


AF264703 


Astyanax mexicanus retinal homeobox protein (Rxl) mRNA, 
complete cds 


2.2 




AJ3 11054 


Chironomus pallidivittatus 4L ORF for putative recombinase 
subteiocentric clone, Cp5.5 


0.025 


497S 


AF267209 


Candidatus Carsonella ruddii natural-host Calophya schini ATP 
synthase alpha subunit (atpA) gene, partial cds; ATP synthase gamma 
subunit (atpG) gene, complete cds; and ATP synthase beta subunit 
(atpD) gene, partial cds 


0.0008 


497^ 


Z17148 


H. sapiens (Dl 1S935) DNA segment containing (CA) repeat; clone 
AFM254zb9; single read 


0.008 


4977 


AF175434 


Gallus gallus T cell receptor delta chain (TCRD) pseudogene, partial 
sequence 


2.3 


4278 


AK011996 


Mus musculus 10 days embryo cDNA, RUCEN full-length enriched 
library, clone:2610306H15, full insert sequence 


0.49 


4279 


AF269490 


Staphylococcus epidermidis strain SRI clone step. 1005h04 genomic 
sequence 


0.087 




XM_050118 


Homo sapiens hephaestin (HEPH), mRNA 


2E-75 


49&1 


AF232828 


Mus musculus ventral neuron-specific protein 1 NOVA1 (Noval) 
mRNA, partial cds 


0.008 


4989 


XM_029168 


Homo sapiens hypothetical gene supported by AK023 162 
(LOC90120), mRNA 


0.53 




L31645 


Helianthus annuus ribosomal protein S3a mRNA, complete cds 


0.67 




XM_012756 


Homo sapiens transcription factor 4 (TCF4), mRNA 


0.22 


49RS 


AK021049 


Mus musculus adult male corpus striatum cDNA, RIKEN full-length 
enriched library, clone: C030006F08, full insert sequence 


0.006 


49 


AB020763 


Ptychodera flava mRNA for Paxl9, complete cds 


0.26 


4987 


XM_047401 


Homo sapiens G protein-coupled receptor 51 (GPR51), mRNA 


0.25 


4288 


AF250952 


Osmia satoi 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.24 


4289 


AF267985 


Homo sapiens 15 kDa selenoprotein gene, exon 4 


0.024 


4290 


AJ409503 


Mus musculus RNA binding site for Dazl protein, clone jd8 


0.029 


4291 


AC009304 


Homo sapiens clone NH01 1 1109, complete sequence 


0.026 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCbSSN 


DESCRIP 


P VALUE 


4292 


XM 050269 


Homo sapiens solute carrier family I (glial high affinity glutamate 
transporter), member 3 (SLC1A3), mRNA 


0.21 


4293 


AK004286 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 1110057K12, full insert sequence 


2.1 


4294 


NC 002095 


Chlorobium limicola plasmid pCLl, complete sequence 


0.24 


4295 


AC006678 


Caenorhabditis elegans cosmid R05G9, complete sequence 


2.1 


4296 


AF2 17235 


Staphylococcus aureus pathogenicity island SaPIbov, complete 
sequence 


0.009 


4297 


VTl/ /"VI *^f\ A C 

XM 012045 


Homo sapiens hypothetical protein FLJ13074 (FLJ13074), mRNA 


0.68 


4298 


NM 022087 


Homo sapiens hypothetical protein FLJ21634 (FLJ21634), mRNA 


e-158 


4299 


XM 010678 


Homo sapiens similar to zinc finger protein 258 (H. sapiens) 
(LOC65323), mRNA 


7E-63 


4300 


AJ3 11847 


Nicotiana tabacum chloroplast mRNA for FtsZ-like protein (ftsZ 
gene), clone FtsZ2-2 


0.55 


4301 


Z24366 


H. sapiens (D18S480) DNA segment containing (CA) repeat; clone 
AFM320yc9; single read 


8E-12 


4302 


AK003805 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
library, clone: 11 100 19B22, full insert sequence 


0.009 


4303 


AB047606 


Macaca fascicularis brain cDNA, clone:QnpA-10881 


0 


4304 


AK000493 


Homo sapiens cDNA FLJ20486 fis, clone KAT08039 


0.24 


4305 


AF375597 


Mus musculus medium and short chain L-3 -hydroxyacyl-Coenzyme A 
dehydrogenase (Mschad) gene, exons 2 through 8, and complete cds; 
nuclear gene for mitochondrial product 


0.75 


4306 


AP000301 


Homo sapiens genomic DNA, chromosome 21 q22.1, D21S226-AML 
region, clone:D16B8, complete sequence 


0.081 


4307 


AKO 19632 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone :4930455M05, full insert sequence 


0.068 


4308 


X94556 


R.cingulata DNA for internal transcribed spacer ITS2 of rDNA unit 


0.079 


4310 


AY027174 


Arabidopsis thaliana sequence flanking 3* end of Ds-GeneTrap 
insertion from line GT2850 


0.2 


4311 


AC092063 


Homo sapiens clone RP1 1-745H7, complete sequence 


0.14 


4312 


AJ297034 


Homo sapiens SDS-stable vimentin-bound DNA fragment 

T TT7T7 1 m JTfc JTI O 

HEF19VIM18 


e-117 


4313 


Z82088 


Caenorhabditis elegans cosmid ZK256, complete sequence 


0.015 


4314 


AC006709 


Caenorhabditis elegans cosmid Y119C1A, complete sequence 


0.17 


4315 


AF016620 


ivius muscuius rvo-o neurai cen aonesion moiecuie long xorm 
(RNCAM) mRNA, partial cds 


6.2 


4316 


AK013121 


Mus musculus 10, 1 1 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810421E14, full insert sequence 


1.9 


4317 


XM 033379 


Homo sapiens KIAA1607 protein (KIAA1607), mRNA 


4E-09 


4318 


AL355864 


Human DNA sequence from clone RP4-803 A2 on chromosome 1, 
complete sequence [Homo sapiens] 


1.4 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4319 


AE001422 


Plasmodium falciparum chromosome 2, section 59 of 73 of the 
complete sequence 


0.028 


4320 


AL157437 


Homo sapiens mRNA; cDNA DKFZp434E1512 (from clone 
DKFZp434E1512); partial cds 


4.9 


4321 


U31929 


Human orphan nuclear receptor (DAX1) gene, complete cds 


5.4 


4322 


AF338875 


Hylobates moloch NAN03 cytochrome b gene, partial cds; and tRNA- 
Thr, tRNA-Pro, and tRNA-Phe genes, complete sequence; 
mitochondrial genes for mitochondrial products 


0.6 


4323 


X04502 


Human SLPI gene for secretory leukocyte protease inhibitor 


2 


4324 


U07018 


Human immunodeficiency virus type 1 SH803 (env) gene, partial cds 


0.64 


4325 


AF046143 


Homo sapiens chromosome Xp22 958-13 


0.023 


4326 


U17081 


Human fatty acid binding protein (FABP3) gene, complete cds 


6 


4327 


AJ249365 


Plasmodium falciparum partial mRNA for delta-aminolevulinic acid 
dehydratase 


0.009 


4328 


AF090936 


Homo sapiens clone HQ0582 


0.006 


4329 


X97970 


Athaliana mRNA for RNA helicase 


0.079 


4330 


XM_047794 


Homo sapiens KIAA1641 protein (KIAA1641), mRNA 


e-156 


4331 


AF334172 


Zea mays clone B7 chromosome 4 centromeric region 


3.9 


4332 


AJ401041 


Human immunodeficiency virus type 1 proviral partial gpl60 gene for 
envelope protein, strain J_97DC.KTB147 


0.64 


4333 


XM 038175 


Homo sapiens KIAA0872 protein (KIAA0872), mRNA 


0.65 


4334 


Y16473 


Chlamydomonas reinhardtii chloroplast DNA for rpsl8 gene, and 
orf570 


0.028 


4335 


AF190381 


Eucalyptus tetragona photosystem Q(B) protein (psbA) pseudogene 
and tRNA-His (trnH) gene, partial sequence; chloroplast genes for 
chloroplast products 


0.008 


4336 


AE006276 


Lactococcus lactis subsp. lactis IL1403 section 38 of 218 of the 
complete genome 


0.22 


4337 


AK020646 


Mus musculus adult male urinary bladder cDNA, RIKEN full-length 
enriched library, clone:9530077A04, full insert sequence 


2.3 


4338 


Z69666 


Hiunan DNA sequence from cosmid 24F8 from a contig from the tip 
of the short arm of chromosome 16, spanning 2Mb of 16pl3.3. 
Contains ESTs, repeat polymorphism and CpG island 


0.018 


4339 


AF017254 


West Nile virus nonstructural protein NS5 (NS5) gene, partial cds 


0.73 


4341 


NM 008645 


Mus musculus murinoglobulin 1 (Mugl), mRNA 


0.00009 


4342 


AF1 14927 


Saccharomyces pastorianus CBS1538 small subunit ribosomal RNA 
gene, mitochondrial gene for mitochondrial RNA, complete sequence 


0.25 


4343 


AL451057 


Human DNA sequence from clone CTD-2010K20 on chromosome 10, 
complete sequence [Homo sapiens] 


0.17 


4344 


AK022875 


Homo sapiens cDNA FLJ12813 fis, clone NT2RP2002503, weakly 
similar to ZINC FINGER PROTEIN 45 


1.7 


4345 


D10910 


Arabidopsis thaliana Atpk7 gene for serine/threonine protein kinase, 
complete cds 


0.61 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4346 


X53096 


S.aureus genes encoding Sau96I DNA methyltransferase and Sau96I 
restriction endonuclease 


0.2 


4347 


M57855 


Human protein S-beta pseudogene, exons 5 and 6 


0.2 


4348 


Z73523 


S.cerevisiae chromosome XVI reading frame ORF YPL167c 


0.065 


4349 


U67562 


Methanococcus jannaschii section 104 of 150 of the complete genome 


0.54 


4350 


Y09047 


M.musculus pxF gene, strain 129VJ 


0.002 


4351 


XM 027332 


Homo sapiens ribosomal protein L36 (RPL36), mRNA 


0.0001 


4352 


AF106029 


Thomomys mazama clone 12-8 microsatellite sequence 


0.25 


4353 


AL137428 


Homo sapiens mRNA; cDNA DKFZp761N1323 (from clone 
DKFZp761N1323) 


0.028 


4354 


L81915 


Homo sapiens (subclone l_g8 from BAC H76) DNA sequence, 
complete sequence 


0.63 


4355 


J01291 


soybean 7s seed storage protein alpha subunit mrna (b), from gmc- 
alpha' -alpha 236 


3.7 


4356 


NM_006649 


Homo sapiens serologically defined colon cancer antigen 16 
(SDCCAG16), mRNA 


0.69 


4357 


AL137635 


Homo sapiens mRNA; cDNA DKFZp434E175 (from clone 
DKFZp434E175) 


6.1 


4358 


Y14422 


M.musculus DNA for retinal protein 


0.23 


4359 


D32056 


Human gene for 2-oxoglutarate dehydrogenase, exon 1 sequence 


6.4 


4360 


D14337 


Yeast pckl+ gene for protein kinase, complete cds 


0.085 


4361 


J02756 


Mouse band 3 anion exchange protein gene, complete cds 


1.9 


4362 


XM 048747 


Homo sapiens KIAA1223 protein (KIAA1223), mRNA 


0.69 


4363 


M97514 


Saccharomyces douglasii mitochondrial cytochrome c oxidase subunit 
I (COXI) gene, complete cds 


0.19 


4364 


D10910 


Arabidopsis thaliana Atpk7 gene for serine/threonine protein kinase, 
complete cds 


0.53 


4365 


AK015420 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930448I18, full insert sequence 


0.003 


4366 


U82487 


Branchiostoma floridae engrailed protein (AmphiEn) mRNA, 
complete cds 


0.22 


4367 


AB052798 


Feline foamy virus env gene for envelope protein, complete cds, 
clone:HFFV 


0.08 


4368 


AY027174 


Arabidopsis thaliana sequence flanking 3' end of Ds-GeneTrap 
insertion from line GT2850 


0.25 


4369 


AJ298684 


Phylloxera sp. MBLM2 mitochondrial ATP6 gene for ATP synthase A 
chain subunit 6 


0.67 


4370 


NM_0 13696 


Mus musculus thyrotropin releasing hormone receptor (Trhr), mRNA 


0.000003 


All 1 


AF176830 


Dictyostelium discoideum polyphosphate kinase (PPK) gene, complete 
cds 


0.83 


4372 


AB036665 


Bacteriophage WO genes, ISW1 inserted sequence, partial and 
complete cds 


0.17 


4373 


AJ310187 


Prevotella albensis dpp-IV gene for dipeptidyl peptidase IV 


1.6 


4374 


L77040 


Homo sapiens (subclone 8_cl 1 from PI H22) DNA sequence 


0.025 
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Table 3A Nearest Neighbor (BlastN vs. Genbanto 


SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


4375 


AJ277162 


Beta vulgaris mRNA for putative Cdc2-related protein kinase CRK2 


0.009 


TJ / \J 


K02781 


Mouse PND gene encoding atrial natriuretic factor, complete cds 


0.24 


4177 


AL1 10166 


Homo sapiens mRNA; cDNA DKFZp586M1424 (from clone 
DKFZp586M1424) 


0.69 


4378 


AE006400 


Lactococcus lactis subsp. lactis IL1403 section 162 of 218 of the 
complete genome 


6.4 


4379 


AF3 10887 


Dictyostelium discoideum RacB (racB) gene, partial cds; and 
unknown gene 


0.23 


4380 


AY023357 


Oryza sativa microsatellite MRG5682 containing (TAA)X20, 
genomic sequence 


2.3 


438T 


AL133038 


Homo sapiens mRNA; cDNA DKFZp434D1835 (from clone 
DKFZp434D1835) 


0.00004 




M37760 


Mouse serine 2 ultra high sulfur protein gene, complete cds 


0.083 


4383 


AF151698 


Agrobacterium tumefaciens cryptic plasmid pAtC58 transcriptional 
repressor, GlnH homolog, GlnP/GlnQ homolog, putative 
oxidoreductase, and AgaE homolog genes, complete cds 


0.25 


4384 


X59370 


X.laevis POMC-A gene for proopiomelanocortin-A 


0.007 


4385 


AF244933 


Oryctolagus cuniculus interferon gamma precursor, mRNA, complete 
cds 


0.66 


4386 


Z69919 


Human DNA sequence from cosmid 91K3, Huntington's Disease 
Region, chromosome 4pl6.3 contains CpG island 


0.077 


4387 


AK027136 


Homo sapiens cDNA: FLJ23483 fis, clone KAIA04052 


1.7 


4388 


AK027136 


Homo sapiens cDNA: FLJ23483 fis, clone KAIA04052 


6.6 


4389 


AL589705 


Human DNA sequence from clone RP1 1-64M7 on chromosome 6, 
complete sequence [Homo sapiens] 


0.63 


4390 


AF035192 


Campylobacter rectus S-layer-RTX protein (csxB) gene, complete cds; 
tRNA-Arg gene, complete sequence; and putative histidine kinase 
gene, partial cds 


0.34 


4391 


AK021269 


Mus musculus 12 days embryo spinal cord cDNA, RDCEN full-length 
enriched library, clone:C530007A02, full insert sequence 


0.69 


4397 


XM_030404 


Homo sapiens acid phosphatase 1, soluble (ACPI), mRNA 


0.007 


4393 


BC006633 


Mus musculus, Flt3 interacting zinc finger protein 1, clone 
MGC7201 IMAGE:3482191, mRNA, complete cds 


2 


4394 


AK009564 


Mus musculus adult male tongue cDNA, RDCEN full-length enriched 
library, clone:2310032D21, full insert sequence 


1.6 


4395 


L04802 


Silkmoth apoptosis preventing protein (p35) gene sequence 


0.21 


4396 


Ad026599 


Neogerris parvulus mitochondrial gene for 16S ribosomal RNA, 
partial sequence 


0.027 


4397 


] 

BC005581 ] 


Vius musculus, RIKEN cDNA 2310022K15 gene, clone MGC: 1 1736 
[MAGE:3968615, mRNA, complete cds 


0.22 


4398 


] 

XM_033374 


flomo sapiens myosin, heavy polypeptide 7, cardiac muscle, beta 
[MYH7), mRNA 


0.23 


4399 


U31534 ] 


sla Vista virus MC-SB-1 nucleocapsid protein mRNA, complete cds 


0.54 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 




DESCRIP 


P VALUE 


4400 


XM 005318 


Homo sapiens hypothetical protein FLJ1 1 125 (FLJ1 1 125), mRNA 


0.44 


4401 


u /oo /u 


Arabidopsis thaliana recA-like protein (AtDMCl) gene, complete cds 


0.051 


4402 


a vnno/i A 9 
AJs.UUV44o 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone.2310021K08, full insert sequence 


1.6 


4403 




Phytophthora infestans isolate 94-52 NADH dehydrogenase subunit 4 
gene, partial cds; mitochondrial gene for mitochondrial product 


0.16 


4404 


AC084160 


Caenorhabditis elegans cosmid Y73B3B, complete sequence 


0.21 


4405 


AF330120 


Homo sapiens voltage-gated sodium channel type III alpha subunit 
(SCN3A) gene, exon 4 


2.4 


4406 




Homo sapiens cDNA FLJ 12073 fis, clone HEMBB 1002387 


0.49 


4407 


AT332583 


Homo sapiens stratum corneum chymotryptic enzyme (KLK7) gene, 
complete cds, alternative transcripts 


1.7 


4408 


XM_043049 


Homo sapiens hypothetical gene supported by AL080215 
(LOC92 138), mRNA 


0.7 


4409 


AK0 17682 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730465I09, full insert sequence 


2.1 


4410 


AJ409162 


Solanum ochranthum LINE retrotransposon within partial ure gene 
for urease, exons 6-8 


0.025 


4411 


A 17 1 OO CO/I 

AF183584 


Papio hamadryas tyrosinase gene, exon 2 


0.24 


4412 


AK010138 


Mus musculus adult male tongue cDNA, RIKEN full-length enriched 
library, clone:2310069P03, full insert sequence 


0.56 


4413 


aci ni/»i 

AT112141 


Oryzias latipes homeodomain transcription factor (eng2) mRNA, 
complete cds 


0.021 


4414 


AK022458 


Homo sapiens cDNA FLJ12396 fis, clone MAMMA1002758 


4.9 


4415 




Homo sapiens cDNA FLJ 14097 fis, clone MAMMA 1000760 


0.00007 


4416 




Macaca fascicularis brain cDNA clone: QtrA- 10686, full insert 
sequence 


2.1 


4417 


AF063864 


Schizosaccharomyces pombe essential nuclear protein Mcm3p 
(mcm3+) gene, complete cds 


0.21 


4418 


Ar23328o 


Drosophila melanogaster WDS (wds) and egghead (egh) genes, 
complete cds 


1.3 


4419 


AE003106 


Drosophila melanogaster genomic scaffold 142000013386018, 
complete sequence 


0.007 


4421 


AB001919 


Oryza sativa DNA for phosphoiipase D, complete cds 


0.23 


4422 


AF2 16290 


Vfus nuisculim heterochmmfltin nrntpin 1 alnhsi mPXT A /-nmrilRt** r*Hc 


u.uu 


4423 


AF028828 


Homo sapiens Tax interaction protein 43 mRNA, partial cds 


0.18 


4424 


AF011446 


Mus musculus granzyme K gene, complete cds 


1.9 


4425 


AF200375 


Ceratosolen bisulcatus cytochrome oxidase subunit I (COI) gene, 
partial cds; tRNA-Leu gene, complete sequence; and cytochrome 
Dxidase subunit II (COII) gene gene, partial cds; mitochondrial genes 
for mitochondrial products 


0.68 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4426 


AF328031 


Zygoballus ruflpes NADH dehydrogenase subunit I (ND1) gene, 
partial cds; mitochondrial gene for mitochondrial product 


1.7 


4427 


AT/0770J 


Alouatta stramineus cytochrome b gene, partial cds; mitochondrial 
gene ior miiocnonanai product 


0.72 


4428 




d.cerevisiae cnromosome aii reading irame UKr YLROolw 


0.19 


4429 


IN1YI Ulo/UO 


Homo sapiens hypothetical protein DKFZp762M115 (KIAA1630), 

m P\TA 


1 c jo 

1H-4© 


4430 




Dictyostelium discoideum non-LTR retrotransposon TRE5-B, 
poryproiein vg<*gj ana group-specinc ami gen (poi) genes, complete cos 


U. / 4 


4431 


AJ276991 


Mus musculus partial mRNA for ClpX protein 


0.079 


4432 


AB047240 


riuman enaogenous rexrovirus ricKv-K>(iij din A, complete sequence 
and flanking region 


0.13 


4433 




wAiitiitifi gidur did EfIxvj i i gene, complete cos 


u.uz / 


4434 




Mouse Ins DNA; cellular component that mediates integration and 

CXOlMUIl Ul pUiyUIIld VII lib -L/rN/\ 


ft 151 


4435 


Uj 1 / oZ 


onaKe ^green nauuj g 1 g 1 or gene ior i a i a-oox oinuing protein 


1.3 


4436 




Mus musculus adult male testis cDNA, RJJCEN full-length enriched 
library, clone:4930480I16, full insert sequence 


O.ZO 


4437 


U36777 


Carassius auratus aldolase C mRNA, complete cds 


0.094 


4438 


AF036690 


Caenorhabditis elegans cosmid F09C1 1 


0.72 


4439 


NM_0 12750 


Rattus norvegicus Tyrosine kinase receptor ligand 2 (Retl2), mRNA 


0.63 


4440 


Ivl /40U0 


Brugia malayi myosin heavy chain gene, complete cds 


0.07 J 


4441 


YTVvff fVAOA^^ 
yUVl_0*l-ZtO O 


xiomo sapiens laji^o / 154 (iaji^o / io4j, mKN A 


O.OO 


4442 


AF335329 


Kallichroma tethys isopenicillin N synthase (pcbC) and alpha- 
aminoadipyl-cysteinyl-valine synthetase (pcbAB) genes, complete cds 


6.1 


4443 


XM_037269 


Homo sapiens region containing hypothetical protein FLJ 10549; 
hypothetical protein FLJ105 17 (LOC82252), mRNA 


0.26 


4444 


AT IjUU /y 


riomo sapiens cione rjLXAjj /© rssxjzojz rnivJN a, complete cos 


0.24 


4445 




Usvmst c o moti c -hill Inti inraW oT^M A nlnna 'f 1 lO(M7f\4 

riomo sapiens run lengin insen curs a cione Zj\j2ryr\}*\ 


A CI 

O.D1 


4446 




Schedorhinotermes sarawakensis 16S mitochondrial ribosomal RNA 
j^ciic, pdiudi act|uciiwe, iiuiucnuiiuridi gene ior nuiocnonanai prouuci 




4447 


AF170550 


Arabidopsis thaliana germin-like protein 7 (GLP7) gene, complete cds 


2 


4448 


AC084500 


OuPTirvrriJiriHiHc hri<? crest A r*r»cmiH C* 1 "Ifxl ^ rnmnlpfp cpnnpnpp 
Va&iiui iiauui ua unggoaG iajdiiuu VJlJUiJ, d/IHJJldC bdJUCil^C 




4449 


M23360 


Trypanosoma brucei cytochrome C mRNA, partial cds 


2.1 


4450 


AF236387 


Schizosaccharomyces pombe zinc finger protein Grtl (grtl) gene, 
complete cds 


0.076 


4453 


XM 010610 


Homo sapiens hypothetical protein (FLJ10287), mRNA 


0.027 


4454 


AY021908 


Oryza sativa microsatellite MRG4233 containing (TA)X42, closest to 
marker C563, genomic sequence 


0.025 


4455 


AK021675 


Homo sapiens cDNA FLJ11613 fis, clone HEMBA1004012 


0.073 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4456 


NM 030995 


Rattus norvegicus Microtubule-associated protein la (Mapla), mRNA 


1.8 


44 57 


AF015298 


Human herpesvirus 6 IE-2 protein (ie2hom) gene, complete cds 


0.058 


4458 


XM_0 11297 


Homo sapiens hypothetical protein FLB3342 (FLB3342), mRNA 


0.0000009 


4459 


AE006714 


Sulfolobus solfataricus section 73 of 272 of the complete genome 


0.002 




AE001434 


Plasmodium falciparum chromosome 2, section 71 of 73 of the 
complete sequence 


0.004 


4461 


AF068270 


Rachiplusia ou nuclear polyhedrosis virus protein-tyrosine 
phosphatase (ptp), late expression factor-2 (lef-2), occlusion matrix 
protein (polh), capsid-associated protein, and protein kinase (pkl) 
genes, complete cds; and unknown genes 


0.027 


4467 


AK015582 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930477O15, full insert sequence 


0.08 


4463 


XM_039778 


Homo sapiens HSPC047 protein (HSPC047), mRNA 


0 


4464 


S73580 


putative polymerase... capsid protein [potato aucuba mosaic potexvirus 
PAMV, Genomic RNA Complete, 6 genes, 7059 nt] 


0.071 


4465 


XM_016858 


Homo sapiens LOC87857 (LOC87857), mRNA 


3E-13 


4466 


AE004427 


Vibrio cholerae chromosome II, section 84 of 93 of the complete 
chromosome 


0.69 


4467 


AK025346 


Homo sapiens cDNA: FLJ21693 fls, clone COL09609 


4.9 


4468 


U89926 


Drosophila melanogaster cut gene, partial sequence 


0.22 


44AQ 


X57171 


D.caryophyllus CARSR12 gene 


2 


447ft 


L36434 


Mus Musculus basic domain/leucine zipper transcription factor 
mRNA, 3' end of cds 


2 


4471 


AF1 15469 


Arabidopsis thaliana cultivar Landsberg erecta Arac2 (Arac2) gene, 
complete cds 


1.9 


4472 


AB063169 


Petunia x hybrida tazl gene for tapetum-specific zinc finger protein 1, 
promoter region and partial cds 


0.64 


4477 


L41691 


Bos taurus (clones L6, C15, C12, C8, C9, C17, C19, C12, C18, C5, 
C6, C3, C13, CIO, C47) mRNA, 3' end of cds 


0.47 


4474 


U61463 


Human myosin Vila (MY07A) gene, 5' exon 36 


0.0007 


4475 


NM_008992 


Mus musculus ATP-binding cassette, sub-family D (ALD), member 4 
(Abcd4), mRNA 


1.9 


4476 


XM 050942 


Homo sapiens ribosomal protein L6 (RPL6), mRNA 


0.00009 


4477 


AK005146 


Mus musculus adult male cerebellum cDNA, RIKEN full-length 
enriched library, clone: 1500004F14, full insert sequence 


0.71 


4478 


AF345089 


Hepatitis C virus isolate SCpreSC2cl4 nonfunctional polyprotein 
gene, partial sequence 


0.71 


4479 


AK008812 


Mus musculus adult male stomach cDNA, RIKEN full-length 
enriched library, clone:2210403E17, full insert sequence 


6.4 


4480 


XM_006983 


Homo sapiens A kinase (PRKA) anchor protein 3 (AKAP3), mRNA 


1.6 


4481 


X06438 


Soybean leghaemoglobin gene lba promoter region 


0.19 


4482 


AF020051 


Blackcurrant reversion virus RNA2 polyprotein mRNA, complete cds 


0.71 
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Table 3A Nearest Neighbor (BlastN vs. Genbanto 


SEQIT 
NO 


) 


DESCRIP 


P VALUE 


4483 


YU OS 1 SAO 


Homo sapiens peroxisome proliferative activated receptor, gamma 

/"Dp A O i~Z\ mD XT A 


0.6 


4484 


nDl/ZJfo 1 


Cyprinus carpio mRNA for mitogen activated protein (MAP) kinase 
pjo, complete cos 


6.3 


4485 




Homo sapiens partial SIP1 gene for survival interacting protein 1, 
exons j-4 


2.2 


4486 


.A1V1 UJ7707 


Homo sapiens microfibrillar-associated protein 3 (MFAP3), mRNA 


0.69 


4487 




Homo sapiens similar to KIAA1473 protein (H. sapiens) (LOC92830), 

mDXTA 
nuvJN A 


6.7 


4488 


AB042411 


Homo sapiens strg gene for striatum-specific G potein-coupled 
receptor, complete cds 


2 


4489 


-AJ.V1 Uj jZZ 1 


Homo sapiens ancient ubiquitous protein 1 (AUP1), mRNA 


0.16 


4490 


J_/l.*tU01 


Scluzosaccharomyces pombe mRNA for ORF 


0.015 


4491 


/VC(UUl*fZO 


Plasmodium falciparum chromosome 2, section 63 of 73 of the 
complete sequence 


0.074 


4492 




Rat leukocyte common antigen (L-CA) gene, exons 1 through 5 


0.16 


4493 


AF020726 


Tetrahymena thermophila phosphoglucomutase (PGM1) gene, 
complete cds 


0.058 


4494 


iVloUZ*f 1 


Caenorhabditis elegans unc-6 gene, complete cds 


2.1 


4495 


UO /ZJ*f 


riuman clone rib^.zy Alu-Ya5 sequence 


0.083 


4496 


AK000864 


Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 


0.00001 


4498 




Homo sapiens cDNA FLJ1433 1 fis, clone PLACE4000320 


0.064 


4499 


AF071754 


Mus musculus DNA cytosine methyltransferase mRNA 


0.21 


4500 


XM 047325 


Homo sapiens hypothetical protein LOC57187 (LOC57187), mRNA 


0 


4501 


.AJ.V1 U*f J*f J*t 


Homo sapiens villin-like (VILL), mRNA 


1.8 


4502 


rNivi_ujzo / j 


Homo sapiens hypothetical protein MGC10954 (MGC10954), mRNA 


0.0008 


4503 


AL589203 


Human DNA sequence from clone RP1 1-391F23 on chromosome 6, 
complete sequence [Homo sapiens] 


0.0003 


4504 


M86544 


Cow prostaglandin F synthetase II (PGFSII) mRNA, complete cds 


0.66 


4505 


nrjJ7 /oZ 


Homo sapiens clone IMAGE: 1871856, mRNA sequence 


0 


4506 




S.cerevisiae chromosome IV reading frame ORF YDL148c 


6.2 


4507 


Ap*ni 97^1 

ATU1Z /Jl 


Danio rerio olfactory receptor protein 2.4 mRNA, complete cds 


6.8 


4508 


AF1 84796 , 


Arabidopsis thaliana inhibitor tagged site ITS29 genomic sequence 


1 


4509 


] 

AE006187 j 


Pasteurella multocida PM70 section 1 54 of 704 nf th#» rnmniPtP 
genome 


1.8 


4510 


AJ272171 ] 


Podospora anserina degenerate gypsy-like retrotransposon Yeti 


0.074 


4511 


AK024656 ] 


Somo sapiens cDNA: FLJ21003 fis, clone CAE03685 


1E-90 


4512 


NCJ)02629 1 


Lithobius forficatus mitochondrion, complete genome 


0.066 


4513 


I 

X87205 i 


Vifascicularis mRNA for metalloprotease-Iike, disintegrin-likc 
)rotein, IVa 


4.9 
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# 



ACCESSN 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



UE3CK1P 

Homo sapiens CDC 10 (cell division cycle 10, S. cerevisiae 
(CDC 10), mRNA 



P VALUE 



4514 XM 011595 



homolog) 



5E-47 



4515 AK021537 



Homo sapiens cDNA FLJ11475 fls, clone HEMBA1001734, 
moderately similar to CADHERIN-1 1 PRECURSOR 



0.068 



4516 AB044877 



Equus caballus DNA, microsatellite TKY377 



0.076 



4517 AC079391 



Homo sapiens clone CTD-2243M11, complete sequence 



0.00001 



4518 AK000932 



Homo sapiens cDNA FLJ10070 fis, clone HEMBA1001581 



4519 AE001376 



Plasmodium falciparum chromosome 2, section 13 of 73 of the 
complete sequence 



0.24 



4520 XM 040095 



Homo sapiens hypothetical protein (DKFZP434G03 10), mRNA 



1.8 



4521 AF224317 



Gallus gallus Frizzled-7 (cFz-7) mRNA, complete cds 



1.5 



4522 AE001155 



Borrelia burgdorferi (section 41 of 70) of the complete genome 



4523 AJ313130 



Medicago truncatula transposon (CACTA type), clone 65K06 



4524 AF020187 



Amblyomma americanum ecdsyteroid receptor (AamEcRAl) mRNA, 
complete cds 



6.3 



4525 AP000249 



Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:B762Q15, complete sequence 



2E-19 



4526 AF218380 



Mus museums CBLN3 (Cbln3) gene, complete cds 



0.69 



4527 AP000601 



Arabidopsis thaliana genomic DNA, chromosome 3, PI clone: MJL14 



0.029 



4528 AE001140 



J " > » V-VM.*V. 

Borrelia burgdorferi (section 26 of 70) of the complete genome 



0.64 



4529 AL3 58852 



Human DNA sequence from clone RP1-3 17N9 on chromosome 6, 
complete sequence [Homo sapiens] 



2.2 



4530 NM 024596 



Homo sapiens hypothetical protein FLJ12847 (FLJ12847), mRNA 



0.82 



X12918 



Human Na+,K+ ATPase gene part of intron XVI (alpha III isoform) 



0.0003 



4532 AF298180 



Caenorhabditis elegans tropomyosin isoform IV mRNA, complete cds 



0.57 



M21590 



M.mycoides capri Gly-tRNA gene 



0.48 



4534 AF250227 



Homo sapiens adenylyl cyclase type VI gene, partial cds 



0.19 



Z36061 



cerevisiae chromosome II reading frame ORF YBR192w 



0.071 



4536 AF006510 



Endogone pisiformis 18S ribosomal RNA gene, partial sequence, 
internal transcribed spacer 1, 5.8S ribosomal RNA gene and internal 
transcribed spacer 2, complete sequence, and 26S ribosomal RNA 
gene, partial sequence 



0.08 



D88262 



Pisum sativum PsCHS5 gene for chalcone synthase, complete cds 



0.075 



L31504 



Neurospora crassa cytoplasmic dynein heavy chain (ro-1) gene, 
complete cds 



2.3 



S81605 



S100 beta =neural calcium binding protein beta subunit {5* region} 
[rats, Genomic, 2279 nt] 



0.66 



X95275 



P.falciparum complete gene map of plastid-like DNA (IR-A) 



0.23 



U18868 



Squalus acanthias multifunctional protein CAD mRNA, complete cds 



0.024 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



P VALUE 



4542 AF184835 Arabidopsis thaliana inhibitor tagged site ITS68 genomic sequence | 0,71 



4543 AL513350 



4544 
4545 



Z68107 
AK002184 



Human DNA sequence from clone RP1 1-509D8 on chromosome 9, 
complete sequence [Homo sapiens] 



4546 
4547 



ICaenorhabditis elegans cosmid F48C5, complete sequence 
Homo sapiens cDNA FLJU322 fis, clone PLACE101O329 



0.64 



U00025 I Caenorhabditis elegans cosmid PAR2, complete sequence 



0.71 



U39548 



[Meloidogyne artiellia mtcuticline-1 gene, 5' upstream region" 



0.46 



4548 AK024271 jHomo sapiens cDNA FL J 14209 fis, clone NT2RP3003 346 



0.64 



4549 AF248989 



Arabidopsis griffithiana chalcone synthase gene, promoter region and 
[partial cds 



4550 AF081208 



Monoclea gottschei subsp. gottschei tRNA-Thr/tRNA-Leu intergenic 
I spacer, partial chloroplast sequence 



4551 XM_002770 I Homo sapiens hypothetical protein (FL J 10996), mRNA 



4552 AF035672 



Mus musculus MHC class I related protein 1 (MR1) gene, complete 
cds 



4553 AF374466 



iMacrobrachium intermedium 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 



ru 



in 



4554 



S80559 



pdm-2-POU domain gene [Drosophila melanogaster, mRNA 2171 
nt] 



4555 AY029489 



Homo sapiens aminophospholipid-transporting ATPase (ATP IOC) 
I gene, exon 3 



4557 AJ276163 



Homo sapiens partial GDH gene for uridine diphospho-glucose 
I dehydrogenase, exons 8-11 



4558 NC_001566 Apis mellifera ligustica mitochondrion, complete genome 



4559 BC003453 



Mus musculus, calreticulin, clone MGC:6209 IMAGE:2655918, 
I mRNA, complete cds 



4560 
4561 



XM 048784 



AF1 19845 



[Homo sapiens mature T-cell proliferation 1 (MTCP1), mRNA" 
Homo sapiens PRO1304 mRNA, complete cds 
Plasmodium berehei elutathinnp rpHnrtacp 



4564 AF145282 



I Trichomonas vaginalis calcium motive P-type ATPase (CA-2) gene, 
| partial cds 



4568 Y08926 jP.falciparum mRNA for AARP1 protein, partial 



4571 AL358951 



(Human DNA sequence from clone RP3-456L16 on chromosome 6, 
[complete sequence [Homo sapiens] 



4572 | AJOO 1 164 |Drosophila melanogaster additional sex combs cDNA sequence 



0.0003 



0.3 



0.009 



5.8 



1.1 



5.9 



0.009 



1E-89 



iMus musculus RIKEN cDNA 4930433D19 gene (4930433D19Rik) 
4556 I NM 026255 mRNA 



2.2 



0.0002 



0.19 



0.62 



0.01 



4562 [ AF251477 [Plasmodium berghei glutathione reductase gene, partial cds | 0.053 
[Homo sapiens ATPase, Na+/K+ transporting, beta 3 polypeptide 

4563 | XM 018443 |(ATP1B3), mRNA 



0.11 



4565 AF280548 Homo sapiens neuropilin-1 (NRP1) gene, exons 1, 2, and partial cds 2.1 



4566 NC_002253 [ Buchnera sp. APS plasmid pLeu, complete seq uence 

4567 AJ238613 |Gallus gallus mRNA for teneurin-1 



0,24 
0.61 



0.21 



4569 Z86105 [A.thermophilum eel A gene and manA pseudogene 

4570 I AL583902 |s,pombe chromosome I pi p32A8 



0.69 
1.1 



0.072 



0.1 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQIE 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4573 


AF301004 


dcrustosoma mansom tnioredoxm peroxidase 3 (TPx3) gene, complete 
cds 


0.19 


4574 




^venopus laevis nomeooox protein bix4.2 (Six4.2) gene, partial cds 


2 


4575 


BC007465 


xitmiu aapiCUb, Clone UVLAvJEf. jo/oj4 /, ITlKiN A 


0.00002 


4576 


AL589824 


tiuman jjjn a sequence from clone RP1 1-76G1 1 on chromosome X, 
complete sequence [Homo sapiens] 


0.17 


4577 


AF151171 


Ornithorhynchus anatinus mannose 6-phosphate/insulin-like growth 
idiaui z. icvA;piur jiviop/igizri gene, exons z ana J ana partial cds 


0.67 


4578 


AK000883 


Homo sapiens cDNA FLJ10021 fis, clone HEMBA1000568 


0 


4579 


XM_016630 


Homo sapiens KIAA1046 protein (KIAA1046), mRNA 


1.1 


4580 


NM_0 15906 


nomo sapiens transcriptional intermediary factor 1 gamma 
(TIF 1 GAMMA), transcript variant alpha, mRNA 


2 


4581 


AF274053 


^enopus laevis maxi-K potassium channel alpha subumt Slo mRNA, 
complete cds 


0.19 


4582 


nDUJJO /*f 


Pseudoregma pendleburyi mitochondrial genes for small subunit 
rKNA, tKNA-Val, large subumt rRNA, partial and complete 
sequences 


0.003 


4583 


AF277081 


Ajellomyces dermatitidis strain 104 blastomyces yeast phase-specific 
protein 1 (bysl) gene, complete cds 


0.26 


4584 


AF334561 


riomo sapiens enromosome i landmark: proximal external boundary 
of P7 VCY inverted repeat 


0.69 


4585 


AJJ i y / 


r .yoeni i clo gene ior a i .rase 


0.23 


4586 




ivuiocnonanon i^ymanina aispar noosomai KJN A large subumt 


0.008 


4587 


AF979001 


Ezvijid vi rub suurype z^aire strain Jviayinga complete genome 


1.5 


4588 


afooj.j.os 


Vibrio cholerae chromosome II, section 62 of 93 of the complete 
ciiroiiiosome 


1.9 


4589 


XM_010966 


Homo sapiens debranching enzyme (S. Cerevisiae) homolog 1 
(DBR1), mRNA 


2 


4591 


U68246 


Dictyostelium discoideum WacA (wacA) gene, complete cds 


0.002 


4592 


XM_047617 


Homo sapiens KIAA1349 protein (KIAA1349), mRNA 


0.027 


4593 


AK003428 


Mus musculus 18 days embryo cDNA, RIKEN full-length enriched 
lihrzirv Hnnf 1 1 10004.107 full inc^rt cpmipn^o 


2.2 


4594 


XM 029072 




0.009 


4595 


L43549 


Buchnera aphidicola aspartyl-tRNA synthetase (aspS), thioredoxin 
reductase (trxB), seryl-tRNA synthase (serS), phosphoserine 
aminotransferase fserC). 3-enolDvruvvlshikirriate-^-nho<?nhntf k 
synthetase (aroA), ribosomal protein SI (rpsA), integration host fa> 


0.21 


4596 


AK023907 


Homo sapiens cDNA FLJ13845 fis, clone THYRO1000815 


0 


4598 


AB042649 


Watermelon spotted wilt virus nss, np genes for non-structural, 
nucleocapsid protein, complete cds, isolate:WS-0 


0.024 


4599 


] 

D85547 < 


Bacillus sp. DNA for cytochrome c oxidase subunit 1,11 and III, 
:omplete and partial cds 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRTP 


P VALUE 


4600 


L29252 


Human (clone D13-2) L-iditol-2 dehydrogenase gene, exon 4, exon 5, 
exon 6 and exon 7 


2E-08 


4601 


AE003872 


Xylella fastidiosa 9a5c, section 18 of 229 of the complete genome 


1.5 


460? 


U29924 


Human AMP deaminase (AMPD3) gene, exon 12 


0.0003 




AE000653 


Helicobacter pylori 26695 section 131 of 134 of the complete genome 


0.75 


460 S 


AP001235 


Homo sapiens genomic DNA, chromosome 2pl 1.2, clone:cos607/7B8 


0.024 


4606 


AF207725 


Mustela lutreola isolate D23 mitochondrial D-loop, partial sequence 


0.028 


4607 


AE007377 


Streptococcus pneumoniae section 60 of 194 of the complete genome 


0.083 


4608 


XM_018430 


Homo sapiens LOC88330 (LOC88330), mRNA 


4E-27 


460Q 


AF273674 


Plasmodium falciparum DNA-dependent RNA polymerase (TRNAP) 
gene, complete cds; nuclear gene for probable mitochondrial product 


0.002 




AY018919 


Oryza sativa microsatellite MRG1244 containing (AT)X23, genomic 
sequence 


6.7 


461 1 


AB025642 


Arabidopsis thaliana genomic DNA, chromosome 5, BAC 
done:T32G24 


2.1 


4612 


X59275 


D.melanogaster mRNA for posterior sex combs 


0.68 


461 1 


AF052959 


Homo sapiens type XV collagen (COL15A1) gene, exon 6 


0.71 


4614 

Wit 


U32730 


Haemophilus influenzae Rd section 45 of 163 of the complete genome 


0.025 


461 S 


AF2 12976 


Boiitoglossa mexicana isolate MEX_6C cytochrome b gene, partial 
cds; mitochondrial gene for mitochondrial product 


0.076 


4616 


AF031493 


Cancer magister molt-inhibiting hormone precursor mRNA, complete 
cds 


0.22 


4617 


AJ245965 


Danio rerio mRNA for pbxy homeodomain protein (pbxy gene) 


0.068 


461 R 


XM_031524 


Homo sapiens AF15ql4 protein (AF15Q14), mRNA 


0 


461 Q 


AL096876 


S.pombe chromosome II cosmid c418 


0.001 


4670 


XM 051190 


Homo sapiens similar to frizzled (Drosophiia) homolog 4 (H. sapiens) 
(LOC93401), mRNA 


6 


469 1 


XM 005411 


Homo sapiens interferon, omega 1 (IFNW1), mRNA 


0.22 


4622 


Z50156 


D.discoideum gene for 34 kD actin binding protein 


0.027 


4623 


BC006182 


Homo sapiens, Similar to calmodulin 2 (phosphorylase kinase, delta), 
clone MGC:715 IMAGE:3528814, mRNA, complete cds 


2 


4624 


AF106933 


Drosophiia melanogaster plexin B (plexB) mRNA, complete cds 


0.66 


4625 


U44392 


Human Down Syndrome region of chromosome 21, clone A12H1- 
1G10 


0.077 


4626 


AF145452 


Arabidopsis thaliana branched chain alpha-keto acid dehydrogenase 
El beta subunit (din4) mRNA, complete cds; nuclear gene for 
mitochondrial product 


6.3 


4627 


AE006315 


Lactococcus lactis subsp. lactis IL1403 section 77 of 218 of the 
complete genome 


0.62 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQED 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4628 


X55978 


T. brucei ESAG 8 gene for a leucine-rich repeat family protein 


0.24 


4629 


U41529 


Caenorhabditis elegans cosmid EGAP3 


0.59 


4630 


XM_039502 


Homo sapiens associated molecule with the SH3 domain of ST AM 
(AMSH), mRNA 


5.3 




AF141650 


Nyctalus noctula microsatellite P223 sequence 


1.5 


4611 


AL096887 


Human DNA sequence from clone RP6-152C18 on chromosome 
Xq26. 1-27.3, complete sequence [Homo sapiens] 


0.02 


4634 


AL 137458 


Homo sapiens mRNA; cDNA DKFZp434E222 1 (from clone 
DKFZp434E2221) 


0.074 


4635 


AE002292 


Chlamydia muridarum, section 24 of 85 of the complete genome 


5.1 


4636 


XM_048200 


Homo sapiens KIAA0276 protein (KIAA0276), mRNA 


5.4 


4637 


AFO 17041 


Dictyostelium discoideum LTR-retrotransposon Skipper, partial 
genomic sequence, 5' end 


0.16 


4638 


Z98853 


Caenorhabditis elegans cosmid R08A2, complete sequence 


5.7 


4639 


XM_0 17899 


Homo sapiens LOC89635 (LOC89635), mRNA 


0 


4640 


BC006943 


Mus musculus, Similar to RIKEN cDNA 3230401N03 gene, clone 
MGC:6906 IMAGE:2655807, mRNA, complete cds 


2 


4641 


AF275345 


Lycopersicon esculentum MADS-box transcription factor jointless 
gene, complete cds 


0.62 


4642 


AK018612 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130011Lll, full insert sequence 


1.9 


4643 


AE006568 


Streptococcus pyogenes Ml GAS strain SF370, section 97 of 167 of 
the complete genome 


6.6 


4644 


AF057019 


Dictyostelium discoideum interaptin (abpD) gene, complete cds 


0.67 


464 S 
tut J 


XM010198 


Homo sapiens immune dysregulation, polyendocrinopathy, 
enteropathy, X-linked (IPEX), mRNA 


5E-29 


4646 


NM_022599 


Rattus norvegicus outer membrane protein (Omp25), mRNA 


1E-14 


4647 


U08455 


HIV-1 isolate 965 clone 26 from Malawi, envelope glycoprotein (env) 
gene, partial cds 


0.72 


4648 


AK023304 


Homo sapiens cDNA FLJ13242 fis, clone OVARC1000578 


6 


4649 


U20803 


Gallus gallus fibrinogen alpha and alpha-E subunit gene, exons 5 and 
6, and complete cds for alpha and alpha-E subunits 


2 


4650 


AF 189768 


Homo sapiens leukocyte immunoglobulin-like receptor 5 (LIR5) gene, 
complete cds 


0.025 


4651 


AK002614 


Mus musculus adult male kidney cDNA, RIKEN full-length enriched 
library, clone:0610012K15, full insert sequence 


2.1 


4652 


AFO 10568 


Plasmodium falciparum microsatellite TA80 sequence 


0.05 


4653 


Arl)4zZoJ 


Schizosaccharomyces pombe sulfide dehydrogenase (hmt2) gene, 
complete cds 


0.63 


4654 


Y16262 


Daucus carota mRNA for neutral invertase 


0.22 


4655 


XM 035796 


Homo sapiens hypothetical gene supported by AF001893; AH005586 
(LOC91063), mRNA 


1.5 


4656 


AB008565 


Homo sapiens gene for deoxylribonuclease II, 5'-upstream region 


0.73 


4657 


Y17275 


Lycopersicon esculentum p69a gene, complete CDS 


0.51 


4658 


X03100 


Human HLA-SB(DP) alpha gene 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



SEQ ID 
NO I ACCESSN 



DESCRIP 



P VALUE 



Human DNA sequence from clone 346P1 1 on chromosome Xq21.2- 
4659 I AL121822 21.33, complete sequence [Homo sapiens] 



1E-08 



4660 AE0O1813 Thermotoga maritima section 125 of 136 of the complete genome 



0.18 



4661 



Mus musculus adult male lung cDNA, RIKEN full-length enriched 
AK004553 library, clone: 1200002022, full insert sequence 



0.61 



Fagonia cretica chloroplast tRNA-Leu intron and trnL-trnF intergenic 
4662 | AJ387942 jspacer 



0.6 



Carabus venustus subsp. liaoningensis mitochondrial gene for NADH 
4663 AB053560 dehydrogenase subunit 5, partial cds, isolate: 73 -Mo Ve99LB02 



0.027 



4664 
4665 



Z78708 Hsapiens flow-sorted chromosome 6 HindlH fragment, SC6pA14H12 



2E-63 



AF010563 Plasmodium falciparum microsatellite TA73 sequence 



0.16 



4666 AB030620 



Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 
I alpha, exon 4, 5, 6, 7, 8 



0.73 



4667 L77036 Homo sapiens (subclone 5^d9 from PI H19) DNA sequence 



0.007 



Oryctolagus cuniculus eukaryotic initiation factor 2B-epsilon mRNA, 
4668 | U23037 | complete cds 



2.3 



4669 AY031967 



HIV-1 isolate NC5413-1999 from USA pol polyprotein (pol) gene, 
(partial cds 



0.67 



4670 AF026069 



Homo sapiens phosphomevalonate kinase (HUMPMKI) gene, partial 
cds 



6.1 



4671 XM_005151 Homo sapiens RecQ protein-like 4 (RECQL4), mRNA 



6.1 



4672 M34662 Human chaperonin (HSP60) non-functional pseudogene 2 



5.3 



4674 AF009255 



Homo sapiens putative chloride channel gene (CLCN6), exons 14 15 
16, and 17 



2.1 



4675 AF298224 [Homo sapiens RPCI-II 289E 13 CD21 gene, partial cds 



0.00004 



4676 NC_002087 |Plasmid pRL765, complete sequence 



0.7 



4677 AF270613 Homo sapiens clone 5qteLctg0024_c 103 t3c80t7_stelrpt sequence 



0.078 



4678 



Homo sapiens genomic DNA, chromosome 21q22.3, 
AP001467 clone:21B27A14, telomere region, complete sequence 



Rattus norvegicus T:G mismatch thymine glycosylase mRNA, 
4679 | A Y026945 | complete cds 



0.000001 



2.1 



4680 NM_017783 Homo sapiens hypothetical protein FLJ20357 (FLJ20357), mRNA 



0.67 



Hsapiens CpG island DNA genomic Msel fragment, clone 182d9, 
4681 | Z57523 [ reverse read cpg!82d9. it lb 



3E-55 



Cryptosporidium parvum KSU-1 heat shock 70 kDa protein (hsp70) 
4682 I U11761 | gene, complete cds 



0.076 



Picea abies isolate D nadl gene, intron b/c; mitochondrial gene for 
4683 I AF142642 | mitochondrial product 



0.08 



4684 I XM_018315 [Homo sapiens hypothetical protein FL J 12078 (FL J 12078), mRNA 



0.64 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESbN 


DESCRIP 


P VALUE 


4685 


U60297 


Gallus gallus DNA polymerase gamma mRNA, nuclear gene 
encoding mitochondrial protein, partial cds 


0.65 


4686 


M17837 


D.melanogaster (strain Ja-F) alcohol dehydrogenase gene (allele Adh- 
F), complete cds 


0.08 


4687 


X17265 


Human germline kappa-immunoglobulin variable region (V-kappa-II) 
gene Vcc 


0.25 


4688 


AM 030896 


Homo sapiens similar to hypothetical protein MGC5384 (H. sapiens) 
(LOC90321), mRNA 


7E-32 


4689 


AF083064 


Mus musculus paxillin-like protein (Hic5) gene, complete cds 


e-103 


4690 


X57248 


L.helveticus galK, galT and galM genes for galactokinase, galactose-1 
P uridyl transferase and mutarotase, partial cds 


0.003 


4692 


AF074020 


Mus musculus integral membrane protein 2A (Itm2a) gene, complete 
cds 


0.059 


4693 


D 17668 


Clostridium septicum gene for alpha-toxin, complete cds 


0.08 


4694 


AF089042 


Molothrus badius cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


0.076 


4695 


AK004553 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200002022, full insert sequence 


0.63 


4696 


AF047579 


Bacillus thuringiensis insecticidal crystal protein (NRcry V) gene, 
complete cds 


0.69 


4697 


AF049489 


Cards familiaris factor VIII mRNA, complete cds 


6.3 


4698 


X57599 


S.xylosus xylR, xylA, xylB genes for Xyl repressor, xylose isomerase, 
xylulokinase 


0.024 


4699 


L04873 


Mus musculus cystic fibrosis transmembrane regulator (CFTR) gene, 
exon 1, and upstream region 


0.25 


4700 


XM_041483 


Homo sapiens hypothetical protein MGC 15827 (MGC 15827), mRNA 


2.2 


4701 


U30355 


Plocamium cartilagineum complete internal transcribed spacer region 
including ITS 1, 5.8s rDNA and ITS 2 


0.082 


4702 


AF346599 


Gallus gallus myostatin (MSTN) gene, exons 1,2 and 3 and complete 
cds 


1.8 


4703 


XM_007118 


Homo sapiens A kinase (PRKA) anchor protein 1 1 (AKAP1 1), mRNA 


6.2 


4704 


XM_042395 


Homo sapiens superkiller viralicidic activity 2 (S. cerevisiae homolog) 
like (SKIV2L), mRNA 


3.5 


4705 


Z78791 


H.sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA16H6 


2E-08 


4706 


AE006402 


Lactococcus lactis subsp. lactis EL1403 section 164 of 218 of the 
complete genome 


0.7 


4707 


A? / O / %J 


n.acipiciia j-a pruauigi<inuiii receptor pseuuogene / 


o.o 


4708 


AP001438 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T695, 
LB7T-ERG region, complete sequence 


0.0001 


4709 


XM_049375 


Homo sapiens WW domain binding protein 4 (formin binding protein 
21)(WBP4), mRNA 


3E-16 


4710 


ABO 16828 


Luehdorfia longicaudata mitochondrial ND5 gene for NADH 
dehydrogenase subunit 5, partial cds 


0.45 
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Table 3A Nearest Neighbor CBlastN vs. Genhankl 


SEQK 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


4711 


U23439 


Bovine herpesvirus 4 1. 1-kb mRNA 


0.64 


4712 


AP001438 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:T695, 
LB7T-ERG region, complete sequence 


0.00003 


4713 


AP000378 


Arabidopsis thaliana genomic DNA, chromosome 5, BAC 
clone:T4M5 


0.24 


4714 


AC064850 


Homo sapiens clone RP11-117H9, complete sequence 


0.009 


4715 


Z93943 


Human DNA sequence from cosmid U235H3 on chromosome X 


0.006 


4716 


AJ132901 


Drosophila guanche mitochondrial A+T-rich region 


0.052 


4717 


AE006291 


Lactococcus lactis subsp. lactis IL1403 section 53 of 218 of the 
complete genome 


0.24 


4718 


AF227741 


Rattus norvegicus protein kinase WNK1 (WNK1) mRNA, complete 
cds 


6.1 


4720 


273524 


S.cerevisiae chromosome XVI reading frame ORF YPL168w 


0.072 


4721 


AF142484 


Sebastes melanops microsateliite Sme2 sequence 


0.19 


4722 


X51938 


D. melanogaster DNA for a region 3* to the white gene, corresponding 
to the FB-NOF insertion site in TE146 


2 


4723 


Z80361 


H.sapiens HLA-DRB pseudogene, repeat region; 


0.55 


4724 


AF3 12620 


Plasmodium vivax clone CI 9b thrombospondin-related adhesive 
protein gene, partial cds 


2.1 


4725 


XM017198 


Homo sapiens hypothetical protein FLJ12085 (FLJ12085), mRNA 


1E-35 


4726 


AJ408943 


Homo sapiens chromosomal breakpoint fragment derivative 1 1, 
MLL/AF4 fusion, patient UPN05 1 


0.001 


4727 


XM 050754 


Homo sapiens KIAA1594 protein (KIAA1594), mRNA 


0.5 


4728 


AE002938 


Drosophila melanogaster genomic scaffold 142000013385520, 
complete sequence 


0.008 


4729 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0.21 


4730 


AK000109 


Homo sapiens cDNA FLJ20102 fis, clone COL04783 


2.1 


4731 


AF163325 


Trichoderma harzianum mitochondrial plasmid pThrl, complete 
plasmid sequence 


0.028 


4732 


AL137277 


Homo sapiens mRNA; cDNA DKFZp434Ml 1 14 (from clone 
DKFZp434M1114) 


1.9 


4733 


U43145 


Plasmodium chabaudi repeat organellar protein gene, complete cds 


0.64 


4734 


X71875 ] 


Hsapiens gene for protein serine kinase PSK-H1, first coding exon 


1.7 


4735 


] 

AE001479 


Helicobacter pylori, strain J99 section 40 of 132 of the complete 
genome 


0.74 


4736 


M29586 ] 


Elops saurus Ig germline H-chain pseudogene V-region, partial cds 


0.66 


4737 


] 

AK020838 1 


VIus musculus adult retina cDNA, RIKEN full-length enriched 
ibrary, clone:A930009F24, full insert sequence 


0.025 


4738 


I 

AB038343 i 


Melon yellow spot virus NSs and N genes for nonstructural protein 
md nucleoprotein, complete cds, isolate.Tospo-melo 


0.68 


4739 


AB005 130 | Candida maltosa gene for EPD 1 , complete cds 


0.25 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4740 


AF361581 


Arabidopsis thaliana AT4g35320/F23E12 120 gene, complete cds 


1.9 


4741 


Z75032 


S.cerevisiae chromosome XV reading frame ORF YOR124c 


0.025 


4742 


U17681 


Human neurofibromatosis 1 (NF1) gene, exons 16, 17, 18 and 19a 


0.45 


4743 


AK025048 


Homo sapiens cDNA: FLJ21395 fis, clone COL03557 


0.66 


4744 


AF198444 


Homo sapiens 10q21 mRNA sequence 


0.008 


4745 


AE005556 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 175 
of 290 


0.71 


4746 


X74784 


M.musculus mk2e mRNA 


0.17 


4747 


AB029480 


Oryzias latipes gene for Y3 olfactory receptor, complete cds 


0.077 


4749 


AF072932 


Medicago sativa cold acclimation responsive protein CAR1 (CAR1) 
mRNA, complete cds 


0.66 


4750 


AF176056 


Capitonus sp. 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 


0.009 


4751 


X03366 


Bovine spleen trypsin inhibitor II (SI) gene 


0.12 


4752 


L22951 


Human immunodeficiency virus type 1 (UG275) proviral DNA 
encoding env, tat, vpu, rev, and nef genes 


0.23 


4753 


U95851 


Amblyomma darwini 12S ribosomal RNA gene, mitochondrial gene 
for mitochondrial RNA, partial sequence 


5.4 


47 54 


AE001372 


Plasmodium falciparum chromosome 2, section 9 of 73 of the 
complete sequence 


2.2 


4755 


AF147813 


Protrama radicis elongation factor 1 alpha gene, partial cds 


0.24 


4756 


U77974 


Tribolium castaneum transcription factor homolog (Tc-eve) gene, 
complete cds 


0.003 


4757 


XM_045423 


Homo sapiens KIAA0701 protein (KIAA0701), mRNA 


6.3 


4758 


J04357 


Red clover necrotic mosaic virus RNA-1, complete sequence 


0.96 


4759 


XM_010136 


Homo sapiens O-linked N-acetylglucosamine (GlcNAc) transferase 
(UDP-N-acetylglucosamine:polypeptide-N-acetylglucosaminyl 
transferase) (OGT), mRNA 


0.69 


4760 


XM_030855 


Homo sapiens similar to KIAA1361 protein (H. sapiens) (LOC90315), 
mRNA 


0.008 


4761 


AY031575 


HIV-1 isolate NC5000-1999 from USA pol polyprotein (pol) gene, 
partial cds 


6.2 


4767 


XM 007592 


Homo sapiens Homer, neuronal immediate early gene, 2 (HOMER- 
2B), mRNA 


2.2 


4763 


ACO 16707 


Homo sapiens BAC clone RP1 1-22 1K4 from Y, complete sequence 


0.75 


4764 


AF001848 


Oxytricha nova phosphoglycerate kinase (PGK) gene, complete cds 


5.7 


4765 


BC006693 


Mus musculus, clone MGC:7897 IMAGE:3582679, mRNA, complete 
cds 


1.9 


4766 


L81574 


Homo sapiens (subclone 7 _gl from PI HI 1) DNA sequence, complete 
sequence 


0.00003 


4767 


U67508 ] 


Methanococcus jannaschii section 50 of 150 of the complete genome 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4768 


AY021494 


Oryza sativa microsatellite MRG3819 containing (TA)X27, genomic 
sequence 


0.023 


4769 


L75945 


Borrelia burgdorferi flagellar hook protein (flgE), flbD, flagellar 
motor apparatus (motAB), fliL, fliM, fliZ, flagellar export apparatus 
(fliPQR, flhB), flhF, flbE genes 


0.76 


4770 


AL590606 


S.pombe chromosome II BAC pB7E8 


0.068 


4.771 


AF060728 


Conospermum mitchelii chloroplast atpB-rbcL intergenic spacer 
region, partial sequence 


0.2 


dill 


AB056833 


Macaca fascicularis brain cDNA clone:QflA- 14255, full insert 
sequence 


0.068 


4773 


AP001416 


Homo sapiens genomic DNA, chromosome 21q22,2, clone:D34, 
LB7T-ERG region, complete sequence 


0.0003 


4774 


XM_007838 


Homo sapiens nuclear factor of activated T-cells 5, tonicity-responsive 
(NFAT5), mRNA 


0.59 


477S 


AC024777 


Caenorhabditis elegans cosmid Y42H9AR, complete sequence 


0.74 


±11« 


AF333186 


Dictyostelium discoideum beta-alanine synthase (pyd3) mRNA, 
complete cds 


0.057 


4778 


DO 1021 


Anthocidaris crassispina mRNA for dynein beta-heavy chain, 
complete cds 


0.69 


4.770 


U65750 


Xenopus laevis fork head related (XFD-1') gene, complete cds 


0.079 


4780 


NM_008176 


Mus musculus GROl oncogene (Grol), mRNA 


1.8 




M88115 


Hylobates lar Myc gene, complete cds 


0.73 


4782 


AP000388 


Arabidopsis thaliana genomic DNA, chromosome 3, PI clone:MRI12 


0.022 


4783 


AE007246 


Sinorhizobium meliloti plasmid pSymA section 52 of 121 of the 
complete plasmid sequence 


6.7 


4784. 


XM_043069 


Homo sapiens KIAA1464 protein (KIAA1464), mRNA 


0.08 


4.78S 


U41482 


Human Down Syndrome region of chromosome 21 genomic sequence, 
clone A39D1-1H8 


0.001 


*T / OU 


AL1 10134 


Homo sapiens mRNA; cDNA DKFZp564H072 (from clone 
DKFZp564H072) 


0.058 




XM_005044 


Homo sapiens KIAA0628 gene product (KIAA0628), mRNA 


0.002 


*T /OO 


XM_052430 


Homo sapiens E3 ubiquitin ligase SMURF1 (SMURF1), mRNA 


2E-30 




NM_031320 


Rattus norvegicus cadherin EGF LAG seven-pass G-type receptor 3 
(Celsr3) s mRNA 


1.9 




AF184590 


Plasmodium falciparum ADA2-like protein gene, partial cds 


0.009 


4792 


AF061434 


Vicia faba amino acid transporter a (AAPA) mRNA, partial cds 


1.9 


4793 


AL1 17594 


Homo sapiens mRNA; cDNA DKFZp564C1563 (from clone 
DKFZp564C1563) 


0.074 


4794 


AJ133025 


Megaselia abdita zen gene, exons 1-2 


0.02 


4795 


XM.033087 


Homo sapiens hypothetical protein FLJ20359 (FLJ20359), mRNA 


0.024 


4796 


AE001462 


Helicobacter pylori, strain J99 section 23 of 132 of the complete 
genome 


1.1 


4797 


XM_042814 


Homo sapiens hypothetical protein FLJ13397 (FLJ13397), mRNA 


0.063 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4798 


AL137325 


Homo sapiens mRNA; cDNA DKFZp434M0835 (from clone 
DKFZp434M0835) 


2E-14 


4799 


AB030620 


Mus musculus Chetk-alpha gene for choline/ethanolamine kinase- 
alpha, exon 4, 5, 6, 7, 8 


0.64 


4800 


AB037417 


Oryza sativa gene for aspartate carbamoyl transferase, complete cds 


0.026 


4801 


AK024522 


Homo sapiens cDNA: FLJ20869 fis, clone ADKA02377 


0.21 


4802 


X62463 


H.sapiens 3' flanking region for estrogen receptor (placenta) gene 


0.7 


4803 


AE001127 


Borrelia burgdorferi (section 13 of 70) of the complete genome 


0.2 


4804 


Z70751 


Caenorhabditis elegans cosmid Fl 1 A3, complete sequence 


0.22 


4805 


XM_038231 


Homo sapiens KIAA0793 gene product (KIAA0793), mRNA 


6E-53 


4806 


AF223425 


Mus musculus MAB21L2 (MAb2112) gene, complete cds 


4.5 


4807 


XM_039048 


Homo sapiens hypothetical protein MGC1 1303 similar to Zink 
transporter 2 (MGC11303), mRNA 


2 


4808 


U75604 


Tilapia nilotica type I estrogen receptor mRNA, complete cds 


0.057 


4809 


XM_0 15243 


Homo sapiens stimulated trans-acting factor (50 kDa) (STAF50), 
mRNA 


1.7 


4810 


AF247970 


Caenorhabditis elegans cell cycle checkpoint protein Rad9 gene, 
complete cds 


0.008 


4811 


X12574 


Parsley PcPRl-2 mRNA for pathogenesis-related protein type A 


0.077 


4812 


AF332562 


Simian virus 40 strain 777, complete genome 


0.64 


4813 


XM_005118 


Homo sapiens general transcription factor IIE, polypeptide 2 (beta 
subunit, 34kD) (GTF2E2), mRNA 


1 


4814 


BC006700 


Mus musculus, Similar to RKEN cDNA 2310035M22 gene, clone 
MGC:6680 IMAGE: 3 50 1285, mRNA, complete cds 


0.21 


4815 


AB025319 


Yaba monkey tumor virus DNA, BamHl restriction fragment E, M 
and partial C, partial and complete cds 


0.18 


4816 


XM_048433 


Homo sapiens ATPase, H+ transporting, lysosomal (vacuolar proton 
pump), subunit 1 (ATP6S1), mRNA 


0.55 


4817 


Y11095 


Rice stripe virus UNA 3 


5.9 


4818 


AL589991 


Human DNA sequence from clone RP1 1-241 J5 on chromosome 6, 
complete sequence [Homo sapiens] 


3E-78 


4819 


XM_043498 


Homo sapiens G protein-coupled receptor 44 (GPR44), mRNA 


0.0002 


4820 


XM 047477 


Homo sapiens KIAA1638 protein (KIAA1638), mRNA 


0.077 


4821 


AJ242480 


Streptococcus thermophilus ORF1, ORF2, ORF3, ORF4, ORF5 and 
ORF6 DNA, strain StO 


0.18 


4822 


XM_0 16395 


Homo sapiens region containing hypothetical protein; mutL (E. coli) 
homolog 3 (LOC82389), mRNA 


6E-74 


4823 


XM_043492 


Homo sapiens 42259 (KIAA1728), mRNA 


0.0000006 


4824 


L43603 


Gallus gallus Na+/K+-ATPase alpha- 1 subunit gene, promoter region 


0.41 


4825 


AF257746 


Rattus norvegicus multidrug resistance protein la (Pgyl) mRNA, 
complete cds 


0.5 


4826 


X68794 


D.melanogaster "ten-a" mRNA for secreted protein 


1.2 
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Table 3A Nearest Neighbor (BlastN vs. Genbanlri 


SEQIE 
NO 


i 

ACCFSSN 

n V/ v J-< u ox i 


UJiaCRlP 


P VALUE 


4827 


AF330118 


Homo sapiens voltage-gated sodium channel type III alpha subunit 
(SCN3A) gene, exon 2 


0.12 


4828 


XM 011230 


numu dapicii^ iusea gene irom oqji (Ar5v231), itiRNA 


5.8 


4829 


XM 030669 


Homo sapiens hypothetical gene supported by AL13301 1 

fi OC9098JO mPNA 


1.9 


4830 


U36927 


Plasmodium yoelii rhoptiy protein gene, complete cds 


0.21 


4831 


AF201353 


v^uiuiiiua nvid. giuuuiuiic receptor \^ gene, partial cds 


5 


4832 


AF331435 

ill JJltJJ 


ru v-i iyz Hum Australia envelope protein (env) gene, partial cds 


0.07 


4833 


AF362391 


^cuiuiua diDiccuis neai shock protein rispiU4 (HoF104) gene, HSP104- 
b allele, complete cds 


1.1 


4834 


XM_027927 


nuuiu bctpiciib iiypomeucai gene supported by Zv/017 (1AJC89989), 
mRNA 


0.48 


4835 


AE001127 


x^unciid uuiguuiicn ^becuon ij 01 /u^ oi we complete genome 


0.72 


4836 


AL390126 


S. pombe chromosome III cosmid c233 


0.023 


4837 


AF1 10213 


rvdiiua iHjivcgicuo ixiauiiii receptor precursor, exon 14 


0.56 


4838 


M38534 


Paramecium aureiia lmmoDiuzation antigen (5 1 C) gene, exon 5 and 
complete cds 


0.015 


4839 


AE001430 


nasmoaium iaiciparum enromosome 2, section 67 of 73 of the 
complete sequence 


0.052 


4840 


AF181475 


Mus musculus synaptonemal complex protein 3 (Sycp3) gene, exon 5 


0.19 


4841 


AF17774? 


ouuo bdiians cione pt3jvui4u mitocondnai mimcircle DNA 


0.024 


4842 


XM_004347 


riomo sapiens eiongauon or very long chain fatty acids (FENl/E!o2, 
SUR4/Elo3, yeast)-like 2 (ELOVL2), mRNA 


0.66 


4843 


AF083494 


ivius muscuius pnospnoiipase JJl (FLJJ1) gene, exons 23 and 24, 


0.53 


4844 


U89684 


Lycopersicon esculentum protein kinase (LePK7) gene, partial cds 


0.22 


4845 


AR07S716 


oidpiiyiocuccus aureus gene ior rmttJ, complete cds 


0.57 


4846 


AFO 1718ft 


Schizosaccharomyces pombe hexose transporter (Ght2) mRNA, 


0.87 


4847 


226314 


r . idiL/ipdi tun gene iur o i /vt\_r anugen 


0.007 


4848 


XM_045160 


Homo sapiens KIAA0275 gene product (KIAA0275), mRNA 


1 


4849 


AK0 16495 


ivius muscuius aaun maie testis cjlina, kxkjin lull-length enriched 
library, clone :493 143 1C02, full insert sequence 


0.62 


4850 


BC004496 


numu bdpiciib, oiixuidj 10 nypoineiicai protein rLJi4U3o, clone 
EMAGE 3831313 mRNA 


1. / 


4851 


XM_005684 


Homo sapiens tolloid-like 2 (TLL2), mRNA 


1.6 


4852 


AF249077 


Sphaerotheca pluvialis cytochrome b (cytb) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.047 


4853 


XM 016934 1 


Homo sapiens hypothetical protein FLJ12891 (FLJ12891), mRNA 


5.4 


4854 


X65747 ] 


Etnorvegivus gnat-3 mRNA for gustducin 


0.2 


4855 


XM_0 11705 J 


Somo sapiens paraneoplastic antigen MA2 (PNMA2), mRNA 


0.2 
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I SEQ ID 

NO I ACCESSN 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



Arabidopsis thaliana cytosine-5 methyltransferase mRNA, complete 
4856 | L10692 cds 



Arabidopsis thaliana gene for MYB transcription factor Atmyb2 
4857 | AB052250 [complete cds, strain: Yo-0 



Homo sapiens region containing tryptase beta 2; tryptase beta 1 
4859 [ XM 033535 |(LOC82314), mRNA 



Rattus norvegicus stem cell factor gene, 5' flanking region and exon 1 
4860 | U49846 [partial sequence 



Tetrahymena thermophila kin2 gene for kinesin-II homologue, exons 
4861 I AJ244021 1-4 



4862 AF201021 



Stachyarrhena sp. Jansen-Jacobs 4707 ribosomal protein S16 (rpsl6) 
gene, partial intron sequence; chloroplast gene for chloroplast product 



4863 AJ245636 |Mus musculus P2Y1 gene for P2Y1 receptor 



4864 AK023175 [Homo sapiens cDN A FLJ13 113 fis, clone NT2RP3 0025 90 



4865 XM 034638 



Homo sapiens hypothetical gene supported by AK024355 
|(LOC90867), mRNA 



4866 ABO 1 1 006 H Rattus norvegicus gene for leptin receptor, partial cds 



Homo sapiens hypothetical protein DKFZp761I172 (DKFZP76 11172) 
4867 I XM 031128 mRNA 



4868 AK001442 I Homo sapiens cDNA FL J 10580 fis, clone NT2RP2003 53 3 



4870 AF307954 



Saccharomyces servazzii 46.1 kDa protein, KAR4-like protein, and 
|SPBl-like protein genes, complete cds 



y 5- 



4871 AF055079 



Panulirus argus inositol 1,4,5-trisphosphate receptor (IP3R) mRNA, 
I complete cds 



4872 X88849 Ccoli ceuB, ceuC, ceuD, ceuE, OrfA, QrfB genes 



4873 XMJ)39699 Homo sapiens hypothetical protein FLJ23309 (FLJ23309), mRNA 



4874 



Lactococcus lactis bacteriophage ul36.2 ORF1 1 lb (ORF1 1 lb), 
ORF78b (ORF78b), ORF61b (ORF61b), ORF245 (ORF245), ORF364 
(ORF364), and putative replisome organizer (ORF255) genes, 
AF2 12847 complete cds; and ORF241 (QRF241) gene, partial cds 



4875 XMJ)52223 Homo sapiens DKFZP564G092 protein (DKFZP564G092), mRNA 



4876 AK024606 [Homo sapiens cDNA: FLJ20953 fis, clone ADSE01979 



4877 



U96097 



IClavibacter michiganensis subsp. sepedonicus species-specific DNA 
clone Cms85 



4878 Z48599 [L.mexicana lmcpc gene cathepsin B-like cysteine proteinase" 



4879 NM_014160 [Homo sapiens HSPC070 protein (HSPC070), mRNA 



4880 XM 031388 



iHomo sapiens NADH dehydrogenase (ubiquinone) 1 alpha 
subcomplex, 5 (13kD, B13) (NDUFA5), mRNA 



4881 AF059531 



Homo sapiens protein arginine N-methyltransferase 3 (PRMT3) 
I mRNA, partial cds 



4882 AF240700 



jGymnocrotaphus curvidens cytochrome b (cytb) gene, partial cds; 
I mitochondrial gene for mitochondrial product 



P VALUE 



2.1 



0.36 



1.3 



0.18 



0.063 



0.057 



0.2 



0.2 



1E-08 



5.5 



0.22 



2E-12 



0.19 



0.0008 



3E-42 



1.9 



0.69 



3E-89 



0.47 



0.063 



0.55 



0.18 



0.65 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQIE 


ACCESSN 


DESCRIP 


P VALUE 


4883 


AE005988 


Caulobacter crescentus section 3 14 of 359 of the complete genome 


2 


4884 


AF332785 


Neocalanus plumchrus cytochrome oxidase subunit I (COI) gene, 
partial cds; mitochondrial gene for mitochondrial product 


0.65 


too J 


AJ278986 


Pichia etchellsii plasmid pPElB 


0.051 


4886 


AL512660 


Human DNA sequence from clone RP1 1-353M9 on chromosome 10, 
complete sequence [Homo sapiens] 


6.3 


too f 


AF147082 


Homo sapiens gamma-glutamyl hydrolase gene, exons 3 through 7 


2 


tooo 


AL389874 


Human DNA sequence from clone RP1 1-563A22 on chromosome 20 
Contains the 5* end of a novel gene weakly similar to Xenopus laevis 
putative Zic3 binding protein, complete sequence [Homo sapiens] 


0.008 


too? 


AC084157 


Caenorhabditis elegans cosmid Y46E12BR, complete sequence 


0.084 


Ho? 1 


AE001183 


Borrelia burgdorferi (section 69 of 70) of the complete genome 


0.66 


4892 


AE002137 


Ureaplasma urealyticum section 38 of 59 of the complete genome 


0.52 


f 07J 


XM 028606 


Homo sapiens Spl transcription factor (SP1), mRNA 


5 


4894 


NC_001453 


Strongylocentrotus purpuratus mitochondrion, complete genome 


6 


4oVj 


AK023104 


Homo sapiens cDNA FLJ13042 fis, clone NT2RP30O1318 


0.064 


4896 


XM_0 15243 


Homo sapiens stimulated trans-acting factor (50 kDa) (STAF50), 
mRNA 


1.7 


4897 


XM_043823 


Homo sapiens hypothetical protein FLJ20958 (FLJ20958), mRNA 


0.000004 


48yo 


AB049900 


Macaca fascicularis brain cDNA, clone:QnpA-19713 


6.1 


*foyy 


AE001390 


Plasmodium falciparum chromosome 2, section 27 of 73 of the 
complete sequence 


0,23 




AE007346 


Streptococcus pneumoniae section 29 of 194 of the complete genome 


0.73 


4901 


X17339 


Dengue-2 virus NS1 gene for nonstructural protein (patient M2) 


0.24 




AK000864 


Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 


0.00001 


4903 


AF180076 


Panorpa helena cytochrome oxidase I (COI) gene, partial cds; 
mitochondrial gene for mitochondrial product 


0.6 


4904 


AC008855 


Homo sapiens chromosome 5 clone CTD-2178M23, complete 
sequence 


3E-12 




K02990 


Human hepatitis A virus, complete genome 


0.23 




AE005310 


Escherichia coli 0157:H7 EDL933 genome, contig 1 of 3, section 134 
of 155 


0.078 




NM 032717 ] 


Homo sapiens hypothetical protein MGC1 1324 (MGC1 1324), mRNA 


0.68 


4908 


] 

BC004118 < 


Homo sapiens, clone MGC: 1 1 170 IMAGE:3843 148, mRNA, 
:omp!ete cds 


1.5 


4909 


] 

L29252 ( 


Human (clone D13-2) L-iditol-2 dehydrogenase gene, exon 4, exon 5, 
jxon 6 and exon 7 


0.0000001 


4910 


i 

S68117 < 


-PLP-A=prolactin-like protein A {5* region, exon 1, intron 1} [rats, 
3enomic, 1187 nt] 


0.11 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


4911 


NM 016701 


Mus musculus nestin (Nes), mRNA 


0.21 


4912 


AE006418 


Lactococcus lactis subsp. lactis IL1403 section 180 of 218 of the 
complete genome 


0.009 


4913 


XM_041162 


Homo sapiens hypothetical protein KIAA1165 (KIAA1165), mRNA 


4E-50 


4914 


AF147358 


Homo sapiens full length insert cDNA clone YB61E03 


0.00003 


4915 


AE000642 


Helicobacter pylori 26695 section 120 of 134 of the complete genome 


1.9 


4916 


XM 040436 


Homo sapiens nebulin (NEB), mRNA 


2 


4917 


AB026296 


Pisum sativum mRNA for hsr203 J homolog, complete cds 


0.024 


4918 


AF159148 


Danio rerio Noggin 2 mRNA, complete cds 


0.23 


4919 


AF234184 


Lucilia cuprina sex-lethal protein SXL2 mRNA, complete cds, 
alternatively spliced 


0.026 


4920 


AK007229 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700121L03, full insert sequence 


0.21 


4921 


AJ404228 


Saccharomyces douglasii mitochondrial tRNA gene cluster 


0.021 


4922 


X71604 


H.sapiens son-pseudogene 


0.53 


4923 


AF169122 


Homo sapiens DNA (cytosine-5) methyltransferase gene, intron 4, 
partial sequence 


0.059 


4924 


XM_040202 


Homo sapiens protein phosphatase, EF hand calcium-binding domain 
1 (PPEF1), mRNA 


6E-09 


4925 


AY027893 


Homo sapiens voltage-dependent calcium channel beta 2 subunit 
(CACNB2) gene, exon 1 


3.3 


4926 


AF136008 


Desulfobacterium sp. BSv41 16S ribosomal RNA gene, partial 
sequence 


5.6 


4927 


NM_007439 


Mus musculus anaplastic lymphoma kinase (Alk), mRNA 


0.071 


4928 


XM_006758 


Homo sapiens frizzled (Drosophila) homolog 10 (FZD10), mRNA 


2 


4929 


AL139329 


Human DNA sequence from clone RP1 1-228P1 on chromosome 6, 
complete sequence [Homo sapiens] 


4E-38 


4930 


L03188 


Saccharomyces cerevisiae integrin analogue gene, complete cds 


0.23 


4931 


AB039881 


Canis familiaris mRNA for cytochrome P450c21, complete cds 


0.55 


4932 


AK004783 


Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone: 1200015E14, full insert sequence 


6.2 


4933 


U41343 


Human prolargin (PRELP) gene, exon 2 


0.0000004 


4934 


AF157369 


Homo sapiens Niemann-Pick CI protein (NPC1) gene, exon 5 


0.58 


4935 


XM J) 16505 


Homo sapiens KIAA0999 protein (KIAA0999), mRNA 


9E-26 


4936 


U07807 


Human metallothionein IV (MTIV) gene, complete cds 


0.008 


4937 


NM_018818 


Mus musculus choroidermia (Chm), mRNA 


1.7 


AQ1Q 


BC009075 


Mus musculus, beta- 1 ,3-N-acetylglucosaminyltransferase 1, clone 
MGC:6892 IMAGE:2654354, mRNA, complete cds 


1.2 


4939 


AF304354 


Homo sapiens proteoglycan 3 (PRG3) gene, complete cds 


0.24 


4940 


AK000864 


Homo sapiens cDNA FLJ10002 fis, clone HEMBA1000046 


0.00001 


4941 


AF076601 


Dictyostelium discoideum developmental protein (DG1105) gene, 
partial cds 


0.95 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 


ACCESSN 


DESCRIP 


P VALUE 


4y4z 


AF025542 


Bombyx mori insulin receptor-like protein precursor (BIR) mRNA, 
complete cds 


0.64 


A l\A *> 

4943 


XM_038714 


Homo sapiens hypothetical protein dJ37E16.5 (DJ37E16.5), mRNA 


6.3 


4944 


AC016707 


Homo sapiens BAC clone RP11-221K4 from Y, complete sequence 


0.054 


4945 


XM_005684 


Homo sapiens tolloid-like 2 (TLL2), mRNA 


1.1 


4946 


AJ276170 


Rattus norvegicus partial Vps541 processed pseudogene, 3' end 


0.041 


4947 


AJ132986 


Vibrio sp. partial 16S ribosomal RNA, isolate A515 


5.9 


4948 


XM_009513 


Homo sapiens hypothetical protein (HSPC207), mRNA 


0.21 


4949 


AF 18 1668 


Homo sapiens clone 173pl7 polymorphic microsatellite sequence 


0.17 


4950 


AF045453 


Dictyostelium discoideum protein kinase YakA (yakA) mRNA, 
complete cds 


0.007 


4951 


AF279135 


Dictyostelium discoideum spore coat structural protein SP65 (cotE) 
gene, complete cds 


0.003 


4952 


AK012631 


Mus musculus 10, 11 days embryo cDNA, RIKEN full-length 
enriched library, clone:2810001G20, full insert sequence 


0.071 


4953 


AB047630 


Macaca fascicularis brain cDNA, clone:QnpA-13555 


0.69 


4954 


AJ011848 


Hordeum vulgare chloroplast rpsl5 (partial), ndhH, ndhl, ndhG, 
ndhE, psaC, ndhD and ndhA genes 


0.054 


4955 


U34363 


Plasmodium falciparum CTRP gene, complete cds 


0.18 


4956 


AF220175 


Homo sapiens acid ceramidase (ASAH) gene, exons 5 through 14, and 
complete cds 


0.0009 


4958 


D26442 


Yeast gene for PES4 PAB-like protein, complete cds 


0.003 


4959 


NC__002184 


Penaeus monodon mitochondrion, complete genome 


1.9 


4960 


XM_039251 


Homo sapiens 37381 (KIAA1660), mRNA 


5 


4961 


XM_004462 


Homo sapiens signal sequence receptor, alpha (translocon-associated 
protein alpha) (SSR1), mRNA 


0.72 


4962 


AF252420 


Bos taurus clone MNB-5 microsatellite sequence 


0.08 


4963 


XM 035186 


Homo sapiens RCE1 (S. Cerevisiae) homolog, prenyl protein protease 
(RCE1), mRNA 


2 


4964 


AL110134 


Homo sapiens mRNA; cDNA DKFZp564H072 (from clone 
DKFZp564H072) 


0.058 


4965 


AL049452 


Homo sapiens mRNA; cDNA DKFZp586C1322 (from clone 
DKFZp586C1322) 


0.67 


4966 


AB047614 


Macaca fascicularis brain cDNA, clone:QnpA-12025 


0.077 


4967 


XM 043315 


Homo sapiens KIAA0229 protein (KIAA0229), mRNA 


6.2 


4968 


AKO 17495 


Mus musculus 8 days embryo cDNA, RIKEN full-length enriched 
library, clone:5730405K23, full insert sequence 


0.025 


4969 


AB012141 


Candida glabrata TEF3 gene for translation elongation factor3, 
complete cds 


0.003 


4970 


AK022914 


Homo sapiens cDNA FLJ12852 fis, clone NT2RP2003445 


0.0003 


4971 


AF133211 


Homo sapiens COT kinase proto-oncogene, partial cds, alternatively 
spliced 


0.008 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 

XT/"* 

NO 


ACCESSN 


DESCRIP 


P VALUE 


4972 


AF349681 


Simian immunodeficiency virus strain SIVrcmNg409 from Nigeria 5* 
long terminal repeat, partial sequence; gag protein (gag) gene, 
complete cds; pol protein (pol) gene, partial cds; and vif gene, 
complete sequence 


0.68 


4973 


AF350254 


Homo sapiens fibrinogen gamma chain precursor (FGG) gene, 
complete cds, alternatively spliced 


0.21 


4974 


AF3 14052 


Rhizobium sp. CIAT613 nifHc gene, partial sequence 


1.9 


4975 


XM_036277 


Homo sapiens inhibitor of DNA binding 4, dominant negative helix- 
loop-helix protein (ID4), mRNA 


0.076 


4976 


AF057740 


Homo sapiens 17-beta-hydroxysteroid dehydrogenase IV (HSD17B4) 
gene, exon 24 and complete cds 


0.00003 


4977 


XM_051777 


Homo sapiens similar to NAD(P) dependent steroid dehydrogenase- 
like (M. musculus) (LOC93517), mRNA 


4E-11 


4978 


AF3 14052 


Rhizobium sp. CIAT613 nifHc gene, partial sequence 


2 


4979 


BC010911 


Homo sapiens, Similar to adaptor protein containing pH domain, PTB 
domain and leucine zipper motif, clone IMAGE:4295177, mRNA 


0.076 


4980 


AF2 14529 


Homo sapiens dystrophin (DMD) gene, intron 51, partial sequence 


0.001 


4981 


AK024235 


Homo sapiens cDNA FLJ14173 fis, clone NT2RP2002755 


e-104 


4982 


AB035089 


Homo sapiens SCCA2 gene for squamous cell carcinoma antigen 2, 
complete cds 


0.23 


4983 


M97555 


Schistosoma mansoni tropomyosin mRNA, complete cds 


0.22 


4984 


AE002745 


Drosophila melanogaster genomic scaffold 142000013385404, 
complete sequence 


0.003 


4985 


X99288 


B.calamita repeat region, clone-lib micro-8 


0.23 


4987 


AJ308840 


Coffea arabica microsatellite DNA, clone 16-6CTG 


0.025 


4988 


AB049134 


Cucurbita moschata CmATSl;l gene for acyl-(acyl-carrier-protein): 
glycerol-3-phosphate acyltransferase, complete cds 


0.53 


4989 


X03991 


Human glucagon gene 


0.074 


4990 


AY042871 


Arabidopsis thaliana zinc finger-like protein (F3C22_200) mRNA, 
complete cds 


0.077 


4991 


AF2 14529 


Homo sapiens dystrophin (DMD) gene, intron 51, partial sequence 


0.001 


4992 


AF269319 


Staphylococcus epidermidis strain SRI clone step.l002a02 genomic 
sequence 


2 


4993 


AB009801 


Homo sapiens gene for osteonidogen, intron 2 


0.69 


4994 


M69146 


Torulopsis glabrata metal-activated transcription factor (amtl) gene, 
complete cds 


0.65 


4995 


AL121822 


Human DNA sequence from clone 346P1 1 on chromosome Xq21.2- 
21.33, complete sequence [Homo sapiens] 


5E-09 


4996 


AF072273 


Lactuca sativa resistance protein candidate RGC2N (RGC2N) 
pseudogene, complete sequence 


0.22 


4997 


AL358273 


S.pombe chromosome II cosmid c29C10 


0.22 


4998 


BC010011 


Homo sapiens, Similar to RIKEN cDNA 493 1428D14 gene, clone 
MGC: 15407 IMAGE:4309613, mRNA, complete cds 


0.22 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 






P VALUE 


4999 


AF090889 


Homo sapiens clone HQ0092 


0.71 


5000 


M93140 


Glycine max cv Prize protein kinase mRNA 


2.1 


5001 


AE005266 


cscnencnia con uid /.h/ fciJL,933 genome, contig 1 of 3, section 90 
of 155 


1.8 


5002 


AE007350 


Streptococcus pneumoniae section 33 of 194 of the complete genome 


0.23 


5003 


BC008973 


Homo sapiens, transmembrane protein with EGF-like and two 
follistatin-like domains 2, clone MGC: 17007 IMAGE:4180709, 


0.023 


5004 


X13439 


Cochliobolus heterostrophus mitochondrial gene for ATPase subunit 6 


0.19 


5005 


AF138069 


ni v-i pjtuzo-n ironi uoa envelope glycoprotein (env) gene, partial 
cds 


2.1 


5006 


NM_002847 


riomo sapiens protein ryrosme pnospnatase, receptor type, N 
polypeptide 2 (PTPRN2), mRNA 


6 


5007 




Homo sapiens hypothetical gene supported by X04201; X04588; 
AK026559; AY004867; BC000771; BC008407; BC008425 


1 


5008 




iviub mubcuius riDonuciease/angiogemn inniDitor 2 (KnJiZ), mRNA 


1.7 


5009 




Homo sapiens, Similar to nuclear protein, clone IMAGE:4248516, 

11 LEVI r\ 


5.4 


5010 




^vyieiia iasucuosa va^c, section oi Zzy oi trie complete genome 


5.1 


5011 


XJM Oil IRQ 


ivuis muscuius n do nuc i ea se/ angi ogeni n inniDitor 2 (Rniiz), mRNA 


1.7 


5012 


YU 01674.7 


Homo sapiens latent transforming growth factor beta binding protein 


2c.-l\) 


5013 


iVIO JJV/t 


Schistosoma mansoni homeodomain protein (smox-5) mRNA, 


U.UU9 


5014 




oacciictruinyccs ccrcvisiae z micron circle piasmiu, complete sequence 


0.65 


5015 




Drosophila melanogaster rad21 mitotic cohesin (drad21) gene, exons 
j uuuugii o oiiti lAJiupieie cos 


0.003 


5016 


XM_052430 


Homo sapiens E3 ubiquitin ligase SMURF1 (SMURF1), mRNA 


2E-30 


5017 


yfi7R i 


v^.iaiiiiiiaiia oxvr / z. iiuuNA ior signal recognition particle 


0. lo 


5018 


AF01 1 R4R 
ru v/ j io*to 


Dermacentor marginatus 12S ribosomal RNA gene, mitochondrial 
^ciic iur nil Luv/iiuiiui i<u ivLNrv, pariiai sequence 


O.Ool 


5019 


XM 003317 


Homo sapiens PDZ domain containing guanine nucleotide exchange 
factor(GEF)l (PDZ-GEF1), mRNA 


0.67 


5020 


XM 030673 


Homo sapiens methyl-CpG binding domain protein 4 (MBD4), 
mRNA 


0.6 


5021 


L38957 1 


Saccharomyces cerevisiae nuclear-encoded mitochondrial isoleucyl- 
tRNA synthetase (ISM1) gene, complete cds 


2.1 


5022 


] 

AF345089 \ 


Hepatitis C virus isolate SCpreSC2cl4 nonfunctional polyprotein 
gene, partial sequence 


0.59 
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SEQID 




NO 


I ACCESSN 


5023 


AC091124 


5024 


X66451 


5025 


XMJ) 10563 


5026 


AF256722 


5027 1 


AF385609 


5028 


AF355103 


5029 


XM_034863 


5030 1 


AF374005 



5031 AK014447 









Ly. 


5032 


AL355577 


i |J 
IJ 


5033 


AB030616 


CO 


5034 


S66671 


CO 


5035 


AB038481 


in 


5036 


U05013 


Hi 


5037 


L81904 




5038 


AF242384 




1 5039 1 


X52863 



5043 
5044 
5045 



Table 3A Nearest Neighbor (BlastN vs. Genbank^ 

, - DESCRIP 

Caenornapoms elegans cosmid Y47A7, complete 

s E uST inaiUS 8Cne f ° r mA I second largest 

Homo sapiens hypothetical gene supported by AB033071- AB051480 1 
AK000726; NM 013383 (LOC92396V mRTsTA ^ AB05I480, | 

Salmo salar clone BHMS458 microsatellite seo^cT " 
Homo sapiens minisatellite sequence" " 

Pleurotus ostreatus linear rmtrchondrial plasmid ml P 2, partial 
| sequence 

[Rattus norvegicus bradykinin B2 re ceptor gene, promoter se n,.^ 
Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library 
clone:3830421F13, full insert sequent* 

Human DNA sequence from clone RP1 1-299G2 on chromosome 6 
Contains a G SS, complete sequence [Homo saoiensl 
M " s m«sculus Chetk-alpha gene for choline/ethanolantine kin^T 
alpha, exon 1 and 5'-flanking region 

pol [simian Mason-Pfizer retrovirus SM-PDT, systemic lupus 

erythematosu s patient TFA 68, Genomic RNA, 1 13 ntl 
Codium lucasii chloroplast rbcL gene tor ribulose-l,5-bisphosphate 
carboxylase/oxygenase large subunit, partial cds 

IRattus norvegicus Sprague-Dawley heme oxygenase-2 non-reducing 

lisoform gene, complete cds 

Homo sapiens (subclone l_h7 from PI H69) DNA sequence, complete! 

I SCCJUCI1C6 ■ 



Homo sapiens p hytanoyl-CoA hydroxylase (PHYH) gene exon 7 
G.max Gy4 gene for glycinin ~ ~ 

|Homo sapiens ankyrin repeat and SOCS box-containing 3 (ASB3) 
mRNA 

Mus musculus Dp71 (DMD) gene, partial cdT 
Oryza saliva gene for mitochondrial ribosomal portein S14, succinate 
aenydrogenase iron-protein subunit (SD HB'> 

-rTniH 8 ! ■ r^" ^^^ «*tkm *<> of iso of the ^ r 

XM_048782 Homo sapiens c6. 1 A (C6. 1 A), mRNA - 

AB055263 iMacaca fascicularis brain cDNA, clone:<MA^067T 



_ XMJ)0262 5 
U00789 

AB017429 



VALUE 
1.7 

0.066 



5046 S65281 



5047 
5048 



drebrin {clone eDcg5} [chickens. Ge nomic, 160 nt segm ent 3 of 10] 
AF296203 E^"' ^<--» 16S ribosoma l RNA gene, partial 
X06266 l Anthirrhinum majus DNA for transposon ta m2" 

399 



8.3 



0.18 



0.002 
0.2 
0.003 

0.61 

0.25 
0.007 



|SEQID| 
NO 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



DESCRIP 



P VALUE 



5049 
5050 



AF272387 Homo sapiens leucine zipper nuclear factor (BLZF1) gene, exon 6 



NC_001587 [Human papillomavirus type 34, complete genome 



7E-22 



0.25 



5051 NM_017078 Rattus norvegicus Acetylcholine receptor alpha 5 (Chrna5). mRNA 0.64 



5052 
5053 



XMJ)30510 Homo sapiens microfibrillar-associated protein 2 (MFAP2). mRNA 
Z48433 iL.odomensis vicilin pseudogene 



5054 M21084 



P.hybrida 5-enolpyruvylshikimate 3-phosphate synthase mRNA, 
[complete cds 



0.047 



0.2 



5055 U67518 Methanococcusjannaschii section 60 of 150 of the complete genome | 0.002 



5056 AF2 15555 



jBlakea trinervia NADH dehydrogenase subunit F (ndhF) gene, partial 
cds; chloroplast gene for chloroplast product 



5057 AE001379 



Plasmodium falciparum chromosome 2, section 16 of 73 of the 
| complete sequence 



0.63 



ru 



5059 
5060 



Candida albicans heat shock protein Hspl04 (HSP104) gene HSP104- 
AF362391 |b allele, complete cds 



5.4 



AJ000056 jHomo sapiens tyrosine aminotransferase gene, 5' flanking region 



4.4 



5061 AF1 12541 



HIV-1 isolate 08108v3 from USA, envelope glycoprotein (env) gene, 
I partial cds 



0.57 



0.007 



5062 
5063 



AF300534 
J00571 



Diadasia consociata cytochrome oxidase subunit I gene, partial cds; 
tRNA-Leu gene, complete sequence; and cytochrome oxidase subunit 
II gene, partial cds; mitochondrial genes for mitochondrial products 



5064 
5065 



X95275 
Y09746 



Mouse Ig kappa unproductively rearranged V-T2; V-J region 



0.22 



P.falciparum complete gene map of plastid-like DNA (IR-A) 



0.001 



5066 X95536 [Rsapiens earl gene 



|H.oligactis mRNA for heat shock protein 70 



0.062 



0.46 



5067 
5068 



D 10997 
273968 



jC.ellipsoidea rbcL, rpsl4, trnM, trnG, trnD, tmS, rps4, atpE, atpB 
[genes, complete cds 



2E-64 



5069 
5070 



XMJH2733 
AE000745 



jCaenorhabditis elegans cosmid C05A2, complete sequence 



1.8 



[Homo sapiens KIAA1012 protein (KIAA1012), mRNA 



0.67 



Aquifex aeolicus section 77 of 109 of the complete genome 



0.24 



5071 AF208521 



jDiploptera punctata high-affinity Na+-dependent glutamate 
transporter (EAAT1) mRNA, complete cds 



0.21 



5072 
5073 



XM 032656 
Z28063 



iHomo sapiens mannosidase, alpha, class 2B, member 1 (MAN2B1) 
mRNA 



0.08 



5074 



IS.cerevisiae chromosome XI reading frame ORF YKL063c" 



1.8 



XM_050445 [Homo sapiens KIAA0164 gene product (KIAA0164), mRNA 



6.2 



0.66 



Pectobacterium chrysanthemi ArgG (argG) gene, partial cds; 
regulatory protein PecS (pecS), regulatory protein PecM (pecM), 
indigoidine systhesis protein IdgA (idgA), and indigoidine systhesis 
AF2652 1 1 |protein IdgB (idgB) genes, complete cds, and indigoidine sys> | 0.72 



5075 



400 



SEQH)] 
NO 



50761x8720 I 



Table 3A N earest Neighbor ( BlastN vs. Ge-nh^ 



ACCESSN | DESCRIP 

° rfA - b> q p » f - 

MOBP^TH^t" 355 ^ ^^"^c basic proteins 
AFl 9047. I^f 170 J"* MOBP69 (MobP) gene, exons 3 through 5 and 
AF120475 c omplete cds, alternatively SD iiced K 

AF270407 ££2^"" eP1QemUdlS ^ *" ^ "fr 40 *** 
AB049420 "x^^ 



U47138 



iu 



in 



5084 
5085 



AF009415 

AE00139 5 

U62542 

AL445527 
XM 016052 

XM 03778 1 

X96932 
AJ006986 
M60858 

AK026299 



5091 


AC006025 


5092 


X03100" 


5093 


AF181671" 


5094 


AF356600 


5095 


I U36619 


5096 


AF243117 


5097 


XM 050127 


5098 


AJ251892 


5099 


AF036696 


5100 


AF052294 


5101 


AK026741 


5102 1 


K02212 



SI S T OSUS l h0lme P^tive regulatory 

pro em (cudC), glycine betaine aldehyde dehydrogenase (cudA) and 
choline dehydrogenase (cudB) genes, comm.,. JL 

Plasmodium lalciparum chromosome 2, section 32 of 73 of the 
complete sequence 

|Drosophila melanogaster dead ringer (dead nnger) mRNA, complete 

Human DNA sequence from clone KP1 1-1 15C2 on chromosome 1 
complete sequence fHomo sapiensl 
Homo sapiens LOC86766 (LOC86766). mRNA 
|HomoMpiens hypothetical gene supported by J04178 (LOC91343>; 

IN.tabacum gene encoding ascorb ate oxidase-relatedW ^T 
Streptococcus pneumoniae type 33 F DNA, capsular e ^h^ 
Human nucleolin gene, complete cd s 
Homo sapien s cDNA: FLJ22646 fis, clone HSI07m 

Homo ^Piens PAC clone RP5-1188N21 from 7ql 1 23-q21 1 
complete sequence ' 
Human HLA-SB(DP) alpha gene 

[Homo sapiens clone /7jl3 polvmorD hic microsatellit e seo"^ 
EomptetecT 1 dlSCOideum filament-interacting protein (fip) mRNA, 

tZZT^^ f ™~™ ^ (CUB-NT^ 
Homo sapiens oligophrenin 1 (OPHN1) . mRNA 

iDrosophila melanogaster mRNA for LIM domain protein (espinas 

[ gene j 

Caenorhabditis elegans cosmid F15B 10 

Mus musculus phospholipase D2 gene, exons 13 through 25 and 
complete cds 

[Homo sapiens c DNA: FLJ23088 lis, clone LNG07026 
Human alpna-i -antitrypsin gene (S variant), comol^k 

401 



P VALU E 
0.076 



0.072 
0.2 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 








NO 


ACCESSN 


DESCRIP 


P VALUE 


5103 

5104 
5105 
5106 


AB055370 

AK008762 

M71245 
AY007089 


iMacaca fascicularis brain cDNA, clone:QflA- 12708 

jMus musculus adult male stomach cDNA, RDCEN full-length 

enriched library, clone:2210021I22, full insert sequence 
ixvai piubiaiciii siiDmiU gene, complete cds 

Homo sapiens clone TCCCIA001 10 mRNA sequence 


0.079 

2.1 
0.69 
0.000009 


5107 
5108 

5109 
5110 


AF1 13007 
AF125461 

AF016179 
AK021604 


IHomo sapiens PRO0066 mRNA, complete cds 
ICaenorhabditis elegans cosmid Y8A9A, complete sequence 
jRattus norvegicus putative pheromone receptor (Go-VN2) mRNA, 
(complete cds 

IHomo sapiens cDNA FLJ1 1542 fis, clone HEMBA1002801 


0.0007 

1 O AO | 

0.071 
0.026 


5111 
5112 
5113 


Z73968 
U40995 
NC 001728 


ICaenorhabditis elegans cosmid C05A2, complete sequence 

[Danio rerio Hox-al gene, complete cds 
lOdontoglossum nngspot virus, complete genome 


0.54 
0.009 
1 0.027 1 



W 
fU 



O 
to 



2056.P10.gz4 1 



gi|8778469| 
gb|AAF794 

|77.1|AC02[ 
2492_21 

|(AC022492| 
) F1L3.30 

I [Arabidopsl 



5115 


AF234932 


-1 - 877846S 

chtenopteiyx sicula clone 9 actin gene, partial cds 


*| is thaliana] | 
0.24 


5116 


M60858 


1 Human nucleolin gene, complete cds 


0.66 


5117 


XM_003100 


Homo sapiens hypothetical protein FLJ1 1 142 (FLJ1 1 142) mRNA 


0.23 


5118 
5119 


AF129278 
AF283665 


iBuchnera aphidicola natural-host Diuraphis noxia gluconate-6- 
phosphate dehydrogenase (gnd) gene, partial cds 
IHeterodera glycines guanylyl cyclase mRNA, complete cds 


6.4 


5120 
5121 


AJ005205 
AF135472 


IHomo sapiens 5HT3 gene for serotonin 3 receptor 

1 Clostridium beijennckii Rep protein gene, complete cds 


5.4 
0.24 


5122 


XM_036007 


IHomo sapiens transmembrane protein 1 (TMEM1) mRNA 


0.23 
2.1 


5123 1 


AL031426 


Human DNA sequence from clone CTA-191D12 on chromosome 1 
22ql3.1 Contains two exons of the APOL2 gene for apolipoprotein L 
2, ESTs and GSSs, complete sequence fHomo sapiens] | 


0.000001 


5124 


XM 049821 


Homo sapiens hypothetical protein MGC14832 (MGC14832), mRNA | 


5.4 


5125 


XM 028344 


IHomo sapiens nbosomal protein L5 (RPL5) mRNA 


0.16 


5126 


AF0 16653 


Caenorhabditis elegans cosmid C41D7, complete sequence | 


0.7 


5127 


Z96752 


|H.sapiens telomenc DNA sequence, clone 8QTEL025 read 
8QTELOO025.seq ' | 


0.006 


1 5128 | 


M61883 


Sus scrofa (pig) apomucm mRNA, 3* end 


0.026 I 
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|SEQ JD\ 
NO 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



DESCRIP 



5129 AY005107 



Pentachondra pumila specimen-voucher Cherry 96/1 maturase (matK) 
gene, partial cds; chloroplast gene for chloroplast product 



P VALUE 



5130 
5131 



AF195044 
AK021963 



Homo sapiens beaded filament component protein (CP49) gene, 
| partial cds 



2.2 



Homo sapiens cDNA FLJ1 1901 fis, clone HEMBA1007347 



0.57 



0.0005 



5132 AF034085 ICaenorhabditis elegans UNC-45 (unc-45) gene, complete cds | O.OOOf 



5133 
5134 



U91929 
NM 010781 



Mus musculus LI neural cell adhesion protein gene, 5' sequence and 
exon 1 



Mus musculus mast cell protease 6 (Mcpt6), mRNA 



2.1 



5135 AB017918 



Mus musculus gene for gross cystic disease fluid protein 15, complete 
cds 



0.68 



5136 
5137 



U55243 
AF061296 



Dictyostelium discoideum glyceraldehyde-3-phosphate dehydrogenase 
l(GAPDH) gene, partial cds 1 



0.001 



jHomo sapiens sulfonylurea receptor (SUR2) gene, exon 9 



0.24 



9E-45 



5138 AL050314 



[Human DNA sequence from clone 100G10 on chromosome 22ql3.3 1- 
13.33. Contains GSSs, complete sequence [Homo sapiensl 



5139 XM 036394 



Homo sapiens calcium/calmodulin-dependent protein kinase IV 
|(CAMK4), mRNA 



6.4 



5140 I XM_047419 [Homo sapiens LQC87974 (LOC87974), mRNA 



0.003 



5141 I AJ4043 14 1 Amphisorus hemprichii 18S rRNA gene, isolate 720 



0.21 



5142 AF324889 



Homo sapiens myosin phosphatase target subunit 2 (MYPT2) gene, 
lexon 1 



5143 J04434 



[Bacteriophage PBS2 (from B.subtilis) uracil-DNA glycosylase 
inhibitor gene, complete cds 



0.71 



0.23 



5144 XM 041213 



Homo sapiens inter-alpha (globulin) inhibitor, HI polypeptide 
|(ITIH1), mRNA 



0.61 



5.5 



5145 AF187881 Homo s apiens leukointegrin alpha d gene, promoter and partia l cds | 0.026 

5146 Z70180 iP.quinquangular is chloroplast matK eene 

5147 1 1 " — 



AK025364 [Homo sapiens cDNA: FLJ2171 1 fis, clone COL10156 



5148 XM 045594 



Homo sapiens hypothetical protein MGC4816 (MGC4816), mRNA 



5E-82 



5150 



AB029068 Mus musculus gene for Ser/Thr kinase KKIAMRE, exon 1, 2, 3, 4, 5 



5151 AF2 15555 



iBIakea trinervia NADH dehydrogenase subunit F (ndhF) gene, partial 
cds; chloroplast gene for chloroplast product 



0.63 



0.71 



5152 



5153 



XM_026967 Homo sapiens hypothetical protei n MGC2668 (MGC2668) 

MTTVMi irninionr^+n T5 1 7— 7- — 



M32328 jC.pipiens esterase Bl gene, completecds 



mRNA 



1.9 



5 1 54 I U58744 Caenorhabditis elegans cosmid F55F1 



0.008 



5155 



Escherichia coli AgaR (agaR), KbaZ (kbaZ), AgaV (agaV), AgaW 
(agaW), AgaE (agaE), AgaF (agaF), AgaA (agaA), AgaS (agaS) 
KbaY (kbaY), AgaB (agaB), AgaC (agaC), AgaD (agaD), and Agal 
AF228498 |(agal) genes, complete cds 



0.43 



0.72 



403 



SEQ ID) 




NO 


ACCESSN 


5156 


L36898 


5158 


D88262 


5159 


AKO 18080 


5160 


X95275 


5161 


D16417 



Table 3A Ne arest Neighbor (BlastN vs. Genbank) 

DESCRIP ^ 

jSaccharomyces cerevisiae mitochondrion transfer RNA-Glu (tRNA- 
Glu) gene 

Pisum sativum PsCHS5 gene for chalcone synthase, complete cds 
Mus musculus 1 1 days embryo head cDNA, RKEN full-length 
enriched libra ry, clone:6230403H02, full insert sequence 
P.falciparum complete gene map of plastid-like DNA (IR- A) 
jDictyostelium discoideum mRNA~ ~ 



5162 AF267204 



Candidatus Carsonella ruddii natural-host Cacopsylla brunneipennis 
ATP synthase alpha subunit (atpA) gene, partial cds; ATP synthase 
gamma subunit (atpG) gene, complete cds; and ATP synthase beta 
subunit (atpD) gene, partial cds 



5163 
5164 



AF264650 
AF003143 



Asterella tenella trnL gene, intron sequence; chloroplast gene for 
chloroplast product 

Caenorhabditis elegans cosmid C53H9, complete sequent" 



iy 

CIS S 

m 
Q 



5165 
5166 



M19459 
AK001362 



5167 
5168 



AF170023 
AJ012333 



I Plasmodium falciparum interspersed repetitive DNA and an open 
reading frame 

IHomo sapiens cDNA FLJ10500 fis, clone NT2RP2000369 



J Anopheles gambiae clone 138 genomic sequence 
jBuchnera aphidicola plasmid pBTc2, trpE and trpG geireT 



5169 AF158077 



I m. ~ r 1 M-^ r-W ^VIlWl} 

Pisacha naga 16S ribosomal RNA gene, partial sequence; 
mitochondrial gene for mitochondrial product 



5170 
5171 



U68721 
AK026372 



Botryotinia fiickeliana endopolygalacturonase 5 (BcPGA5) gene, 
| complete cds 

Homo sapiens cDNA: FLJ22719 fis, clone HSI14307 



5172 
5173 



NM_031130 
AY034470 



Rattus norvegicus nuclear receptor subfamily 2, group F member 1 

(Nr2fl), mRNA 

IHomo sapiens clone BGL2 mRNA sequence 



5174 
5175 



AF108841 
AJ251957 



Homo sapiens human endogenous retrovirus HERV-H10 pol protein 
|(pol) gene, partial cds; env pseudogene and 3* LTR, complete 
sequence 

Ciona intestinalis mRNA for nuclear lamin (lamin L2 gerao" 



5177 AF177983 



5178 AF181632 



Homo sapiens NAD+-dependent 15-hydroxyprostaglandin 
dehydrogenase (PGDH) gene, promoter, exons 1 and 2 and 

iDrosophila melanogaster BcDNA.GH05095 (BcDNA.GH05095) 

| mRNA, complete cds 



5179 



Mus musculus 10 days neonate cerebellum cDNA, RKEN full-length 
AK018339 enriched library, clone:6530414F19, fall insert sequence 



5181 



Medicago truncatula partial mRNA for nodulin 25 (nod25 gene) 
AJ308588 clone 3 6 



6.4 
0.022 
0.009 



6.4 



0.23 



0.008 



0.23 



0.006 



0.019 



0.21 



0.064 



0.009 



1.9 



Homo sapiens a disintegrin and metalloproteinase domain 33 
.5176 I XM_045437 [(ADAM33), mRNA 



0.71 



partial cds I 9.9 



0.23 



2.1 



ci on i H ° m ° ^ picns win ^ less ' t yP e MMTV integration site family, member 

5180 XM 050838 |10A (WNT10A), mRNA 



0.22 



4.3 



404 



|SEQID 

NO I ACCESSN 



Tabic 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



5182 



Homo sapiens cis-golgi SNARE (GOSR1) gene, exon 9 and partial 
AF229797 cds 



5185 



Z68107 [Caenorhabditis elegans cosmid F48C5, complete sequence 



5186 



Mus musculus germ cell-specific unknown protein mRNA, complete 
AY026045 cds 



5187 J A F309515 Staphylococcus aureus serine protease operon, complete sequence 
jPIeurotus ostreatus linear mitochondrial plasmid mlp2, partial 
AF355 103 [sequence 



5188 



5192 J XM_003320 Homo sapiens hypothetical protein FLJ1 1 155 (FLJ1 1 155), mRNA 
[Legionella fallonii macrophage infectivity potentiator (mip) gene 

5193 | AF148987 | partial cds ' 
5194 



X54200 |Chicken mRNA for GARS-AIRS-GART 



5196 
5197 



U09859 Bos taurus enterokinase mRNA, complete cds 



5198 
5199 



Populus balsamifera subsp. trichocarpa PTD protein (PTD) gene, 
AF057708 {complete cds 



AF238381 Homo sapiens PTOV1 (PTQV1) gene, complete cds 



5200 
5201 



Plasmodium falciparum para-aminobenzoic acid synthetase gene, 
AF1 19554 [complete cds 



X82675 |B.taurus BoLA-Al 1 gene (exon 6-8) 



5203 
5204 



Homo sapiens serologically defined colon cancer antigen 16 
NM_006649 |(SPCCAG16), mRNA 



AF240597 Homo sapiens clone 7ptelcl lt7 sequence 



5205 
5206 



Mus musculus, RKEN cDNA 4930529008 gene, clone MGC:7935 
BC006702 |lMAGE:3583936, mRNA, complete cds 



AF295769 Mus musculus SMAD1 gene, intron 1, partial sequence 



5207 AF1 18099 



Toxoplasma gondii PITSLRE-like protein kinase (tpkl) gene, partial 
cds 



5208 AF143952 I Homo sapiens PELOTA (PELOT A) gene, complete cds 



5209 AF028710 



Influenza A virus (A/HongKong/156/97(H5Nl)) nucleoprotein 
| mRNA, complete cds 



5210 | XMJ) 17097 | Homo sapiens active BCR-related gene (ABR), mRNA 



P VALUE 



2.1 



0.079 



0.069 



5.2 



Panulirus argus inositol 1 ,4,5-trisphosphate receptor (IP3R) mRNA 

5189 I AF055079 complete cds 

Arabidopsis thaliana chromosome II section 163 of 255 of the 

5190 I AC007184 complete sequence. Sequence from clones Tl 1P1 1 I 6.1 

5191 | AB044348 [Arabidopsis thaliana AtSUGl mRNA, complete cds TT 



5.3 



0.067 
0.019 



5195 J AE005885 Caulobacter crescentus section 211 of 359 of the complete genome | 0.62 
Escherichia coli AgaR (agaR), KbaZ (kbaZ), AgaV (agaV), AgaW 
(agaW), AgaE (agaE), AgaF (agaF), AgaA (agaA), AgaS (agaS), 
KbaY (kbaY), AgaB (agaB), AgaC (agaC), AgaD (agaD), and Agal 
AF228498 |(agal) genes, complet e cds J 0 63 



6.2 



0.68 



0.18 



0.13 



0.71 



5202 AF303087 Tilapia mossambica glycoprotein alpha subunit mRNA, partial cds 0.64 



1.7 



e-112 



6.3 



0.007 



1.9 



0.44 



1.7 



0.7 



405 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



|SEQID 

NO | ACCESSN 



DESCRIP 



5211 
5212 



M33312 
L05616 



Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds 



Dictyostelium purpureum (Dpp4) DNA sequence, repeat region 



5213 AE001425 



Plasmodium falciparum chromosome 2, section 62 of 73 of the 
complete sequence 



5214 XM_006574 

5215 I XM 011195 



Homo sapiens similar to plakophilin 2 (H. sapiens) (LOC93271) 
mRNA 



Homo sapiens centromere protein E (312kD) (CENPE), mRNA 



5216 AF181881 



Helicobacter pylori TolB (tolB) and PAL (excC) genes, complete cds 



5217 



U33208 



Human B-Iymphocyte activation antigen (B7. 1) gene, enhancer region 
and5'UTR 



5218 AB052187 



Macaca fascicularis brain cDNA, clone.QnpA-12170 



5219 AY032741 



Drosophila melanogaster H.M.S. Beagle transposon long terminal 
repeat, complete sequence; and heat shock protein Hsp70Ab gene, 
promoter and partial cds 



5220 AJ276592 



Pisum sativum lipl mutant copl gene for constitutively 
jhotomorphogenic 1 protein, exons 1-19 



5221 XM 045136 



Homo sapiens similar to cadherin related 23 (H. sapiens) 
(LOC92444), mRNA 



5222 AK014534 



Mus musculus 0 day neonate skin cDNA, RIKEN full-length enriched 
library, clone:4631424J17, full insert sequence 



5223 



U67572 



Methanococcus jannaschii section 114 of 150 of the complete genome 



5224 AK026607 



Homo sapiens cDNA: FLJ22954 fis, clone KAT09813, highly similar 
to AF010315 Homo sapiens Pigl 1 (PIG1 1) mRNA 



5225 



Y08580 



F.rubripes hsp70-3 gene, 3'UTR 



5226 AL359819 



Human DNA sequence from clone RP1 1-109B17 on chromosome 1, 
complete sequence [Homo sapiens] 



5227 AF1 18099 



Toxoplasma gondii PITSLRE-like protein kinase (tpkl) gene, partial 
cds 



5228 



U48937 



Homo sapiens amiloride-sensitive epithelial sodium channel gamma 
subunit gene, promoter region and exon 1 



5229 AK017057 



Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4933434I20, full insert sequence 



5230 AF044770 



Phaeoprogne tapera microsatellite HrU6 allele 2 repeat region 



5231 XM 012268 



Homo sapiens amino acid transporter 2 (KIAA1382), mRNA 



5232 BC008544 



Mus musculus, Similar to hypothetical protein, clone MGC.7703 
IMAGE:3497634, mRNA, complete cds 



5233 AB046353 



Sapporo-like virus Yak2-2000-jp pol gene for RNA-dependent RNA 
polymerase, partial cds 



5234 



L07305 



Histoplasma capsulatum (clone pMS3) H-ATPase gene, complete cds 



5235 NM 012828 



Rattus norvegicus Calcium channel subunit beta 3 (Cacnb3), mRNA 



5236 AF277959 



Helicobacter pylori strain 96-74 BabB (babB) gene, partial cds 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQIE 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


5237 


Z16681 


H. sapiens (D8S266) DNA segment containing (CA) repeat; clone 
AFM151ye3; single read 


c\ 00*2 


5238 


AB049452 


Staphylococcus epidermidis composite transposon DNA, complete 
sequence 


vj.v) 1 / 


5239 


X83673 


X.laevis SSB1 gene 


1 c 

L.J 


5240 


XM_045182 


Homo sapiens serine (or cysteine) proteinase inhibitor, clade F (alpha- 
2 antiplasmiiL pigment eDithelium derived fartnrt m<=»mtv*r i 
(SERPINF2), mRNA 


o.z 


5241 


AF263074 


Cobitis paludica cytochrome b eene nartia1 cd«s* mitnrfinnrTriai opup 

* J iu viuwmv v gwuv, yjixi. ucii VfUO, lmiU^UUULUlal gene 

for mitochondrial product 


0.15 


5242 


XM 029785 


Homo sapiens Kell blood group (KEL), mRNA 


9 1 


5243 


AE006148 


Pasteurella multocida PM70 section 115 of 204 of the mmniete 

^ ■* * v X X V/X ^\J^T \JX vAly* vUlllUlCLv 

genome 


5.5 


5244 


X81975 


B. vulgaris mRNA for sucrose 6 -phosphate synthase 


1 1 

i.j 


5245 


NM 033105 


Homo sapiens beta cysteine string protein (LOC85479), mRNA 


0.000006 


5246 


AC006164 


Homo sapiens clone UWGC:v28eao from 6d21 comolete sennence 


0 4.7 


5247 


AJ279971 


Cams familiaris microsatellite DNA, clone DTRcn.20 




5248 


AE005618 


Escherichia coli 0157:H7 EDL933 eenome cornier ^ nf% certinn 
of 290 




5249 


AJ012338 


Listeria monocytogenes NotI restriction qite mimher twn anH fi^nVino 
sequences 


1.6 


5251 


D 14027 


Mouse DNA for TL antigen, complete cds 


j 


5252 


Y12285 


H.armigera mRNA for putative serine protease, clone SR106 




5253 


AF281912 


Homo sapiens clone 16qtel_c89bt7 sequence 


1E-68 


5254 


NC 001790 


Ovine papillomavirus 2, complete genome 


0.61 


5255 


XM_031110 


Homo sapiens kallikrein 10 (KLK10), mRNA 




5256 


XM_042732 


Homo sapiens guanine nucleotide binding protein (G protein), alpha 
inhibiting activity DolvDeDtide 3 fGNAn^ mRNA 




5257 


XM 033541 


Homo sapiens KIAA0704 protein (ORP3), mRNA 


0.022 


5258 


AL5 12432 


Human DNA sequence from clone RP1 1-245H22 on chromosome 6 
Contains GSSs, complete sequence [Homo sapiens! 




5259 


S82508 


Human adenovirus type 9 E4 protein (E4), Orf2, Orf3, Orf4, and Orf6 
genes, complete cds; and Orf7 gene, partial cds 


1 f% 


5260 


XM 015579 ] 


Homo sapiens hypothetical protein FLJ13105 fFLJ13105^ mRNA 


1.2 


5261 


] 

AF009177 j 


Helicobacter pylori pfs protein homoloe fofs) and sigma 80 TrnoD^ 
genes, complete cds 


0.6 


5262 


a 

ABO 15470 ( 


Arabidopsis thaliana genomic DNA, chromosome 5, TAC 
:lone:KlB16 


5.2 




] 

L08970 ( 


Nematode choline acetyltransferase (cha-1) gene, exons 1-11 and 
complete cds 


0.29 


5265 


D88193 1 


3rassica rapa DNA for S-receptor kinase, complete cds 


0.69 


5266 


XM 046294 ] 


iomo sapiens testis expressed sequence 15 (TEX 15), mRNA 


0.21 


5267 


XM 027078 I 


iomo sapiens ATPase, Class V, type 10D (ATP10D), mRNA 


0.22 
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NO 


> 
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P VALUE 


5268 


AKO 19309 


Mus musculus adult male hippocampus cDNA RIKEN full-leneth 
enriched library, clone:2900009I19, full insert sequence 


2E-13 


5269 


NM 016985 


Mus musculus myotubularin related protein 1 (Mtmrl), mRNA 


0.0004 


5270 


AK020049 


Mus musculus 13 days embiyo male testis cDNA, RIKEN full-length 
enriched library, clone:6030408K18, full insert sequence 


1.8 


5271 


AF323989 


Danio rerio transcriptional intermediary factor 2 mRNA, complete cds 


6.3 


5272 


AF175275 


Pisum sativum trans-cinnamic acid hydroxylase (CYP73 A9) gene, 
CYP73A9-vl allele, partial cds 


0.073 


5274 


AF327749 


Mus musculus smoothelin gene, promoter and partial sequence 


6E-11 


5275 


AJ276633 


Glomus mosseae partial TOR2 gene, exons 1-9 


6 


- 

5276 


AF110612 


Boophilus microplus cytochrome b aooenzvme f PvtM e&ne. nartial 
cds; tRNA-Ser, tRNA-Leu, tRNA-Cys, and tRNA-Met genes, 
complete sequence; and NADH dehydrogenase subunit 2 (ND2) gene, 
partial cds, mitochondrial genes for mitochondrial products 


0.072 


5278 


XM_043395 


Homo sapiens fatty acid desaturase 2 (FADS2), mRNA 


3E-36 


5279 


XM 046528 


Homo sapiens zinc finger protein 133 (clone pHZ-13) (ZNF133), 
mRNA 


2 


5280 


U20807 


Bos taurus protein tyrosine phosphatase BA14 mRNA, complete cds 


0.19 


5281 


XM 036368 


Homo sapiens hypothetical gene supported by AK026802 
(LOC91 132), mRNA 


0.026 


5282 


XM 007023 


Homo sapiens G protein-coupled receptor kinase-interactor 2 (GIT2), 
mRNA 


7E-44 


5283 


U86362 


Saccharomyces cerevisiae MAL gene divergent promoter region, and 
hypothetical protein, complete cds 


0.24 


5284 


Y11740 


H.sapiens whn gene, exon la and lb 


0.22 


5285 


S78915 


{Mul element insertion site, clone 11} [maize, Transposon, 300 nt] 


0.002 


5286 


AF3 10896 


Dictyostelium discoideum RacJ (racJ) gene, complete cds; LagC-like 
protein (lagC3) gene, partial cds; and unknown genes 


0.23 


5287 


AJ012333 


Buchnera aphidicola plasmid pBTc2, trpE and trpG genes 


0.027 


5288 


AF282046 


Homo sapiens clone 15qtel_c321bt7 sequence 


4E-30 


5289 


L25128 


Ly copers icon esculentum auxin-induced proteinase inhibitor (ARPI) 
gene, complete cds 


0.51 


5290 


XM 049605 


Homo sapiens KIAA1467 protein (KIAA1467), mRNA 


2E-46 


5291 


AE002114 


Ureaplasma urealyticum section 15 of 59 of the complete genome 


0.054 


5292 


AF208678 


Aedes aegypti clone p502, Pony-Aa-B6 MITE repeat region 


3.9 


5293 


AK024656 


Homo sapiens cDNA: FLJ21003 fis, clone CAE03685 


5E-55 


5294 


NC_002355 ] 


Bombyx mori mitochondrion, complete genome 


2.1 


5295 


AK023898 ] 


Homo sapiens cDNA FLJ13836 fis, clone THYRO1000734 


0.008 


5296 


AE002342 


Chlamydia muridarum, section 70 of 85 of the complete genome 


1.9 
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Table 3A Nearest Neighbor (BlastN vs. Genbardrt 


SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


5297 


XM 032347 


Homo sapiens region containing TLS-associated serine-arginine 
protein 1; TLS-associated serine-arginine protein 2; TLS-associated 
serine-arginine protein 2; TLS-associated serine-arginine protein 2; 
TLS-associated serine-arginine protein 1; TLS-associa> 


0 


5298 


Z63116 


Ksapiens CpG island DNA genomic Msel fragment, clone 78e4, 
reverse read cpg78e4.rtla 


0.24 


5299 


AE006833 


Sulfolobus solfataricus section 192 of 272 of the complete genome 


6.4 


5300 


XM 032811 


Homo sapiens similar to ZINC FINGER PROTEIN 20 (ZINC 
FINGER PROTEIN KOX13) (DKFZP572P0920) (H. sapiens) 
(LOC90591),mRNA 


0.21 


5301 


AF202552 


Homo sapiens DNA methyltransferase (DNMT1) gene, exons 2, 3, 
and 4 


0.23 


5302 


U92963 


Pygathrix roxellana NADH dehydrogenase subunit 3 (ND3) gene, 
partial cds, tRNA-Arg gene, complete sequence, NADH 
dehydrogenase subunit 4L (ND4L) and NADH dehydrogenase subunit 
4 (ND4) genes, complete cds, and tRNA-His, tRNA-Ser and tRNA- 
Leu genes, comp> 


0.17 


5303 


Z99833 


Euglena deses chloroplast psbC gene: complete group III twintron, 
complete internal matl gene, partial 5' and 3* psbC exons 


4.7 


5304 


AB047962 


Macaca fascicularis brain cDNA, clone:QnpA-13041 


e-125 


5305 


AF159913 


Euplotes crassus transposon Tecl clone Tec 1-2 orf 2 and orf 3 
pseudogenes, complete sequence 


0.5 


5306 


Y15435 


Kluyveromyces lactis PDC1 gene, promoter region 


0.087 


5307 


U67506 


Methanococcus jannaschii section 48 of 150 of the complete genome 


1.4 


5308 


U73943 


Bacillus subtilis phosphofructokinase I (pfkl) gene, partial cds; and 
pyruvate kinase I gene, complete cds 


2.2 


5309 


U72027 


Bos taurus photoreceptor disk rim specific protein rom-1 (ROM1) 
mRNA, partial cds 


0.06 


5310 


X16509 


Rice alpha-amylase gene 


2.1 


5312 


AB049900 


Macaca fascicularis brain cDNA, clone:QnpA-19713 


0.01 


5313 


Z24756 


S.pombe rhp51 and rpal genes, complete CDS's 


0.003 


5314 


XM 002989 


Homo sapiens similar to hypothetical protein FLJ10546 (H. sapiens) 
(LOC93548), mRNA 


5E-62 


5316 


AL358272 


S.pombe chromosome I cosmid c458 


0.61 


5317 


XM_044123 ] 


Homo sapiens cadherin 20, type 2 (CDH20), mRNA 


4E-48 


5318 


] 

AF038866 i 


Bacteroides fragilis transposon Tn5520 transposase (bipH) and 
mobilization protein BmpH (bmpH) genes, complete cds 


0.73 


5319 


AF332577 ] 


tfomo sapiens prosomal P27K protein (PSMA6) gene, partial cds 


2E-51 


5320 


X68650 < 


D.cuniculus mRNA for ryanodine receptor 


0.55 


5321 


XM 038450 I 


lomo sapiens hypothetical protein FLJ20694 (FLJ20694), mRNA 


0 
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SEQIE 
NO 


> 

ACCESSN 
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P VALUE 


5322 


Z65960 


H.sapiens CpG island DNA genomic Msel fragment, clone 69d2, 
reverse read cpg69d2.rtlb 


1.7 


5323 


AF3 17661 


Candida albicans 60S acidic ribosomal protein type P2-A (p2A) gene, 
complete cds 


5.8 


5324 


AE001776 


Thermotoga mantima section 88 of 136 of the complete genome 


0.64 


5325 


NM_031987 


Rattus norvegicus carnitine octanoyltransferase (COT), mRNA 


0.62 


5326 


AB035500 


Rana rugosa gene for FTZ-F1, exon 1 


0.009 


5327 


U67490 


Methanococcus jannaschii section 32 of 150 of the complete genome 


0.2 


5328 


AB060000 


Glycine max mRNA for hypothetical protein, complete cds, 
clone:SSC2 


0.22 


5329 


U50840 


Borrelia garinii 70 kbp plasmid D6 protein gene, complete cds 


0.072 


533a 


U67535 


Methanococcus jannaschii section 77 of 150 of the complete genome 


2.1 


5331 


AK001125 


Homo sapiens cDNA FLJ10263 fis, clone HEMBB1000991 


3E-43 


5333 


AF029114 


Ascogaster sp. 16S ribosomal RNA gene, mitochondrial gene for 
mitochondrial RNA, partial sequence 


0.068 


5334 


AF320594 


Homo sapiens PKD1P4 pseudogene, exons 2 through 15 


6.3 


5335 


AY021494 


Oryza sativa microsatellite MRG3819 containing (TA)X27, genomic 
sequence 


0.024 


5336 


AF146651 


Homo sapiens glyoxalase-I gene, complete cds 


0.056 


5337 


AB054516 


Globicephala melas DNA, SINE flanking sequence Mago22 locus 


0.024 


5338 


AC001052 


Homo sapiens (subclone l_f6 from PI H56) DNA sequence, complete 
sequence 


0.003 


5339 


U00951 


Human clone A9A2BR11 (CAC)n/(GTG)n repeat-containing mRNA 


0.65 


5340 


AF298180 


Caenorhabditis elegans tropomyosin isoform IV mRNA, complete cds 


0.64 


5341 


XM_042336 


Homo sapiens hypothetical gene supported by AK025398 
(LOC92036), mRNA 


2.1 


5342 


L44121 


Homo sapiens (clone pHK2.1D) CMT1A gene, repeat unit 


0.0003 


5343 


AE005556 


Escherichia coli 0157:H7 EDL933 genome, contig 3 of 3, section 175 
of 290 


0.23 


5344 


AB035185 


Homo sapiens RHD gene, intron 9, complete sequence 


0.63 


5345 


AC013455 


Homo sapiens BAC clone CTD-2347014 from 7pl2-pl4, complete 
sequence 


0.23 


5346 


AJ234722 


Hordeum vulgare genomic DNA fragment; clone MWG203 l.rev 


0.24 


5347 


X79699 


H.sapiens ALU repeat, 230bp 


0.0003 


5348 


XM_050246 


Homo sapiens secretory protein SEC8; KIAA1699 protein (SEC8), 
mRNA 


0 




] 

AB063075 . 


Macaca fascicularis brain cDNA clone:QtrA-l 1888, full insert 
sequence 


2E-19 


5350 


] 

Z56909 ] 


H. sapiens CpG island DNA genomic Msel fragment, clone 152h9, 
reverse read cpgl52h9.rtla 


0.69 


5351 


L44121 1 


Homo sapiens (clone pHK2.1D) CMT1A gene, repeat unit 


0.069 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



Drosophila melanogaster genomic scaffold 142000013385665, 
5352 | AE002722 | complete sequence 



Stachyarrhena sp. Jansen-Jacobs 4707 ribosomal protein S16 (rpsl6) 
5353 I AF201021 gene, partial intron sequence; chloroplast gene for chloroplast product 



5354 Y17506 [Euplotes octocarinatus tRNA-Cys gene 



5356 XM_040513 Homo sapiens hypothetical protein FLJ14454 (FLJ14454), mRNA 



5357 



Mus musculus DREAM/calsenilin/KChIP3 gene, complete cds and 
AF287736 exons 3, 4 and 5, alternatively spliced 



Danio rerio PITP-less RdgB-like protein (plrdgB) mRNA, complete 
5358 | AF230807 cds 



5359 



U67563 Methanococcus jannaschii section 105 of 150 of the complete genome 



5360 AE000935 



Methanobacterium thermoautotrophicum from bases 1655364 to 
1666496 (section 141 of 148) of the complete genome 



Rattus norvegicus FAT tumor suppressor (Drosophila) homolog (Fat) 
5361 | NM 031819 mRNA 



5362 XMJ)05889 [Homo sapiens kinesin-like 1 (KNSL1), mRNA 



5363 AF339829 Homo sapiens clone IMAGE:609847, mRNA sequence 



Mus musculus circadian locomoter output cycles kaput (Clock) 
5364 | NM 007715 mRNA 



5365 XM 016117 



Homo sapiens hypothetical protein DKFZp586F1122 similar to 
laxotrophin (DKFZP586F1 122), mRNA 



Homo sapiens regulator of differentiation (in S. pombe) 1 (ROD1), 
5366 | XM 005554 mRNA 



5368 X763 1 1 H.sapiens endothelial nitric oxidase synthase gene, exons 15 and 16 



5369 XM 001738 



Homo sapiens proteoglycan 4, (megakaryocyte stimulating factor, 
articular superficial zone protein) (PRG4), mRNA 



5370 AE000935 



Methanobacterium thermoautotrophicum from bases 1655364 to 
1666496 (section 141 of 148) of the complete genome 



5371 I Y18011 iPhaedoncochleariae mRNA for chitinase 



5372 Z99768 [Flaveria trinervia gdcsP pseudogene 



5373 



AF342852 



Fusobacterium ulcerans strain NCTC 121 12 16S ribosomal RNA 
gene, partial sequence; internal transcribed spacer 1, complete 
sequence; and 23S ribosomal RNA gene, partial sequence 



5374 BC003397 



iHomo sapiens, hypothetical protein FLJ20505, clone MGC:4960 
IMAGE:3448518, mRNA, complete cds 



5375 AF385088 



Homo sapiens NOD2 (NOD2) gene, NOD2-2722G>C allele, exon 8 
|and partial cds 



5376 NC_000863 [Papio cynocephalus provirus, complete genome 



5377 AF121217 



Rattus norvegicus pro-alpha-2(I) collagen (coIla2) mRNA, complete 
I cds 



5378 BC009075 



I Mus musculus, beta-l,3-N-acetylglua)saminyltransferase 1, clone 
|MGC:6892 IMAGE:2654354, mRNA, complete cds 



P VALUE 



1.5 



0.072 



0.024 



0.7 



6.1 



0.24 



0.21 



5.2 



3.6 



0.022 



e-116 



6.3 



0.7 



0.63 



0.23 



0.021 



4.5 



0.18 



1E-69 



0.58 



0.22 



5.5 



1.8 
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|SEQID 
NO 



5379 



5380 



5381 



5383 



5384 



5385 



5386 



5387 



5388 



if! 



W 



Q 

: s S 



5389 



5390 



5391 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



DESCRIP 



P VALUE 



XM_001341 Homo sapiens DKFZP547E1010 protein (DKFZP547E1010), mRNA 3E-15 



XM_046408 [Homo sapiens KIAA1041 protein (KIAA1041), mRNA 



Mus musculus 6 days neonate skin cDNA, RIKEN full-length 
AK020701 enriched library, clone:A030009B12, full insert sequence 



0.52 



0.000001 



U3271 1 Haemophilus influenzae Rd section 26 of 163 of the complete genome | 0.021 
XM_046598 Homo sapiens DKFZP434K156 protein (DKFZP434K156), mRNA 0.58 



L43551 



Buchnera aphidicola anthranilate synthase large subunit (trpE) gene 
and anthranilate synthase small subunit (trpG) gene, complete cds 



XM 012233 Homo sapiens MDS023 protein (MDS023), mRNA 



AF139358 |Homo sapiens BUB1 protein gene, exon 17 



Mus musculus adult retina cDNA, RIKEN full-length enriched 
AK020916 library, clone: A930031L 14, full insert sequence 



Homo sapiens mRNA; cDNA DKFZp547E184 (from clone 
AL390170 |DKFZp547E184) 



AK02 1659 [Homo sapiens cDNA FLJ1 1597 fis, clone HEMBA1003856 



5392 



5393 



5394 



5395 



5396 



5397 



5398 



5399 



5400 



5401 



5402 



5403 



5404 



5406 



5407 



5408 



AK024200 [Homo sapiens cDNA FLJ14138 fis, clone MAMMA 1002765 



AJO 1 1727 Homo sapiens partial TOP2 beta gene, sequence G 



I Homo sapiens leukemia inhibitory factor receptor (LIFG) gene, 
AFO 18079 alternative promoter which is functional in non-placental tissues 



Macaca fascicularis brain cDNA clone:QtrA- 14876, full insert 
AB063085 | sequence 



Homo sapiens olfactory receptor HPFHIOR (HPFHIOR) gene, 
AF 154673 | complete cds 



275199 IS.cerevisiae chromosome XV reading frame ORF YOR291w 



Bombyx mori Pao-like retrotransposon Kamikaze DNA, similar to 
AB042120 Ipolyprotein coding region 



AF379854 Cloning vector pVLH/hsp, complete sequence 



XM_009648 [Homo sapiens HSPC072 protein (HSPC072), mRNA 



AB056333 [Macaca fascicularis brain cDNA, clone:QflA-13486" 



AK021824 [Homo sapiens cDNA FLJ1 1762 fis, clone HEMBA1005670 



Human DNA sequence from clone RP1 1-30M17 on chromosome 6, 
AL513010 | complete sequence [Homo sapiens] 



AK023753 [Homo sapiens cDNA FLJ13691 fis, clone PLACE2000100 " 



AKO 14452 



Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, 
|clone:3830422K02, full insert sequence 



XM_003220 | Homo sapiens polybromo 1 (PB1), mRNA" 



0.65 



0.6 



0.008 



1.5 



1.8 



0.21 



0.00003 



0.00003 



6.2 



0.026 



0.23 



AE007505 Streptococcus pneumoniae section 188 of 194 of the complete genome | 6.6 



0.081 



0.24 



0.0003 



0.022 



9E-14 



3E-46 



0.026 



0.077 



0.17 



1E-35 



Z78918 jHsapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA19C6 | 1.5 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 



DESCRIP 



5409 
5410 



L13289 
AC024203 



[Mycoplasma pirum purine phosphorylase gene, 3' end, 
deoxyriboaldolase, thymidine phosphorylase, cytidine deaminase, 
I phosphomannomutase genes, complete cds, and triose phosphate 
[isomerase gene, 5' end 



Caenorhabditis elegans cosmid Y72A2A, complete sequence 



5411 AF373586 



Homo sapiens translocation breakpoint MLLT1/MLL fusion gene, 
| partial sequence 



A.pleuropneumoniae tfbA gene for transferrin-binding protein (1949 
5412 I Z46774 Ibp) 



5413 AF3 19044 



Methanosarcina acetivorans strain C2A MtaF (mtaF) and MtaG 
|(mtaG) genes, complete cds 



5414 AE001933 



Deinococcus radiodurans Rl section 70 of 229 of the complete 
chromosome 1 



Human platelet-derived growth factor A type receptor mRNA, 
5415 | M22734 [complete cds 



5416 AF264650 



Asterella tenella trnL gene, intron sequence; chloroplast gene for 
| chloroplast product 



5417 AF378571 



Sambucus nigra clone VBL85 thaumatin-like protein (TLP1) mRNA, 
[complete cds 



5418 U53 183 Rattus norvegicus 250 kDa estrous-specific protein mRNA, partial cds 



5419 AK026673 (Homo sapiens cDNA: FLJ23020 fis, clone LNG00943 



5420 XM 041367 



Homo sapiens NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 
|4 (15kD, B15) (NDUFB4), mRNA 



5421 AY039939 



Arabidopsis thaliana unknown protein (F11F12.23) mRNA, complete 
cds 



5422 AF250135 



Mus musculus splicing factor Sc35 (Pr264) mRNA, partial cds, 
lalternatively spliced 



5423 M32328 |C.pipiens esterase Bl gene, complete cds 



5424 



U95369 



Bos taurus inward rectifier potassium channel BEK mRNA, complete 
teds 



5425 AE002157 



5426 



Ureaplasma urealyticum section 58 of 59 of the complete genome 
Homo sapiens clone 5qtel c25t3 seauence 



AF270633 [Homo sapiens clone 5qtel_c25t3 sequen ce 

Ic : t -*rr ^- - 1 



5427 Z28252 IS.cerevisiae chromosome XT reading frame ORF YKR027w 



5428 AE002601 



Drosophila melanogaster genomic scaffold 142000013385253, 
[complete sequence 



5429 AJ403990 [Hyla arborea microsatellite DNA, clone WHA1-61 



Rattus norvegicus protein tyrosine phosphatase, non-receptor type 5 
5430 | NMO 19253 |(Ptpn5), mRNA 



5431 AE006543 



I Streptococcus pyogenes Ml GAS strain SF370, section 72 of 167 of 
| the complete genome 



5432 AF146363 



Homo sapiens islet cell autoantigen ICA69 (ICA1) gene, exon 1 and 
| partial cds 



5433 M96943 [Human profllaggrin gene exons 1-3, 5' end 



5434 | AK025246 | Homo sapiens cDNA: FLJ2 1593 fis, clone CQL07050 



P VALUE 



0.06 



6.3 



0.54 



0.025 



0.44 



6.3 



0.59 



6.4 



1.6 



0.7 



0.24 



2E-10 



5.5 



2.1 



0.003 



0.009 



5.7 



9E-56 



0.18 



0.2 



3.8 



0.73 



0.61 



0.073 



0.066 



0.18 
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|seqid| 

NO 



ACCESSN 



5435 I XM 027835 

5437 AJ223441 

5438 AB048345 

5439 I AB049844 



5440 XM 036740 



Table 3A Neares t Neighbor (BlastN vs. Genbank) 

DESCRIP 

jHomo sapiens general transcription factor IIIC, polypeptide 4 (90kE» 
(GTF3C4), mRNA J 

Branchiostoma floridae Pax6 gene (AmphiPax6), clonelT 
lEquus caball us DNA, chromosome Xp21, microsatellite TKY3 9" 

Macaca fascicularis brain cDNA, clone:QnpA-18 434 

iHomo sapiens similar to nuclear pore membrane glycoprotein 210 (M 
musculus) (LOC91181), mRNA 



I P VALU E 

5.9 
0.24 
0.73 
2.2 



5441 AK0 19537 



Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4921505C17, full insert sequence 



5442 
5443 



AE003964 
X95276 



Xylella fastidio sa 9a5c, section HQ of 229 of the complete genome 
P.falciparum complete gene map of plastid-like DNA (IR -B) 

I fYiri c oriflp Iah^h M n\T A ... ' -i 



AF3 10264 jOvis aries leptin mRNA, partial 3' UTR 



5445 



0.19 
0.081 





5446 
5447 


AC007133 
AJ011727 


. ~ ^wiw** x wxut^i/ ^rjUJLU^ZV), mKiNA 

JArabidopsis thaliana chromosome II section 209 of 255 of the 

1 Complete seouence Seauenee from r\r\noc tzl aii m -)Tn 

IHomo sapiens partial TOP2 beta gene, sequence G 


2.2 
2 


ru 

£3 


1 5448 


AF2 14665 


jHomo sapiens cAMP responsive element modulator (CREM) gene 
jexon H 


0.00003 
0.52 


^? s 


5449 


1 AF269319 


Staphylococcus epidermidis strain SRI clone step.l002a02 genomic 
I seouence 

I JVUUvllvv 


1.9 


S 


5450 


AE001393 


I Plasmodium falciparum chromosome 2, section 30 of 73 of the 
I complete sequence 


0.72 


1:1:1 ■ 
f 

CF1 


5451 
5452 


XM 002437 
AF298224 


(Homo sapiens cAMP-regulated guanine nucleotide exchange factor II 
(CAMP-GEFII), mRNA 

Homo sapiens RPCI-II 289E13 CD21 gene, partial cds 


7E-32 
0.007 


r**s 

5 


5453 
5454 


AF254641 
1 XM 010303 


IHomo sapiens oxytocin/vasopressin intergenic region 
JHomo sapiens phosphate cytidylyltransferase l, choline, beta isoform 
(PCYTIB), mRNA 


0.01 
2 




5455 


AK014447 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library 
clone:3830421F13, full insert sequence ' | 


0.22 




5456 


M21084 


iP.hybnda 5-enolpyruvylshikimate 3-phosphate synthase mRNA, 
(complete cds 


0.24 




5457 


XM 006648 


Homo sapiens Microfibnl-associated gIycoprotein-2 (MAGP2), 
JmRNA ' 1 


1.4 




5458 
5459 


AF1 75434 
AE000775 


Gallus gallus T cell receptor delta chain (TCRD) pseudogene, partial 
(sequence 

Aquifex aeolicus section 107 of 109 of the complete genome [ 


0.76 




5460 


XM 031261 


Homo sapiens similar to KIAA0377 gene product (H. sapiens) 
(LOC90371), mRNA 


0.0008 
1.6 




1 5461 j 


BC008389 


Homo sapiens, Similar to hypothetical protein LOC5782 1 clone 
IMAGE:4076369, mRNA 


0.24 



414 



|seqid| 

NO 



ACCESSN 



5462 J AF327739 

AY030939 
XM 049237 

5465 I AY017147 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 

DESCRIP 

iStreptococcus thermophilus Pebl (pebl), GlnQ (glnQ), response , 
regulator Rrl (rrl), Hpk2 (hpk2), YycJ-like protein (yycJ), and MurM 
l(murM) genes, complete cds 

fflV-1 isolate NC3940-1998 from USA pol polyprotein (pel) gene " 
[partial cds 

Homo sapiens KIAA0841 protein (KIAA0841), mRNA 

jParacheilognathus imberbis mitochondrial control region, partial 
I sequence 



P VALUE 



0.63 

0.026 
0.053 

0.25 



X81451 
AF271036 



IN frontalis enolase gene, promotor region 



|Mus musculus clone rF12m odorant receptor gene, partial cdT 



5468 
5469 



J05637 
X75420 



Yeast (S.cerevisiae) 70 kDa heat shock protein (SSA4) gene, complete 
I cds 



0.22 



P.falciparum (FCR3) cpn60 gene 



5470 



Acmetobacter calcoaceticus KHP18 partial pKLffi plasmid including . 
AT?01 _ aberrant mercury resistance transposon TnPKLH2, truncated insertion 
AF213017 sequence IS101LD1, and determinants for CinH resolution system 1 



5472 AF183178 



Drosophila melanogaster filamin isoforms A and B (Filamin) gene, 
lexons 1 and 2 



5473 AF209925 



Plasmodium falciparum STARP antigen (STARP) gene, complete cds 



5471 I AC024806 ICaenorhabditis elegans cosmid Y52E8A7complete s^ 1 q.qL 



0.043 



2.1 



5474 
5475 



AJ4 10864 
NC 001889 



Ectomycorrhizal isolate ECM 133 internal partial internal transcribed 
spacer 1 (ITS1), 5.8S ribosomal RNA gene, and partial internal 
transcribed spacer 2 (ITS2); isolate ECM 133 



Dictyostelium discoideum plasmid Ddp5, complete sequence ' 



5476 
5477 



AY009594 



Boophilus microplus clone PNC 189 isolate Kilkivan2 microsatellite 
[sequence 



0.19 



U55723 



JHuman ataxia-telangiectasia (ATM) gene, exon25~ 



5478 
5479 



AF197229 



jSaguinus oedipus MHC class II antigen (Saoe-DRBl) gene Saoe- 
DRB1*0303 allele, partial sequence 



0.00004 



AE001290 



{Chlamydia trachomatis section 17 of 87 of the complete genolnT 



5480 



™ n,«, i KT™if S Slmilar t0 *™&*nu* DNA glycosylase (H. sapiens) 
XM_0 16611 |(LOC93577), mRNA 



0.069 



5481 
5482 



AJ224148 



f Saccharomyces cerevisiae mitochondrial Tyr-tRNA Asn-tRNA and 
|Met-tRNAgenes 



0.026 



AK022243 Homo sapiens cDNA FLJ12181 fis, clone MAMMA1000746 



5483 



U07354 



Legionella pneumophila Philadelphia- 1 intracellular multiplication 
region (icmV, icmW, icmX, lphB) genes, complete cds 



0.33 



5484 



Plasmodium falciparum chromosome 2, section 3 of 73 of the 
AE001366 I complete sequence 



0.14 



5485 



Plasmodium falciparum chromosome 2, section 22 of 73 of the 
AE001385 [complete sequence 



1.5 



5486 
5487 



L81686 



Homo sapiens (subclone l_a3 from PI H54) DNA sequence, complete 
I sequence 



0.62 



XMJ)17648|Homo sapiens leucine-rich repeat-containing 2 (LRRC2V ^RnF 



0.0006 



415 



I'll 



SEQID 


NO 


1 ACCESSN 


5488 


1 AE001372 


5489 


1 Z46943~ 


5490 


1 U68246 


5491 


1 AK009952 


5492 


1 AF 190794 


5493 


AF336796 


5494 j 


AB035844 


5495 


AF127240 


5496 


AY019789 


5497 


AF1Q6953 [j 


5498 


XM 007172 ] 



Table 3A Nearest Mriri.^ (B lastN ys riBnh , wlr , 

. DESCRIP 

Plasmodium lalaparum chromosome 2, section 9 of 73 of the ' 
complete sequence 

H-sapiens apopr otein lipase (LPL) n> - 
Dictyo.U.un, disooideum WacA rwacA. ^ ^ rM _ r|| . 

K" 8 "'T , adUlt ^ t0ngue cDNA * RIKEN full-length enriched 
library, clone:2310057C01, full insert c-,.,^ engthennched 

^^^^^^^^^^ 
Umodtam falciparum dynamm-like protein (dyn) mRNA, complete 

Coturnix cotumix japonica DNA. nte^ut- r,,^ 

jury^tiva microsatelhte MK G21 14 containing (AT)X78, genomic 
Homo sapiens S OS! (SOSl) gene, mrriai ^, 



f 0 

3 :: 

CP 

3..JJ 



5508 



XM 04142 8 

S50200 
AE00117 8 
Z74797 
X15979 



AF221538 
AF083226 
AF262729 



5512 

5514^ 

5515 



5516 | XM 034682 



NM 006915 Ca ^ Pigm em0Sa 2 

~ X95961 A.thaliana CER3-like gene 

U.^sap.ens mositol 1,4 ,5-triphosphate receptor, type 1 (rmu X " 

famine beta-hvdroxvlase |mice, Genomic/mPMA ,,,, „ t] 
Borreha purgdorferi of 70) of the cn m ^7~T 
IS.cerevisiae cliro moso me XV reading fan, npp v ™ 
Plasmodium falciparum alnha-tubulin I gene" 

?^!Ssr l Ma heat shnrk pmtein 70 fflsp7 °^ — — ■ - 1 

|^mSe tdt UCfeaUS " Udear WB ^ nr NHR - 8 °*"A, ' 

|Orrumorhyn^ wng 

(A^P 6^r OSenSI f AT ! SynthaSe 8 (ATP 8) ^ ATP ^thase 6 " 
p^ teC dS; nUt ° Ch0ndnal ««■ & miSondrial 

cI-mNA a A 8enta B ^ ° NA P 01 ^ 56 < Mt PoD g^e, complete 

SJf^T genC ' 0011115,616 S6quence ' and unknown genes 
mitochondrial genes for mitochondrial nnvi,,^ S ' 

H.sapiens ayst rophin-gene (introns 13/42) 

SST"" Strai " SK1 d ° ne genomic' 

Homo sapiens GE36 gene, exon 8 
S.pombe chromosome I cosmid c!7 D4 

416 



|_P VALU E | 

1.9 
0.58 
0.003 

0.08 

1.9 

9.6 

0.026 

0.42 

0.003 
0.43 
0.014 



5509 AF263057 



AF061244 
X64612 



AF269472 
AF297021 
AL391744 



0.003 
0.071 

0.18 
2.1 
0.026 

2.1 
0.000 8 

1.8 

0.69 

6E-10 



0.003 
0.69 



7E-22 



• ft 





Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQ ID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5517 


A TTI/r 1017 


A rnUi rlrtriri c tV% oltqnq A f- 1 r»ftA*71ft fCA IXC T) a-ama *<m.*wm1a*a 

Araoiuopsis uiaiiana Aiiguo / ju/r4riD_22 gene, complete cos 


ft AG 


5518 




A.inaiiana .r Arvr ITiKJNA IOr .r Alvr pFOtem 


ft AO 


5519 


AT lOftlAQ 


Homo sapiens mRNA; cDNA DKFZp547D064 (from clone 

LWVTZ<pj4 lUKJm) 


A 1 Oft 


5520 


NM U32141 


Homo sapiens hypothetical protein DKFZp434K1421 


A 1 AO 

e-ioy 


5521 


XM 037214 


Homo sapiens laminin, alpha 5 (LAMA5), mRNA 


0.62 


5522 


AT 11 A^TC 

AL1 10675 


Botrytis cinerea strain T4 cDNA library under conditions of nitrogen 
deprivation 


ft ftftQ 


5523 


AF124216 


Dasyurus viverrinus/maculatus mixed DNA library microsatellite 
Q4.4.10 sequence 


1.9 


5524 


A T CO! *7/C 1 

AJL5o37o2 


Human chromosome 14 DNA sequence Partial sequence from BAC R- 
1109N18_PCR1 of library RPCI-11 from chromosome 14 of Homo 
sapiens (Human), complete sequence 




5525 


AC024758 


Caenorhabditis elegans cosmid Y37E1 1AM, complete sequence 


1.4 


5526 


AF260528 


Homo sapiens MER receptor tyrosine kinase (MERTK) gene, exons 
17 and 18 


JE-JJ 




AF2 46997 


Plasmodium falciparum malaria exported protein-1 (EXP-1) gene, 
complete cds 


ft ftl O 


5528 


"Vfc jf A1 101A 

XM_01 1810 


Homo sapiens sigma receptor (SR3 1747 binding protein 1) (SR-BP1), 

*«T>XT A 

mRNA 




5529 


XM011100 


Homo sapiens complement component 7 (C7), mRNA 


5.9 


5530 


X64315 


S.scrofa mRNA for Na/D-glucose cotransporter regulatory-suburut 


ft ftftftT 
U.UUU / 


5531 


XM_04o945 


Homo sapiens bromodomain adjacent to zinc finger domain, 2 A 

/T) a 7^ A \ mOXT A 

(dAZzAJ, mKISA 




5532 


XM 046506 


Homo sapiens ui^z J rjooiiU24 protein (UKrZr!>ooiiU24) J mKJNA 


1JD-1U 


5533 


XM_002977 


Homo sapiens ring finger protein 13 (RNF13), mRNA 


2.8 


5534 


AJ300359 


Homo sapiens partial RYR2 gene for ryanodine receptor 2, exon 20 


1E-36 


5535 


BC008590 


Homo sapiens, hypothetical protein rU2l3l3, clone Muclo©2U 
IMAGE:4148772, mRNA, complete cds 


e-154 


5536 


XM_05024o 


Homo sapiens secretory protein SEC8; KIAA1699 protein (SEC8), 

_.n\T A 

mRNA 


ft 
u 




AL442094 


Homo sapiens mRNA; cDNA DKFZp547E024 (from clone 
DKFZp547E024) 


2 


5518 


AF225975 


Tribolium castaneum Tc-tailless gene, upstream regulatory region and 
partial cds; tRNA-Tyr gene, complete sequence; and unknown gene 


0.24 


5539 


AJ304934 


Sidalcea malviflora x Sidalcea oregana 5.8S rRNA gene, internal 
transcribed spacer 1 (ITS1) and internal transcribed spacer 2 (ITS2) 


2.1 


5540 


AE001528 


Helicobacter pylori, strain J99 section 89 of 132 of the complete 
genome 


0.009 



417 





w w — 

Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5541 


XM_046698 


Homo sapiens putative DNA/chromatin binding motif (PLU-1), 
mRNA 


2.5 


5542 


AF101309 


Caenorhabditis elegans cosmid H24G06, complete sequence 


2.1 


5543 


AL023839 


Caenorhabditis elegans cosmid Y39A1C, complete sequence 


1.4 


5544 


XM_017673 


Homo sapiens hypothetical protein FLJ21934 (FLJ21934), mRNA 


0 


5545 


U27317 


Homo sapiens 1 1 beta-hydroxysteroid dehydrogenase 2 (HSD1 1B2) 
gene, complete cds 


2E-92 


5546 


XM_045095 


Homo sapiens KIAA1320 protein (KIAA1320), mRNA 


0.6 


5547 


AF143066 


Lepidopilum scabrisetum ribosomal protein system 4 (rps4) gene, 
partial cds 


0.25 


5548 


BCO 10880 


Homo sapiens, Similar to dihydropyrimidinase-like 2, clone 
MGC:1757 IMAGE:3542012, mRNA, complete cds 


5.2 


5549 


L25128 


Lycopersicon esculentum auxin-induced proteinase inhibitor (ARPI) 
gene, complete cds 


0.55 


5550 


NM_030665 


Homo sapiens retinoic acid induced 1 (RAI1), mRNA 


9E-57 


5552 


AF127699 


Erythroxylum argentinum chloroplast atpB-rbcL spacer 


5.5 


5553 


U97192 


Caenorhabditis elegans cosmid C01F4, complete sequence 


2.7 


5554 


AF145302 


Arabidopsis thaliana 14-3-3 protein GF14 epsilon (GRF10) gene, 
complete cds 


8.9 


5555 


AF169241 


Citrus sinensis capsanthin/capsorubin synthase (CCS) gene, complete 
cds 


0.21 


5556 


AF153268 


Tetrahymena thermophila dynein heavy chain (DYH6) gene, partial 
cds 


1E-15 


5557 


L13966 


Mouse delta/YYl/NF-El/UCRBP transcription factor, exon 3 


0.65 


5558 


AF172730 


Streptococcus gordonii Sagp-like protein gene, complete cds; and Oct- 
like protein gene, partial cds 


1.9 


5559 


AL591422 


Human DNA sequence from clone RP1 1-124P3 on chromosome X, 
complete sequence [Homo sapiens] 


18 


5560 


AF328433 


Homo sapiens clone 2qtel_c275_24t3 sequence 


4.8 


5561 


AF052640 


Podarcis pityusensis cytochrome b (cytb) gene, mitochondrial gene 
encoding mitochondrial protein, partial cds 


2.3 


5562 


AF133000 


Cicindela marutha 16S large subunit ribosomal RNA, mitochondrial 
gene for mitochondrial product, complete sequence 


0.024 


5563 


AJ239057 


Antirrhinum majus far gene for farinelli protein 


5:1 


5564 


AL032651 


Caenorhabditis elegans cosmid Y6D1A, complete sequence 


2 


5565 


AJ278974 


Homo sapiens partial SNAP-23 gene for synaptosome associated 
protein-23, exons 6-8 


1.8 


5566 


M33379 


Guinea pig lipoprotein lipase (gplpl) gene, exon 3-7 


0.46 


5567 


AF303443 


Rheum reticulatum tRNA-Leu (trnL) gene and trnL-trnF intergenic 
spacer, partial sequence; chloroplast gene for chloroplast product 


0.018 


5568 


L81826 


Homo sapiens (subclone l_gl0 from PI H39) DNA sequence, 
complete sequence 


4E-16 


5569 


NM_005958 


Homo sapiens melatonin receptor 1A (MTNR1A), mRNA 


2 


5570 


AJ133356 


Homo sapiens MSMB gene (partial), exons 2-3 


0.66 



418 



|SEQID| 
NO 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



ACCESSN 



DESCRIP 



5571 



D86577 



Human herpesvirus type 7 U38 gene, U39 gene for glycoprotein B and 
U40 gene for DNA polymerase, complete and partial cds 



P VALUE 



5572 AE006578 



Streptococcus pyogenes Ml GAS strain SF370, section 107 of 167 of 
I the complete genome 



I Streptococcus mutans ATPase operon, complete sequence" 



0.008 



5575 AF301004 



Schistosoma mansoni thioredoxin peroxidase 3 (TPx3) gene, complete 
cds ' 1 



5576 AF269295 



Staphylococcus epidermidis strain SRI clone step.l000d09 genomic 
[sequence 



0.23 



5577 AF1 14934 



Saccharomyces sp. S6U ATP synthase subunit 8 (ATP8) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 



0.008 



5578 AF261143 Hom o sapiens chromosome 9 lung cancer-associated mRNA 

^LJ X95276 IP. falciparum complete gene map of plastid-like DNA (IR-B) , WU uuu 

5580 NM_030844 IRattus norvegicus islet cell autoantigen 1, 69 kDa (leal), mRNA | T63 



0.00003 



5581 XM_001738 



Homo sapiens proteoglycan 4, (megakaryocyte stimulating factor, 
[articular superficial zone protein) (PRG4), mRNA 



5582 AB060860 



Macaca fascicularis brain cDNA clone:QtrA-l 1624, full insert 
| sequence 



0.21 



5583 I NC_001566 lApis mellifera ligustica mitochondrion, complete genome 1 (TB" 



5584 
5585 



AF239566 
X92893 



Polycycnis ornata tRNA-Leu (tmL) gene, partial sequence; and trnL-F| 
allergenic spacer region, complete sequence; chloroplast gene for 
|chloroplast product 



[C.roseus GGPP synthase gene 



0.008 



5586 XMJ)02601 [Homo sapiens desmin (PES), mRNA" 



1.6 



fen* 



5587 J NM_012745 
5588 



Rattus norvegicus CD94 antigen (located within the rat natural killer 
gene complex) (Klrdl), mRNA 



5.7 



AK022035 Homo sapiens cDNA FLJ1 1973 fis, clone HEMBB100122T 



1.4 



5589 AB020595 



Plasmodium falciparum 1-CysPxn mRNA for 1-cys peroxidoxin, 
| complete cds 



0.023 



1.4 



Human DNA sequence from clone RP5-839B11 on chromosome 20. 
Contains the first coding exon of the gene for a novel protein with a 
iKunitz/Bovine pancreatic trypsin inhibitor domain and WAP-type 
AL121778 (Whey Acidic Protein) 'four-disulfide core' domains and an e> | 0 07 



5590 



5591 
5592 



AF222338 



IRattus norvegicus voltage gated N-type calcium channel alphalB 
gene, partial cds, alternatively spliced 



AF247193 jMus musculus endobrevin (Vamp8) gene, exonT 



5.8 



0.18 



5593 AC026061 



Homo sapiens BAC clone RP11-223K9 from Y, complete sequence 



5594 XM 030132 



Homo sapiens ubiquitin specific protease 9, X chromosome 
(Drosophila fat facets related) (USP9X), mRNA 



e-163 



5595 
5596 



AK015052 
AK023333 



Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930402K13, full insert sequence 



0.21 



5597 
5598 



AB058702 
Z36800 



'Homo sapiens cDNA FLJ13271 fis, clone OVARC1001000 



0.22 



Homo sapiens mRNA for KIAA1799 protein, partial cds 



0.002 



|H.sapiens (xs!49) mRNA, 250bp 



0.007 



0.0003 



419 



• 



111 



£3 
CO 



|SEQID 

NO I ACCESSN 



Table 3A Nearest Neighbor (BlastN vs. Gcnbank) 



DESCRIP 



5599 XM 044123 

5600 XM 030272 



Homo sapiens cadherin 20, type 2 (CDH20), mRNA 



Homo sapiens LOC88835 (LOC88835), mRNA 



5601 
5602 



AF030944 
U59882 



Brugia malayi microfilarial sheath protein SHP3a (Bmshp3a) and 
microfilarial sheath protein SHP3 precursor (Bmshp3) genes, 
I complete cds 



|Mus musculus MLH1 gene, exon 2 



5603 
5604 



AL591498 
AB002169 



Human DNA sequence from clone RP1 1-1 13L12 on chromosome 13, 
| complete sequence [Homo sapiens] 



Rattus norvegicus RT1.P1 pseudogene for TL antigen 



5605 



AB009907 Luciola kuroiwae mitochondrial DNA for 16S rRNA, partial sequence 



5606 



AF336829 Rattus norvegicus NEDD8-activating enzyme mRNA, complete cds 



Scyliorhinus canicula partial mRNA for follicle stimulating hormone 
5607 | AJ3 10344 |beta subunit (fsh beta gene) 



5608 
5609 



D00653 Homo sapiens gene for enteric smooth muscle gamma-actin, exon 8 



AF203 972 Shuttle vector pB A complete sequence 



5610 AK004804 



Mus musculus adult male lung cDNA, RIKEN full-length enriched 
library, clone:! 2000 16B 11, full insert sequence 



5611 AF270009 



Staphylococcus epidermidis strain SRI clone step.l040e03 genomic 
I sequence 



5612 BC006448 [Homo sapiens, clone IMAGE:3637085, mRNA 



5613 AF298202 



Dictyostelium discoideum putative transposon DDT-B, complete 
I sequence 



5614 AF27200 1 I Ebola virus subtype Zaire strain Mayinga complete genome 



5615 M86544 Cow prostaglandin F synthetase II (PGFSII) mRNA, complete cds 



5616 XM_039684 Homo sapiens putative G-protein coupled receptor (SH120), mRNA 



Medicago truncatula mRNA for putative AUXl-like permease (lax3 
5617 | AJ299399 |gene) 



5618 L39214 Ovis aries glucose transporter type 3 (GLUT-3) mRNA, complete cds 



5619 I XM_010126 [Homo sapiens stromal antigen 2 (STAG2), mRNA 



H. sapiens (D17S931) DNA segment containing (CA) repeat; clone 
5620 | Z23801 |AFM248tg5; single read 



5621 



I ' O •> * 

AF210842 Homo sapiens HARP (HARP) gene, exon 17 and complete cds 



5622 



5623 



Staphylococcus aureus DNA, complete structure of cassette 
AB047239 chromosome(SCC)-like element, strain:ATCC25923 
AB035 193 [Homo sapiens RHCE gene, intron 7, complete sequence 



5624 XMJ)45035 Homo sapiens hypothetical protein FLJ21343 (FLJ21343), mRNA 

5625 XM_052223 Homo sapiens DKFZP564G092 protein (DKFZP564G092), mRNA 



5626 | Z28007 | S.cerevisiae chromosome XI reading frame QRF YKL007w 



P VALUE 



4E-48 



0.3 



0.089 



5.2 



1.8 



0.003 



0.58 



0.22 



5.3 



0.037 



1.9 



0.019 



0.002 



0.075 



0.16 



6.4 



0.23 



e-127 



5E-19 



0.65 



0.25 



2.1 



2.4 



0.67 



0.23 



420 



Table 3A Nearest Neighbor (BlastN vs. Genbank) 



SEQK 
NO 


) 

ACCESSN 


DESCRIP 


P VALUE 


5627 


M14080 


Herpesvirus saimiri thymidylate synthase gene, complete cds 




5628 


AF067218 


Caenorhabditis elegans cosmid K10E9 


0.071 


5629 


AK017102 


Mus musculus adult male testis cDNA, RDCEN full-length enriched 
library, clone:4933437K13, full insert sequence 


0.76 


5630 


XM 038524 


Homo sapiens hypothetical protein FLJ23059 (FLJ23059) mRNA 


C-l lo 


5631 


AB052773 


Bombyx mori Bmdsx gene, exon 1, 2, 3, 4. female-soeciflc snlicinf* 
product complete cds 


0.24 


5632 


AF233069 


Galdieria sulphuraria maturase (matK) gene, partial cds; 50S 
ribosomal protein, ribulose-L5-bisohosDhate ca rhn wl a Qp/nwapn <* 
large subunit (rbcL), ribulose-l,5-bisphosphate carboxylase/oxygenase 
small subunit (rbcS), acetohydroxy-acid synthase small> 


V.J J 


5633 


AK027218 


Homo sapiens cDNA: FLJ23565 fis, clone LNG10846 


0.24 


5634 


AF198462 


HIV-1 isolate WM710-5 from Australia envelope elvconrotein fenv 1 * 
gene, partial cds 


\J.KJ 1 


5636 


AF213717 


Euhadra quaesita haplotype Mana-B 16S ribosomal RNA gene, partial 
sequence; mitochondrial gene for mitochondrial product 


0 9 1 


5637 


AE002730 


Drosophila melanogaster genomic scaffold 142000013385730, 
complete sequence 


1.4 


5638 


AF1 10027 


Candida albicans A TP-dependent transporter (YCF) eene comolete 
cds 


0.069 


5639 


Z73978 


Caenorhabditis elegans cosmid ZC302, complete sequence 


0 71 


5640 


AF379877 


Aclista sp. M226 16S ribosomal RNA eene oartial seanence- 
mitochondrial gene for mitochondrial product 


0.52 


5641 


AF364544 


Lanx lyallii microsatellite LLY13 sequence 




5642 


AF346905 


Dermatophagoides farinae allergen Def f II precursor, mRNA, partial 
cds 


o oor 


5643 


AE001426 


Plasmodium falciparum chromosome 2, section 63 of 73 of the 
complete sequence 


0 001 


5644 


XM 002850 


Homo sapiens TU3A protein (TU3 A), mRNA 


0.078 


5645 


AB032899 


Rattus norvegicus PIPK2 alpha mRNA for DhosDhatidvlinositnl S- 
phosphate 4-kinase alpha, complete cds 




5646 


AK017370 


Mus musculus 6 days neonate head cDNA, RDCEN full-length 
enriched library, clone:5430431E21, full insert sequence 




5647 


XM 042126 


Homo sapiens similar to GTP-RHO BINDING PROTEIN 1 
(RHOPHILIN) (M. musculus) (LOC91997), mRNA 


^F-47 

J Aw" *T / 


5648 


] 

XM_048666 ] 


Homo sapiens runt-related transcription factor 1 (acute myeloid 
leukemia 1; amll oncogene) (RUNX1), mRNA 


6.3 


5649 


AJ000259 ( 


Caenorhabditis elegans osm-6 mRNA 


0.6 


5650 


] 

AK017713 1 


Vlus musculus 8 days embryo cDNA, RDCEN full-length enriched 
ibrary, clone:5730488L07, full insert sequence 


0.087 


5651 


] 

XM_010294 i 


lomo sapiens arylsulfatase E (chondrodysplasia punctata 1) (ARSE), 
nRNA 


8E-36 


5652 


XM_039942 I 


*>mo sapiens Meis (mouse) homolog 3 (MEIS3), mRNA 


0.53 



421 






Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




P VALUE 


5653 


AKO 19524 


Mus musculus 0 day neonate head cDNA, RIKEN full-length enriched 

libra rv elone*4RT34^Q01 7 fiilt incArf cAmion^ 
iii/ioij, wuuu.toj j*tJ7vyi / , in ii iiioci t sequence 


I 

1.7 


5654 


XM 045122 


Homo sapiens KIAA1451 protein (KIAA1451), mRNA 


5E-38 


5656 


AK009345 


N/fllC mil C/*i line oHillt malp ir\nmia nFWTA T5 TVCXT -C.11 1 — _ • _i_ j 

iviub inu^cuius auuit maie tongue cuina, KJJvbiN tull-lengtn enriched 
library, clone:2310014M14, full insert sequence 


3E-35 


5657 


AF 192496 


irixfULKt aapiciiD ruj\rL gene, j iiaiiKing region 


1.5 


5658 


XM__052620 


Homo sapiens KIAA0935 protein (KIAA0935), mRNA 


0.0003 


5659 


XM 049514 


Homo *wiriipn*I hvnotll£*Hr'5il r\rr\te*i n T7T Tl"?l^"3 /T7T Till ^*3\ m n\TA 
nuuiu &apiciia iiypuuieuL/di protein rL*J 1 j ID J (r'.LJU 1 jj), ITLKNA 


5.6 


5660 


AJ400965 


PrntpiiQ mirsihilic ORP"l TYMA Vmt A nana o*^sl man r • r>-r> 

n uicus iiuictuiiib kjsw i L/iN/v, KaiA gene ana meiLb gene, strain PR 


6.5 


5661 


NM 031809 


Rattus norvegicus cyclic nucleotide-gated channel beta subunit 1 
rCnffbn mRNA 


0.63 


5662 


AF230741 


Notothenia coriiceps alphaNCPl (alphaNCPI), betaNCPl (betaNCPI), 
dipiidiN^i'z ^dipiidTNL^r'ii^, ana oetaiNCrz (DeuuNUPll) genes, 

wv/iii^fiW'Lw uia 


1.6 


5663 


AC079878 


Homo (saoipn*; R A plnnp ppi 1 ."i/t^P0 1 -fiv^m *7 r^mv-tiA+A r<Aj-»iA« nin 

Iiuiuu »apicii& uiuiic xvjri L-o^ofzi irom /, complete sequence 


0.086 


5664 


XM 029904 


Homo sapiens hypothetical gene supported by AK023961 
fLOC90205^ mRNA 


1.9 


5665 


AY042792 


Arabidopsis thaliana translin-like protein (At2g37020; Tl J8.20) 
mRNA, complete cds 


4.6 


5666 


ABO 18347 


Homo QanipnQ rnRTsT A for K"TA A flSHA nmt^in nortioi 

iiumu aapiuia liuvLNrv xui ivi/Y/w/ou^f protein, partial cos 


0.068 


5667 


AY018017 


aduvd iiuirobcuciiue xvlkouj4z contaimng (ALi)Xz2, closest to 
marker G1185, genomic sequence 


0.059 


5668 


AE005310 


iz.bL'iiciicmd con v^i^/.ri/ jQ.L/i^yjj genome, contig i oi 3, section 134 
of 155 


0.49 


5669 


XM 034498 


Homo sapiens farnesyl diphosphate synthase (farnesyl pyrophosphate 
^yiiuicidbc, uiiiieuiyidiiyiuransu^ansierase, geranyitranstransierase) 
fFDPS> mRNA 


5.3 


5670 


AK027123 


Homo sapiens cDNA: FLJ23470 fis, clone HSI 11950 


0.17 


5671 


NM_0 13696 


Mus musculus thyrotropin releasing hormone receptor (Trhr), mRNA 


0.23 


5673 


L23758 


Bos taurus lysozyme 7A mRNA 


0.8 


5674 


AY034963 


/\xauiaopsis inaiiana puiauve polypeptide cnam release factor 

V- 1 - -L-J-EV^*/. 1. / J llLCVl^l /A., IAJ111LM.CIC LvUo 


1 A 

1.4 


5675 


XM_028009 


Homo sapiens KIAA1563 protein (KIAA1563), mRNA 


2E-71 


5676 


XM_046473 


Homo sapiens Rho GDP dissociation inhibitor (GDI) alpha 
(ARHGDIA), mRNA 


0.59 


5677 


NM_000148 


Homo sapiens fucosyltransferase 1 (galactoside 2-alpha-L- 
fucosyltransferase, Bombay phenotype included) (FUT1), mRNA 


0.0000001 


5678 


AF203477 


Drosophila melanogaster Sticky chl (stichl) mRNA, partial cds 


2.1 


5679 


] 

AF269242 


Plasmodium falciparum transmission-blocking target antigen Pfs230 
(s230) gene, complete cds 


0.23 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQBD 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5680 


AE006954 


Mycobacterium tuberculosis CDC1551, section 40 of 280 of the 
complete genome 


2.1 


5681 


Z93386 


Caenorhabditis elegans cosmid Rl 1H6, complete sequence 


0.023 


5682 


XM_012723 


Homo sapiens chromosome 18 open reading frame 1 (C18orfl), 
mRNA 


5.8 


5683 


AF171785 


Drosophila melanogaster clone 69 mRNA sequence 


0.73 


5684 


AF159105 


Homo sapiens intestinal trefoil factor gene, 5* flanking region 


0.02 


5685 


NM 011889 


Mus musculus septin 3 (Sept3), mRNA 


4.6 


5688 


NM 008541 


Mus musculus MAD homolog 5 (Drosophila) (Madh5), mRNA 


0.0000002 


5689 


XM 043345 


Homo sapiens similar to inner centromere protein antigens (135kD, 
155kD) (H. sapiens) (LOC92175), mRNA 


0.075 


5690 


AL590109 


Human DNA sequence from clone RP1 1-405J10 on chromosome 10, 
complete sequence [Homo sapiens] 


0.003 


5691 


XM_051593 


Homo sapiens hypothetical gene supported by AF055004 
(LOC93477), mRNA 


0.55 


5692 


U22345 


Human chromosome 20ql2 locus-specific repeat 


0.008 


5693 


X63385 


B.thunngiensis IS23 IF DNA 


0.72 


5694 


AF288224 


Brachyspira hyodysenteriae DNA Gyrase B subunit gene, complete 
cds 


0.006 


5695 


AF279456 


Lycopersicon hirsutum sesquiterpene synthase 2 (SSTLH2) mRNA, 
complete cds 


0.06 


5696 


AF055992 


Homo sapiens Duffy antigen/chemokine receptor (FY) gene, FY*X 
allele, complete cds 


1.5 


5697 


AF167306 


Homo sapiens synuclein alpha interacting protein (SNCAIP) gene, 
exon 10 and complete cds 


0.99 


5698 


AJ277985 


Nicotiana tabacum dreppl gene, exons 1-4 


0.22 


5699 


NM 030238 


Mus musculus dynein, cytoplasmic, heavy chain 1 (Dnchcl), mRNA 


4E-25 


5701 


X95343 


N.tabacum mRNA for HSR201 protein 


5.9 


5702 


AE002116 


Ureaplasma urealyticum section 17 of 59 of the complete genome 


0.68 


5704 


XM_029907 


Homo sapiens anillin (Drosophila Scraps homolog), actin binding 
protein (ANLN), mRNA 


0.72 


5705 


AJ304804 


Homo sapiens ANKRD2 gene for skeletal muscle ankyrin repeat, 
exons 1-9 


0.22 


5706 


AF371373 


Cricetulus griseus hypothetical protein Al-3 mRNA, complete cds 


0.72 


5707 


AE007446 


Streptococcus pneumoniae section 129 of 194 of the complete genome 


0.18 


5708 


L44121 


Homo sapiens (clone pHK2.1D) CMT1A gene, repeat unit 


6.6 


5709 


X90848 


S.scrofa ppk98 gene 


0.23 


5710 


AF397903 


Pisum sativum AAA-metalloprotease FtsH (FTSH) mRNA, complete 
cds; nuclear gene for mitochondrial product 


3.8 


5711 


XM_027365 


Homo sapiens ADP-ribosylation factor-like 6 interacting protein 
(ARL6IP), mRNA 


1E-30 
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Table 3A Nearest Neighbor (BlastN vs. Genbanlrt 


SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


5712 


L01574 


Homo sapiens P-selectin CD62 fGRMP 1 * pptip v pnH anH nmmAtM 
region 


0.00001 


5713 


AY023618 


Orvza sativa microsatellite MR 05941 mntainino ^tta\y<;s «i At . 0(1 * 

J wvm.«^tu iiuuivjaLwiiiv ivuvvjj^tj i*uxiuixxxiiig ^1 I /\ JJ\^3D^ ClOSeSl 

to marker RG908, genomic sequence 


0.00009 


5714 


AE004377 


Vibrio cholerae chromoQfvmp TT cpr*H/vn "XA Af oi svF+ttA ^nm^i 

t ^* ,u ^*ivfiv^iav/ wu umuauiuc xx, ocvuuii j*f oi "j oi me complete 
chromosome 


0.58 


5715 


AL1 15472 


Botrvtis cinerea strain T4 rDNA liHrorv lin^pr ^nn/ittinnn ftfnj+» rt ,»«« 
uuujruii wxiL^/iv^a Duoiii x *t L<i_^iN/A. nuiciiy untie r conuiiions oi mtrogen 

deprivation 


0.2 


5716 


XM 042855 


Homo saniens hvnothf*Hra1 opnp ciirmnrtpH Kv Rrnn^i^Q" ornnom 
(LOC92105), mRNA 


8E-23 


5717 


XM_0 15203 


Homo sanieris tgjHoti ftfintninintr H^l^t^H in <i7Ancruirmiin i. /iaia«a^ • 
^h 1 " 13 wi/iiuuiuii^ uciclcu in d^oosperniia z, ueieteo. in 

azoospermia (LOC90902), mRNA 


1E-19 


5718 


AYO 19789 


Orvza Sativa microsatellite \fRfr? 1 14 rrmtninint* /'A^Y7fi fTonnmio 

nuviuaaiwuic ivjjxvjzxit cuxiuiiiiiiig ^/\ i j^/o, genomic 

sequence 


0.0000003 


5719 


ABO 10300 


Garlic vims A genomic RNA corrmletp «¥mpnrp 


U.UU1 


5720 


AF171785 


Drosoohila melaxiof?a<tfer clnnp fiO mRMA cpnnpn^p 


ft T5 


5721 


AF325188 


Phaseolus coccineus suspensor-specific protein (C5414ike) gene, 
complete cds 


U.Z4 


5722 


XM_048692 


Homo sapiens similar to hvnnthptiral nrntpin n-i coT^io»^c^ 
(LOC92998), mRNA 


lii-lo 


5723 


AE006878 


Sulfolobus solfataricus section 237 of 272 nf thp mmni ptp opnnmp 


z.z 


5724 


AF038919 


Dictyostelium discoideum Psl A (psl A) gene, complete cds 


0.026 


5725 


AF368257 


Bacillus thurinffiensis nlacmiH pnmHpH PtvIRq (nr\r\ Ro\ n ono 
complete cds 


2.1 


5726 


AF148319 


Homo sapiens imprinting center AS-SRO rp<Hnn 

v**** lining wiiivt ^ iiu Ljivvy IV'K^lvlll 


1 Q 

X.O 


5727 


D49412 


Human gene for interleukin 3 receptor alpha subunit, exon 1 1 


0.23 


5728 


AJO 12488 


Mus musculus gene encoding serotonin receptor 5-HT2B, exons 1-3 


0.25 


5729 


AL445224 


Human DNA <*eoiiencp frnm Hnnp PPi i _t ^opin nn ^rAiMAPAma ^ 
ixuiiiaii .i^j.^.r-v owv^Li^xiv^ xxuxii LfXuxic xvr i on cnxomosome o ? 

complete sequence [Homo sapiensl 


z 


5730 


AK025021 


Homo saniens cDNA* FT T91^fiR fic Hr»nn rni n'ln^ t^iVkUr 

oap&^iio riwj/,ijoo iis>, ciune lajjlajjloo, iiigiiiy similar 

to AF 1585 5 5 Homo sapiens ffhitamina^p P mPMA 


e-i / 1 


5731 


D38252 


Rao nanus sativus darlc-inrfnpihlp anH cpnpc^pnr>p_accA^i itoH rrana 
Ax **i' ii ** iii -* ,j "jnuvuij ucuiw uiuiuniuib cixlu aciicstueiivc uo&ociaieu gene, 

exon 1 to 5 


0.62 


5732 


XM_051880 


Homo saniens hvnothptiral apnp amrwvrtwi Kir a v no ^'70'? 
(LOC93533), mRNA 


J.O 


5733 


K00769 


veast fs.pombe) 5s rrna eene and flankc Hnnp ncnri l 


f\ nil 


5734 


AL031254 


Caenorhabditis elegans cosmid 4R79, complete sequence 


0.082 


5735 


AK001945 


Homo sapiens cDNA FLJ1 1083 fis, clone PLACE1005232 


2.1 


5736 


XM_0 12452 


Homo sapiens ryanodine receptor 3 (RYR3), mRNA 


0.008 


5737 


XM_046442 ] 


Homo sapiens hypothetical protein FLJ23045 (FLJ23045), mRNA 


6E-33 


5738 


AF072513 ] 


[Ilex argentinus microsatellite Ia203 


0.69 


5739 


XM_050747 ] 


Homo sapiens TClO-like Rho GTPase (TCL), mRNA 


5E-20 


5740 


AC079757 ] 


Homo sapiens clone RP11-109N2, complete sequence 


0.62 
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SEQIE 
NO 


> 


DESCRIP 


P VALUE 


5741 


XM 042128 


Homo sapiens KIAA1691 protein (KIAA1691), mRNA 


4.4 


5742 




Homo sapiens caldesmon 1 (CALD1), mRNA 


4E-67 


5743 


BC002303 


Mus musculus, clone IMAGE:3591199, mRNA 


0.69 


5744 


AF273045 


Homo sapiens CTCL tumor antigen sel4-3 mRNA, complete cds 


5E-28 


5745 


AE001798 


Thermotoga maritima section 1 10 of 136 of the complete genome 


0.71 


5746 


ATZ / jzZj 


Homo sapiens Fanconi anemia complementation group D2 protein 
{rJKjXLsUz) gene, exon 7 


0.009 


5747 


AT Sft0090 


Human DNA sequence from clone RP1 1-1 17A20 on chromosome 6, 
complete sequence [Homo sapiens] 


0.026 


5748 


AF101092 


Homo sapiens collagen type XI alpha- 1 (COL1 1 Al) gene, exon 29 


0.019 


5749 


S80521 


dMax-basic hehx-loop-hehx/leucine zipper protein {alternatively 
spliced} [mice, WEHI 231 B-lymphoma cells, mRNA Partial, 752 nt] 


0.003 


5750 


AF291825 


Lronlla gonlla von Hippel-Lindau tumor suppressor (VHL) gene, 
promoter region 


0.078 


5751 


AT71 A 


Kluyveromyces lactis cytochrome c oxidase subunit III (COX3) gene, 
mitochondrial gene encoding mitochondrial protein, complete cds 


0.027 


5752 


NM_001423 


Homo sapiens epithelial membrane protein 1 (EMP1), mRNA 


5E-31 


5753 


AB045575 


Danio rerio DNA, fkk-7c locus, insertion site of the medaka fish Tol2 
element 


1.7 


5754 


AL050120 


Homo sapiens mRNA; cDNA DKFZp586D211 (from clone 
DKFZp586D211); partial cds 


2 


5755 




Homo sapiens genomic DNA, chromosome 21q22.3, clone:f60A9, 
complete sequence 


0.009 


5756 




Homo sapiens cDNA FLJ10787 fis, clone NT2RP4000481, weakly 
similar to A IF-DEPENDENT RNA HELICASE DOB1 


0 


5757 


INIVI U J I J 1 I 


Rattus norvegicus Insulin-like growth factor II (somatomedin A) 

IgJA), mRNA 


0.025 


5758 


AT 


Homo sapiens mRNA; cDNA DKFZp434B249 (from clone 
UJSJ , Z,p4J4r>z4Vj 


0.71 


5760 


AF3 17787 


Rattus norvegicus Cd36 gene, promoter region 


2.1 


5761 




Homo sapiens calreticulin (CALR), mRNA 


6E-41 


5762 


XM 037847 


Homo sapiens neurexophilin 3 (NXPH3), mRNA 


4.2 


5763 


AF288818 


Rattus norve incus cvtochrome P4.SO 4FS opnp ^nmni^ ^Ac 

tv^ivuo vjr iwiuunib rt jv/ tr J gene, dJlIipieie COS 


U.U26 


5764 


AF269237 


Cloning vector pTX-GFP, complete sequence 


0.71 


5765 


AK021493 ] 


Homo sapiens cDNA FLJ1 143 1 fis, clone HEMBA 100 1094 


8E-76 


5766 


AF034077 ] 


Equus caballus alpha- 1 -antitrypsin (Spi2) gene, complete cds 


0.0003 


5767 


] 

XM 030851 i 


Homo sapiens G protein-coupled receptor kinase-interactor 1 (GIT1), 
mRNA 


5.4 


5768 


AY039919 ( 


krabidopsis thaliana unknown protein (AT4g09340) mRNA, 
complete cds 


4.8 
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SEQIE 
NO 


> 

ACCESSN 


DESCRIP 


P VALUE 


5769 


AK018211 


MllS milSClllllS adillt malt* TTlpHlll In nK1r»nrroto ATYNIA DTVUXT 41,11 

length enriched library, clone:6330510M09, full insert sequence 


6.4 


5770 


XM_006622 


iiuiuu oapi^na a x jrrvr iiiid<u*uii££ pruiein, uinoing protein l (iipnn 
beta 1) (PPFIBP1), mRNA 


0.079 


5771 


U72335 


Human platelet activating factor acetylhydrolase, brain isoform, 45 
kDa subunit fLISl * eene exons 3 anH 4 


0.2 


5772 


XM 048810 


Homo sapiens hvoothetical nrotein FLJ221 16 fFT T771 \f*\ mUMA 


V 


5773 


AF2 18257 


ArabldODSlS thaliana 1 ^.-oxo-nlivtnHipnrmtf^ rpHnrtocp nana onmniA+a 

x u uwxvtv/^oA^ uuuicuia x^-v/^yvj lyiljf LvlilClllJcllC ICUULuiaG tiCllC, COmpiete 

cds 




5774- 


AKO 14450 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA RTKFN fiill-lpnoth ^nrir-h^ nKron f 
clone:3830422A13, full insert sequence 




5775 


AK005342 


MuS mUSCllllI*; adult malp PPrpViplliitri rT^KT A -fiill lan^li 

enriched library cIone*1500032H18 fiill incprt cpnupnrp 


l.o 


5776 


M99412 


Human interleukin-8 receptor (IL8RB) gene, complete cds 


0.063 


5777 


NC_001619 


Canine oral r>ani Horn ?i vim q mmnlptp opnnmA 


1.8 


5778 


NM 030238 


Mus musculus dynein, cytoplasmic, heavy chain 1 (Dnchcl), mRNA 


0 


5779 


AL031254 


Caenorhabditis eleeans cosmiH 4R7Q mmniptp cpnnpnrp 


U.Uo4 


5781 


Z74797 


S.cerevisiae chromosome XV reading fnmp orf vht ci^^r* 


L.L 


5782 


XM 040841 


Homo saoiens cnllin 5 fPTTT ^ i mPtslA 


0.23 


5783 


U20807 


Bos taums Drotein tvro^inf* nhnonluitQCP' R A 14 mDM A ^mnioto «j p 

x^v/>9 uiiuuo piui^iu xjriuaixic piiuapiidiaaC X>/\l £ f IIlxvXN/\ ? COmpieie CQ.S 


0.22 


5784 


AE006648 


SulfolobuS SOlfetaricUS Section 7 Cif 979 rrFfhf* rnmnlotp nnnnma 


0.00 / 


5785 


AY018666 


v^i^a wuvd iiuwiuadLciiiLc ivuvouyyi containing (Ai jAiy, closest to 
marker R1713, genomic sequence 


0.008 


5786 


AK026299 


Homo SaDienS cDNA' FLJ22646 fis Hnnp 1-1^1107 17ft 


0.23 


5787 


X91787 


L.luteus mRNA for tRNA-glutamine synthetase 


0.17 


5788 


XM_0 17281 


Homo saoiens endosulfine alnha rFNSAl mPMA 


1 Q 

l.y 


5789 


XM_0 17281 


Homo saoiens endosulfine alnha fFNSA i mRNA 


1 o 

i.y 


5790 


AF229090 


Homo saoiens mvosin VI ^MY06 1 jypnp pynn 1 1 


U.oV 


5791 


AK001945 


Homo saoiens cDNA FLJ 11083 fi^ clnne PT A PF inn ^7 77 




5792 


AF189157 


i^OOt-and— mOlltll rflQfyiCP viniC ( ctrCJin I i nnhmrntoin r^A-mnlntf-M 

* vyvM - c * Ai ^* 111.V/U.U11 uio^ao^ VUU3 ^oii din v-/i ^ puiyproiein gene, complete 
cds 


6.5 


5793 


AK013313 


MuS musculus 10 11 davs emhrvn rDNA RfK'PM fiill l^ncrtli 
enriched library, clone:2810449G22, full insert sequence 


6.1 


5794 


AJ005205 


Homo sapiens 5HT3 gene for serotonin 3 receptor 


0.69 


5795 


NM_0 19083 


Homo sapiens hypothetical protein (FLJ10287), mRNA 


6E-63 


5796 


AK021451 


Homo sapiens cDNA FLJ1 1389 lis, clone HEMBA1O0O557 


2E-54 


5797 


Z34885 


T.pyriformis (CGL) gene for TCPlgamma protein 


0.6 


5798 


] 

AF288408 < 


Rattus norvegicus transient receptor potential Trp4beta mRNA, 
complete cds 


0.081 


5799 


XM_015806 ] 


Homo sapiens hypothetical protein FLJ22621 (FLJ22621), mRNA 


0 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




r> \/at TTC 
if VAJLUxi 


5800 


AF332918 


Plasmodium falciparum Indochina III/CDC erythrocyte binding 

orotein BAEBL fliaehl^ {tptip rnmnlptp r*Hc 


1 o 


5801 


XM 052063 


Homo sapiens hypothetical protein DKFZp547A023 
rDKFZo547A023i mRNA 


U 


5802 


ABO 13796 


Ppratnntpri** riphnrHii mT? XT A for PTJWR^l prtmnlotA r*Ac 
v^^iaiu^L^iia liviiaxixxi iiirviN/A. iui ^ivilDO, COIIipiCie CQS 


0.024 


5803 


AF288408 


rvuLLus liuivc^iwua uctiLaiciii receptor poienuai i rp^+DCia mivINA, 
comrjlete cds 


O.OoZ 


5804 


AF174213 


Terebra subulata 16S ribosomal RNA gene, partial sequence; 
mitochondrial f?ene for mitnrhnTirfrial nrnHnrt 


o ft 


5805 


AF308286 


Homo sapiens serologically defined breast cancer antigen NY-BR-18 
nVRNA nartial cds 


n so 

U.J7 


5806 


AF030199 


MllQ mil SClllllS tvne 1 Sipma rppPTYfor apnp rnmnlptp /*Hc 
1UUI3 uiuawmuo kj^jk/ x oigxiia l^/CJMUl gCllG, dJllipivlC WUa 




5807 


M99412 


Human interleukin-8 receptor (IL8RB) gene, complete cds 


0.51 


5808 


Z34885 




O SQ 


5809 


AF305539 


Mus musculus B lymphocyte induced maturation protein 1 (Prdml) 
f?ene exon 8 and comnlptp pHq 


1 JJ.-0 / 


5810 


AF288408 


xxtiiLus injivc^iL-uo tidiioiciii levcpiur poieniid.1 i rp^t-Deia rnivCNA, 
comnlete cds 


u.uo / 


5811 


AE001181 


Borrelia burgdorferi flection 67 nf 70^ nf thp rrimnlAt** apnnmp 




5812 


AY027894 


Homo sapiens voltage-dependent calcium channel beta 2 subunit 
rCACNB2^ eene exon 2 


\J.\J\J\J\Jo 


5813 


XM 029481 


Homo sapiens similar to KIAA0732 protein (H. sapiens) (LOC90154), 
mRNA 


U.Oo 


5814 


AK021659 


Homo sapiens cDNA FLJ11597 lis, clone HEMBA1003856 


2.1 


5815 


U39721 


iviy vupiaollul gGxxlUxilUlll aCCUUll *tj Ox J 1 Ul II 1C COITipiCie genome 


A 1Q 


5816 


Z73267 


S.cerevisiae chromosome XII reading frame ORF YLR095c 


0.009 


5817 


XM_029453 


Homo sapiens oculocerebrorenal syndrome of Lowe (OCRL), mRNA 


0.26 


5818 


AF027728 


wciiujJLia idcvia KJiiCMii'ICictlCU proicin ^vVl^xilNr^-Ii ) mtvlNA, Complete 

cds 




5819 


AE001407 


Plasmodium falciparum chromosome 2, section 44 of 73 of the 
comnlete seouence 


rt i« 

U. 18 


5820 


AL 157491 


Homo saoiens irVRNA* cDNA DTCFZt»J.^J."K"1 ill ffmm Mnnp 
DKFZd434K1 i in 


OP so 


5821 


XM_002437 


Homo sapiens cAMP-regulated guanine nucleotide exchange factor II 
(CAMP-GEFII), mRNA 


4E-37 


5822 


XM 050146 


Homo saoiens KIAA0472 nrotein fKTAA0479^ niRMA 




5823 


AY032600 


Carica papaya xyloglucan endo-transglycosylase mRNA, complete cds 


0.19 


5824 


Z36061 


S.cerevisiae chromosome II reading frame ORF YBR192w 


3 


5825 


AK021459 


Homo sapiens cDNA FLJ1 1397 As, clone HEMBA1000622 


0.046 


5826 


AF261017 


Nicotiana tabacum putative chloroplast RNA helicase VDL (VDL) 
gene, exons 1 through 5 and complete CDS, alternatively spliced; 
nuclear gene for chloroplast products 


2.1 


5827 


AJ277162 


Beta vulgaris mRNA for putative Cdc2-related protein kinase CRK2 


0.009 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




r> WAT T TC 

r VALUli 


5828 


AL3 90874 


Human DNA sequence from clone RP1 1-238K16 on chromosome 9, 

WV/XULSXwlV OVVJUWl^v YkjYvWj oil IJi Clio J 


U.Uzo 


5829 


S84872 


ouiiiiiiuii LuuciuouiiL lype a pnyiocnrome (c-pnyAj niKJN A, complete 


U.43 


5830 


AF271370 

■CXI. £* 1 X J / \J 


Homo sapiens interphotoreceptor matrix proteoglycan 200 (1MPG2) 

ppti** pvnn 1 0 

gUlb, XV/ 


U.078 


5831 


XM 051183 


Homo <5anien<! smhrvonir w*tnHf»rm Hp\rp1nnmpnt /ccn\ mD\TA 




5832 


BC006523 


rxuiiivj a<ipiciio, ociuiii/giuuuvvorucoiu reguiaieu Kinase .z, cione 
IMAGE 2988475 mRNA 


Jli-34 


5833 


AF100658 


Caenorhabditis elegans cosmid H08G01 


0.44 


5834 


XM 003725 


xxvjxiuj oapiuu jJUlalive llUUllUulCiloC 111 ^aVLN/VoO-J-L* ), JIlr\JNA 


n AAQ 

U.UUo 


5835 


M58029 


miiucua piuicua ajiiiuiuLiu Ddcienum zyKUa. proiein {szyx.) gene, 
complete cds 


0.71 


5836 


U46541 


o idyiiy us auicuo sai/v gene, compieie COS 


U.UUo 


5837 




AfiHinmic amliiviitpTic nlacmi/1 «l""fcT 1ft 
/A^/HJ-lCUlLlO alillslVaXCilo piaMIUU III -/I * 1 1 f 


U.l / 


5838 


X15702 


Fruitfly LA9 mRNA for DNA binding protein 


0.077 


5839 


AL031590 


Human DNA sequence from clone CTA-232D4 on chromosome 
x-^ijxj.x wuiiiaiua a vjoo, wuinpieie sequence [nomo sapiensj 


U. lo 


5840 


X07946 


Yeast plasrnid DNA coding for RNA polymerase subunit 


0.082 


5841 


AL590324 


oiuuaii i_/iNrt. oct^ucxiuc iruiii ciuiic ivri i-^i /yui i on cnromosome iu, 
complete sequence [Homo sapiens] 


0.082 


5842 


XM 004947 


Homo onr>tpn^ naranYnnncp 9 rPfYW?^ mRMA 


1 A 
1.0 


5843 


AF262582 


x^uii^uui mxiuieriu.es sp. ivianaus 100 mnocnoncmai noosomai kin a 
gene, partial sequence; mitochondrial gene for mitochondrial product 


0.067 


5844 


L81839 


noiiio sapiens {suucione z_nj irom ri ri4Jj din a sequence, complete 
sequence 


0.00006 


5845 


U67550 


Methanococcus jannaschii section 92 of 150 of the complete genome 


0.087 


5846 


XM 044987 


Homo sapiens hypothetical gene supported by U00951 (LOC92415), 
mRNA 


0. 1 


5847 


AE001420 


Plasmodium falciparum chromosome 2, section 57 of 73 of the 

t*UXULJXd.G odJUCllv'C 


A *71 
U. / 1 


5848 


AF3 15823 


Stylonychia mytilus DNA-dependent RNA polymerase II largest 

Cllhlinit RPFtl rRPRl^ oe>r\(± riartial rr\c 
auuuxiiL xvx xj i ^i\tdi ) ^CllC, pal Hal Cub 


A ^1 


5849 


AB066207 


Physcomitrella patens chloroplast genes for ribulose-l,5-bisphosphate 
cdTDoxyiase/ oxygenase large suoumt, tKiNA-ATg ana acetyl- v^o A 
carboxylase beta subunit, complete cds 


0.65 


5850 


AF286898 


Plasmodium chabaudi adami carbamoyl phosphate synthetase II gene, 
partial cds 


0.002 


5851 


AL583888 


Human DNA sequence from clone RP1 1-132E18 on chromosome 10, 
complete sequence [Homo sapiens] 


0.028 


5852 


AB011811 


Chromatium vinosum genes for light-harvesting and reaction center 
proteins, partial and complete cds 


6.4 


5853 


AF153448 


Zea mays nitrate reductase (NR1) gene, complete cds 


0.067 
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Table 3A Nearest Neighbor fBlastN vs. Genbank^ 


SEQIE 
NO 


r 

ACCESSN 


DESCRIP 


P VALUE 


5854 


XM_012762 


Homo sapiens hypothetical protein FLJ10891 (FLJ10891), mRNA 


0 


5855 


1 A 

/vrZ4ool4 


Maoricicada cassiope isolate TB-MI-004 tRNA-Asp gene, complete 
sequence; ATPase suburtit 8 gene, complete cds; and ATPase subunit 
6 gene, partial cds; mitochondrial genes for mitochondrial products 


0.029 


5856 


UVJ 1UZI 


Anthocidaris crassispina mRNA for dynein beta-heavy chain, 
complete cds 


2E-22 


5857 




Plasmodium falciparum chromosome 2, section 38 of 73 of the 
complete sequence 


0.54 


5858 




Homo sapiens, clone IMAGE:2959994, mRNA 


1.9 


5859 




Macaca fascicularis brain cDNA clone:QmoA-12555, full insert 
sequence 


0.18 


5860 


JNM_UUo538 


Mus musculus myristoylated alanine rich protein kinase C substrate 
(Macs), mRNA 


1.5 


5861 


AM Uzyzoo 


Homo sapiens sarcoglycan, epsilon (SGCE), mRNA 


0.66 


5862 


AE006685 


Sulfolobus solfataricus section 44 of 272 of the complete genome 


0.37 


5863 


JdCUUoUU 1 


Homo sapiens, clone 1MAGE:35 10538, mRNA 


6E-34 


5864 


A 'DflO/l C11 


Nicotiana tabacum mRNA for TMV response-related gene product, 
complete cds 


2.1 


5865 


UZ j[)2 / 


Glycine max phosphatidylinositol-specific phospholipase C mRNA, 
complete cds 


0.077 


5866 


U50715 


Mus musculus alpha-galactosidase A gene, complete cds 


0.024 


5867 


U64465 


Stylophora pistillata L-rype calcium channel alpha- 1 subunit 
(STPCACHL) mRNA, complete cds 


0.072 


5868 




Homo sapiens similar to hypothetical protein DKFZp566D1346 (H. 
sapiens) (LOC91690), mRNA 


0.023 


5869 


U60135 


Arabidopsis thaliana serine/threonine protein phosphatase 2A-3 
catalytic subunit gene, complete cds 


0.027 


5870 


AF275745 


Lycopersicon esculentum plasma membrane H+- ATPase (LHA2) 
mRNA, complete cds 


1.5 


5871 


Z29521 


C.cnspus Chloroplast gene encoding 16S rRNA, tRNA-Ile, tRNA- 
Ala, and 23 S rRNA (partial) 


0.003 


5872 


AJ132557 


Oryctolagus cuniculus CYP19 gene, ovarian promoter region 


0.24 


5873 


AF124524 


Arabidopsis thaliana gamma-adaptin 1 gene, complete cds 


0.24 


5874 


a vm "2 1 1 1 
AJvU 1 3 1 5 1 


Mus musculus 10, 1 1 days embryo cDNA, RJKEN full-length 
enriched library, clone:2810422B04, full insert sequence 


0.72 


5875 


AB044413 


SllS SCrofa ^M"^^"P-3 ItlRNA for matriY rr\(^\ vk\\r\r\Tc\ie^\T*rkcc^ 1 n4 t4in1 *-Ac 

° vivia iviAvu. j iiixvi^rt iui iiidu iA iiieuuioproieinase- j , parnai cas 


U.ZJ 


5876 


AF247039 < 


Porcine adenovirus 3 171R (El A), 202R (E1B-1), 474R (E1B-2), 
288R(288R), 198R(pIX), 163R* (163R*), 162R(162R), 97R(97R) 
and 184R (184R) genes, complete cds; and IVa2 (IVa2) gene, partial 
xls 


5.2 


5877 


AF060579 i 


Crossypium barbadense clone pXP020 repetitive DNA sequence 


0.055 


5878 


AE006499 t 


Streptococcus pyogenes Ml GAS strain SF370, section 28 of 167 of 
he complete genome 


2.1 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQED 
NO 


ACCESSN 




D \ 7 A T T TC 


5879 


AL 591498 


nuiiidii uls£\ bcc|uciiL.c irom cione ivni i-iULriz on cnromosome 1 J, 

^vnipicic avifUdiirw pnuniu aapicilaj 


0.003 


5880 


NM_0 17066 


Rattus norvegicus Pleiotrophin (Heparine binding factor, Hbnf, in the 
mouse) (Ptn), mRNA 


0.21 


5881 


XM 047505 


numu aapiciia iijpuuicLiwu gene supponea oy /visajzZ'I'oj 
rLOC92818^ mRNA 


n n 
u. / 


5882 


J00912 


chicken brain tubulin, alpha chain mrna 


0.24 


5883 


M15840 


Human interleiikin 1 -hpta (T\ 1 <ypnp rnrrmlptp rvfc 


u.uo 


5884 


X92683 


H saniCTK TTNa<5e T hvr»pr<:piiQitivp citp 


9 1 

Z. 1 


5885 


AL390767 


ii.uiiia.li bcv^uciicc liuin ciuiic i\ri-oorij on cnromosome 1 ipi j- 
14.2 Contains GSSs and ESTs. Contains part of a novel gene, 
complete sequence [Homo sapiens] 


1.8 


5886 


XM_0 12386 


Homo sapiens KIAA0377 gene product (KIAA0377), mRNA 


1E-48 


5887 


AL137466 


tfnmn cnnipnc mRNA • rTVMA TYK"P"7ty4.'* 4U1 799 /fV^m r>1svn<=» 
nuuiu sapicila UirvTN /\, CL/IN av Ur^TzJy^ J*fFH jZZ (ITOm Clone 

DKFZp434H1322) 


0.00001 


5888 


X62745 


7! TTI51VQ {"YRF 1 mR M A fVir <vc-pl f'tTK^nt hi nHi n fT -for+r*-r 1 
Z-j.mayo VyJJi l 1U1 Uvo~ddllClll. UlllUlllg IctCLOl 1 


v.Zj 


5890 


AB048902 


Macaca fascicularis brain cDNA, clone:QnpA- 17448 


0.24 


5892 


AF221070 


Ophiostoma piliferum strain CBS129.32 18S ribosomal RNA gene, 
pdiudi aci|uciiv^c, liiLcriidi udiibL/ri ucu spacer i, ->.oo riDOSomai xvTn a 

OPtlP qtiH int Pinal trait Cffifwl cnorpr 9 rrvmr*1*»t<» cen~tMf*nr*£* 

gene, aiiu iiiiciiiai ucuiociiucu spcit-cr z, woiiipieie secjuence 




5893 


AE006530 


Streptococcus pyogenes Ml GAS strain SF370, section 59 of 167 of 
the complete genome 


0.22 


5894 


AR0186Q8 

n I >v/ J. 0U70 


Broad bean wilt virus 2 gene for precursor polyprotein, complete cds, 

icrtlntp* TP 

ISUlaLC 1JT 


U.UZj 


5895 


U67537 


Methanococcus jannaschii section 79 of 150 of the complete genome 


0.026 


5896 


AJ000742 


Hnmn SaniptK; hicWI <tptip V TTTR 

J. J.VJlllxJ L7cipi Wild 1113X1 1. cbllvj <J \J 1 X\. 


ft ftftR 


5897 


AF247039 

ru. t UJ7 


Porcine adenovirus 3 171R (El A), 202R (E1B-1), 474R (E1B-2), 
zook \zooiy/, lyolv jpiJV;, lOJK^ ^loJK^j, loZK(lozK), y/K(y/K) 

and 184R (184R) genes, complete cds; and IVa2 (IVa2) gene, partial 
cds 




5898 


AJ290290 


Asphodeline lutea chloroplast trnL-trnF intergenic spacer 


0.23 


5899 


XM_042440 


A lUlllVJ aopi^llo u LT viol . UvlavJl tii r\L UCm 1,t" ^dldL/LUIiy III dllbiCI clbC, 

polypeptide 3 (B4GALT3), mRNA 


6.4 


5900 


U78090 


R ciftiic nnrvPOlfiiQ rv^tJlQQiiim ohantipl rp<ni1atr\r 1 mPMA rnmntptA rHc 
i^*iiiu.a iiuivbgivus puuiaaiuiu wiiaiiiivi iCglllalOl 1 IIUVIN /\, tUllipiCLC LUo 


9 1 

Z. 1 


5901 


NMJH7053 


Rattus norvegicus Neuromedin K receptor (Neurokinin B/Tachikin 3) 
(Tac3r), mRNA 


1.9 


5902 


AF208658 


Tapesia yallundae eburicol 14 alpha-demethylase (CYP51) gene, 
complete cds 


6.2 


5903 


AF281786 


HIV-1 isolate DK-g3 from USA gag polyprotein (gag) gene, partial 
cds 


0.63 


5904 


AJ011019 


Capra hircus csn2 gene, exons 1 to 9, allele 0(null) 


2 


5905 


AJ3 11054 


Chironomus pallidivittatus 4L ORF for putative recombinase 
subtelocentric clone, Cp5.5 


0.023 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCbboN 


DESCRIP 


P VALUE 


5906 


Y17466 


Fugu rubripes SIAT3C gene, last three exons 


0.65 


5907 


AKU22127 


Homo sapiens cDNA FLJ12065 fis, clone HEMBB 1002249 


0.001 


5908 


X79100 


C.vicina (Bl 1) mRNA for arylphorin receptor 


0.6 


5909 


AKa)23079 


Homo sapiens cDNA FLJ13017 fis, clone NT2RP3 000628 


0.23 


5910 


AKO 14448 


Mus musculus 18 days pregnant adult female placenta and extra 
embryonic tissue cDNA, RIKEN full-length enriched library, 
clone:3830421J05, full insert sequence 


1.4 


5911 


AK020293 


Mus musculus adult male cecum cDNA, RIKEN full-length enriched 
library, clone:9130404Hll, full insert sequence 


0.23 


5912 


AE006285 


Lactococcus lactis subsp. lactis IL1403 section 47 of 218 of the 
complete genome 


0.25 


5913 


U41292 


Human prolargin (PRELP) gene, 5' flanking sequence and exon 1 


0.001 


5914 


AT 1fi<V7/Y7 


Homo sapiens mRNA full length insert cDNA clone EUROEMAGE 


0.00004 


5915 


XTA/f HOAKA 
JN JYl_UZ*l- 1 j4 


Rattus norvegicus proton gated cation channel ASIC1 (Accn2), 


6.2 


5916 


U79732 


Plasmodium berghei extrachromosomal plastid PB-2, tRNA-Pro, 
tRNA-Glu, tRNA-Lys, tRNA-Asp, tRNA-Ser, tRNA-Tyr, tRNA-Met, 
tRNA-Leu, tRNA-Cys, and tRNA-His genes, complete sequence, rps4 
gene, complete cds, tRNA-Thr gene, complete sequence, and large 
subu> 


0.023 


5917 


Ar098715 


Coniglobus mercatorius 16S ribosomal RNA gene, mitochondrial 
gene encoding mitochondrial product, partial sequence 


0.095 


5918 


Ar095904 


Toxoplasma gondii ycf24 protein (ycf24) gene, partial cds; DNA 
dependent RNA polymerase beta subunit (rpoB) gene, complete cds; 
and DNA dependent RNA polymerase beta 1 subunit (rpoCl) gene, 
plastid genes encoding plastid proteins, partial cds 


0.026 


5919 


XM_027416 


Homo sapiens mesothelin (MSLN), mRNA 


0.7 


5920 


BC004772 


Mus musculus, Similar to inhibitor of kappa light polypeptide gene 
enhancer in B-cells, kinase beta, clone IMAGE:3256890, mRNA 


1.6 


5921 


AK000935 


Homo sapiens cDNA FLJ10073 fis, clone HEMBA100173 1 


0.18 


5922 




Homo sapiens, clone MGC: 12595 JMAGE:4303422, mRNA, 
complete cds 


0.001 


5923 


J05008 


Homo sapiens endothelin-1 (EDN1) gene, complete cds 


0.5 


5924 


AE002174 


Chlamydophila pneumoniae AR39, section 10 of 94 of the complete 
genome 


0.69 


5925 




\^awllUl llaUUlUa Clwgallo lllTvi>/A. xUl wudLLLll r\ 1 raoC 


u. / 


5926 


AF3 10894 


Dictyostelium discoideum prespore-specific protein (pspC) gene, 
partial cds; RacH (racH) gene, complete cds; and BOP (bopA) gene, 
partial cds 


0.027 


5927 


AE003910 


Xylella fastidiosa 9a5c, section 56 of 229 of the complete genome 


0.73 


5928 


AF335424 


Homo sapiens sperm protein Spl7-2 pseudogene, complete sequence 


1.8 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 


DESCRIP 


P VALUE 


5929 


TC\A 016094 


Hnmo ^anien^ hvnothetical nmtein MGP40S4 fMOP40S4i mPNA 




5930 


U62057 


Mycoplasma capricolum NADH oxidase (naox) gene, partial cds, and 
lipoate-protein ligase (Ipla), pyruvate dehydrogenase EI alpha subunit 
ipupdj, pyruvdie ueuyurogendse jci ueui suuuiui ^oupuj, pyruvdie 
dehydrogenase EII (odp2), dihydroiipoamide dehydroge> 


0.72 


5931 


AF202552 


ixoiiio sapiens i^in/\ ineuxyiiransierase ^lyiNivii i j gene, exonb z, j, 
and 4 


2.2 


5932 


a pro in An j. 


xxUIlRJ adpiCIla oxVl/AJLf.J ^OllldU-J ) gCHC, CAUll J) 


u.Uw X 


5933 


ApfiJlJ.1 ^7 

rVV-yUof ID 1 


V^dCILUrililULLllla ClCgdlla vVJalllxU I *rOIi 1 ZDIV, lAJIlipiCLC aCqUCillsC 


0.71 


5934 


VTVyf ftOTOHT 
AJV1_UZ iy\J 1 


Homo sapiens hypothetical protein DKFZp564D172 


0 IX 


5935 


AP"lfCft1Q1 
AT JOUl7l 


Araoiciopsis inaiiana uiiKnown protein yi y\^z>. iju/ai jg^yjou/ 

IIlxvl>/\, CUIIipiCLC CUa 


6.3 


5938 


AF9R979R 


Culex pipiens Twin-Cp5 SINE retroposon, genomic sequence and v- 

QXJ A T5 T7 _T cr(*Y\f* nartinl cpniiPtir'P 
kjlN /AxVU* JXIVC gCHC, LJdX lid! adfUClllrC 


0.066 


5939 


M81702 


Candida boidinii methanol oxidase (AOD1) gene, complete cds 


0.008 


5940 


AF057740 


riomo sapiens i / -ueid-nyoroxy sieroiu aenyurogendse i v vrioJ-/ 1 / xjh- y 
gene, exon 24 and complete cds 


0.21 


5941 




T-Tr»mo cprMAnc PYnnurlpncp M K K-cn f\ {"if^R 1 f\Q 1 1 mR M A 
xxAJliiAJ acipicxia CAvHiu^xciiaG l^ix-rX sp ^x-/v-/v^o L\jy x i ? iiixvi^ 


0.072 


5942 


AB022048 


Mus museums gene for prolyl oligopeptidase, exon 3, 4, 5, 6 


2.1 


5943 




ivvioiaced cmoropidai genes lor dipiid diiu ueia suuunii oi 
piiycuwydiiiii 


2.1 


5944 


AF181251 


Rattus norvegicus lung Kruppel-like factor (Lklf) gene, complete cds 


5.9 


5945 


ArUUlUJ / 


xxOiiio sapiens genuuuu -l/in/\, ciuuiuuauiiic zi, ^iuiicj^huvjii, iviwi- 

T^91Q171 t(*cj\ r\rt Prtrnnl f^tc* cf^nnf^ni^f 1 
J_^Z 1 o 1 / 1 lCglUII, ^unipicic aClJUCllVC 


0 00001 


5946 


AK0 16374 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone:4930589L23, full insert sequence 


5.9 


5947 


M27697 


Pig urate oxidase mRNA, complete cds 


2.1 


5948 




Mnmn ciiTiif*nc cimilar tn DF ^TFR TIM ( APTTKT fjFPOT VKTFRT7T>jrT 
xxUlllU aapiCiia aiiiLiial iu LJx-j\j x ivxiN ^rvv^iix^ L/u«r v/J_/ 1 ivxxj<r\ 1 1 <ii^< vj 


0.69 


5949 


U39688 


Mycoplasma genitalium section 10 of 51 of the complete genome 


0.077 


5950 


AF320250 


Lactobacillus delbrueckii YmdA (ymdA) gene, partial cds; and 
puiauve unaecaprenyi-pnospiiaie iN-aceryi-giucosanunyi udnsieruae 
(rgpG), YvyE (yvyE), putative ComFl protein (comFl), putative 
ComF3 protein (comF3), YvyD (yvyD), putative preprotein translo> 


0.69 


5951 


Y09852 


H. sapiens FGFR3 gene, partial 


2E-10 


5952 


Z79103 


H. sapiens flow-sorted chromosome 6 Hindlll fragment, SC6pA22Fl 1 


0.00002 


5953 


AP000321 


Homo sapiens genomic DNA, chromosome 21q22.1, D21S226-AML 
region, clone:Q82F5, complete sequence 


0.00004 


5954 


AK025042 


Homo sapiens cDNA: FLJ21389 fis, clone COL03455 


2E-66 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


CCA TT\ 


ACCESSN 


DESCRTP 


P VALUE 




U14576 


Dictyostelium discoideum filopodin (talA) gene, complete cds 


0.21 


5956 


AE001405 


Plasmodium falciparum chromosome 2, section 42 of 73 of the 
complete sequence 


0.052 


Dyj 1 


M64861 


D. discoideum TFIID mRNA, complete cds 


0.002 


5958 


AJ245435 


Plasmodium falciparum gc-alpha gene for guanylyl cyclase alpha 


0.0001 




XM 031443 


Homo sapiens phosphodiesterase 8 A (PDE8A), mRNA 


8E-13 




AC079875 


Homo sapiens clone CTD-2333B13, complete sequence 


0.00003 




AF275272 


Schizophyllum commune unknown gene; mitochondrial gene for 
mitochondrial product 


0.44 


5962 


Z60119 


H. sapiens CpG island DNA genomic Msel fragment, clone 18el, 
reverse read cpgl8el.rtlb 


0.008 




AF100654 


Caenorhabditis elegans cosmid C24E9 


0.026 




AE001399 


Plasmodium falciparum chromosome 2, section 36 of 73 of the 
complete sequence 


0.001 




U89019 


Hepatitis C virus polyprotein gene, complete cds 


0.2 




XM_052512 


Homo sapiens ARP3 (actin-related protein 3, yeast) homolog 
(ACTR3), mRNA 


2E-32 


J^O / 


U17634 


Human chromosome 21, Down syndrome critical region transcript, 
SP6 end of clone 3-3-f8 


0.7 


5968 


AF 197477 


Homo sapiens TRBP pseudogene, partial sequence 


0.22 


r>yoy 


U78599 


Streptococcus mutans putative D,D-carboxypeptidase and putative N- 
acety 1 -muramidase genes, partial cds 


2.1 


5970 


AP001414 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:D30, 
LB7T-ERG region, complete sequence 


2.1 


5971 


AJ277067 


Homo sapiens partial SSH3BPl/e3Bl gene for spectrin SH3 binding 
protein, exon 3-5 


5.7 


con 


X13294 


Human mRNA for myb-related gene A-myb 5'-region 


6E-74 


5973 


NC_001796 


Human parainfluenza virus 3, complete genome 


0.053 


5974 


AJ222787 


Hordeum vulgare mRNA for alpha-keto acid dehydrogenase-like 
protein, clone RG136 


0.073 


5975 


AF284034 


Melanogrammus aeglefinus vitellogenin B (VtgB) mRNA, complete 
cds 


2.1 


5976 


AF127565 


Arabidopsis thaliana ubiquitin-protein ligase 2 (UPL2) gene, complete 
cds 


4.3 


5977 


AP001419 


Homo sapiens genomic DNA, chromosome 21q22.2, clone:PAC24K9, 
LB7T-ERG region, complete sequence 


0.0002 


3V /o 


AL590384 


Human DNA sequence from clone RP1 1-349A16 on chromosome 
Xq22.3-24, complete sequence [Homo sapiens] 


0.02 


oy /y 


ABO 14460 


Homo sapiens TSC2, NTHL1/NTH1 and SLC9A3R2/E3KARP genes, 
partial and complete cds 


0.69 


5980 


XM_018334 


Homo sapiens hypothetical protein FLJ22418 (FLJ22418), mRNA 


0.026 


5981 


AF229850 


Arabidopsis thaliana ferritin (Ferl) gene, complete cds 


0.69 


5982 


AKO 17445 


Mus musculus 10 days neonate head cDNA, ROCEN full-length 
enriched library, clone:5530402H04, full insert sequence 


2.1 


5983 


AF197159 


Mus musculus cubilin mRNA, partial cds 


0.7 
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Table 3A Nearest Neighbor (BlastN vs. Genbank) 


SEQID 
NO 


ACCESSN 




O \ T A T T TT7 

r VALUE 


5984 


XM_029346 


Homo sapiens hypothetical gene supported by U79302; AK026607 
(LOC90138) mRNA 


0.026 


5985 


AK021517 


Homo sapiens cDNA FLJ1 1455 fis, clone HEMBA1001497 


0.05 


5986 


Z74823 


S cerevisiae chromosome ^C\J rp^iHiTia fr^mp orf vht nftin; 


U.ZJ 


5987 


XM_003091 


xruiiiu ^dpiciib jvi/\/\uui/i gene product, putative vj-protein-coupled 

recentor* Or rvrotpin ronnlfvl rprpntnr fr%r TTTYP <r1nr»r»c<» /via ahaaia 
icwpiwi , vj ^iiuLcxii ^uupicu icccpiur XOI UJ-/l^-gluCOSe ^JvJLrvAUUU 1 ^ ? 

mRNA 


1.9 


5988 


XM 039450 


Homo sapiens similar to NONHISTONE CHROMOSOMAL 
PROTEIN HMG-14 fH saniensl fLOC9160rVi mRNA 


J.F-J.7 


5989 


XM_045103 


Homo sapiens G protein coupled receptor interacting protein, 
comolement-cla tumor necrosis factor-related (7*slC±'\i y \ mP>JA 




5990 


AK026086 


Homo saoiens cDNA* FLJ22433 fk clone J^RCOQMJ 


\J.\J fo 


5991 


Z22800 


H. sapiens microsatellite repeat 


3E-45 


5992 


AF165136 


iva j ^upidaiiia. iiryc-uiuca liij'L.uiu.co i^v^ iiypouieucai suriace locaieo. 
membrane protein and lipoprotein B precursor (lppB) genes, partial 
cds 




5993 


AF269465 


Stanhvlococcii*; enifJermiHiQ strain 1 plnnp ctF»« 1 ftnj.t»n^ nonntni^ 

u ia|/iijr iuwwuo ^piuu niiuio oLlaill OIvl &ICU. IvW'tIIUJ genomic 

sequence 


U.Zj 


5994 


XM 049474 


Homo saniens KIAA1538 nrotein fiCTA A 1 SIR^ mPXTA 


U.Zj 


5995 


AC001478 


Homo oanieTic fcnlvlonf* 1 \\1 "From RAP ui ii\ t^xta cannon/^ 
a iwniw Dd^/ibiia ^auu^iuiic i it/ Hum d/w^- niizj iviN/v sequence, 

complete sequence 


0.63 


5996 


AF380163 


uLLiucxi^t .rv v 11 us v."* vjuuac/\juiiiigtiong/»>/y / ^ri_) in i)) segment j 
Dolvmenme fPA^ pene comnlete r-Ao. 

J uav y-*- J girlie, WlilLPl^LC vUj 


o 


5997 


Z74263 


S cerevisiae chromosome TV re^Hina framp fYRPT VTiT 91 


O 

U. JO 


5998 


AF282059 


Homo sapiens clone 2qteI_edcl7fR sequence 


7.9 


5999 


AF233898 


Escherichia coli strain KT121R RfhA Hifh apnp nartial rHc 


U.U/ D 


6000 


M92086 


AVasri venom nrotein mRNA Qennenr*e 


1 *7 
1. / 


6001 


XM 006601 


Homo sanien<; hvnothetinal nrotein FT TIO^ISO rFT TIOA^Q^ mP\TA 


U 


6002 


D14074 


Bovine mRNA for cyclophilin, complete cds 


0.23 


6003 


AC079391 


Homo Sapiens clone CTD-224^M1 1 romnlete cernienr-p* 


/i 


6004 


AE006055 


Pasteurella milltOCida PM70 section 7? of ? ft 4 of the rnmnlptp apnAmp 


U.U / / 


6005 


Y09000 


R.norvegicus mRNA for dendrin 


0.18 


6006 


AK006991 


Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone: 1700084F23, full insert sequence 


2.1 


6007 


AF3 19636 


Pelargonium peltatum cultivar Guenievre microsatellite Pp3a 
sequence 


0.001 


6008 


D 17668 


Clostridium septicum gene for alpha-toxin, complete cds 


0.077 


6009 


AK025193 


Homo sapiens cDNA: FLJ21540 fis, clone COL06156 


0.025 


6010 


X17191 


E.gracilis chloroplast RNA polymerase rpoB-rpoCl-rpoC2 operon 


0.22 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


4 


12721131 


gi|12721131|gb|AAK02908.1| (AE006121) unknown 
[Pasteurella multocida] 


0.14 


6 


1504016 


gi|1504016|dbj|BAA13207.1| (D86971) no similarities 
to reported gene products [Homo sapiens] 


0.0003 


7 


12838708 


gi|12838708|dbj|BAB24303.1| (AK005889) putative 
[Mus musculus] 


5E-74 


8 


12854246 


gi|12854246|dbj|BAB29972.1| (AK015778) putative 
[Mus musculus] 


2.2 


9 


14715574 


gi|14715574|dbj|BAB62041.1| (AB065434) PRPK 
mutant 1 [Homo sapiens] 


4E-80 


10 


6678964 


gi|6678964|reflNP_032671.1|murinoglobulin 1 [Mus 
musculus] sp|P28665|A2Ml_MOUSE 
MURINOGLOBULIN 1 PRECURSOR (MUG1) 
gb|AAA73048.1| (M65736) [Mouse murinoglobulin 
mRNA, complete cds.], gene product [Mus musculus] 


4.2 


14 


11351090 


gi|l 135 1090|pir||E82999 probable beta-ketoacyl 
synthase PA5174 [imported] - Pseudomonas aeruginosa 
(strain PAOl) gb|AAG08559.1|AE004930_5 
(AE004930) probable beta-ketoacyl synthase 
[Pseudomonas aeruginosa] 


9.1 


15 


9506761 


gi|9506761|reflNP_062298.1| glucosamine-phosphate N- 
acetyltransferase; glucosamine-6-phosphate 
acetyltransferase [Mus musculus] emb|CAA04463.1| 
(AJ001006) EMeg32 protein [Mus musculus] 
dbj|BAB22 120.1| (AK002466) putative [Mus musculus] 
dbj|BAB25 161.1| (AK007647) putative [Mus musculus] 
dbj|BAB25212.1| (AK007722) putative [Mus musculus] 
dbj|BAB25240.1| (AK007764) putative [Mus musculus] 
dbj|BAB25749.1| (AK008566) putative [Mus musculus] 
dbj|BAB27395.1| (AK01 1098) putative [Mus musculus] 
dbj|BAB3 1241.1| (AK0 18499) putative [Mus musculus] 


1E-84 


20 


9369403 


gi|9369403|gb|AAF87151.1|AC002423_16(AC002423) 
T23E23.19 [Arabidopsis thaliana] 


7.4 


24 


12862430 


gi|12862430|dbj|BAB32466.1| (AB047280) Gag-like 
protein [Tricholoma matsutake] 


2.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 

N . 


DESCRIPTION 


P VALUE 


25 


7434102 


gi|7434 1 02 |pir||H7 1 620 aspartate transaminase (EC 
2.6.1.1) PFB0200c [similarity] - malaria parasite 
(Plasmodium falciparum) gb|AAC71829.1| (AE001380) 
aspartate aminotransferase [Plasmodium falciparum] 


0.73 


26 


14721245 


gi|14721245|reflXP_044013.1| kinesin-like protein 2 
[Homo sapiens] 


8E-91 


27 


9506713 


gi|9506713|ref|NP_061856.1| nucleolar protein family 
A, member 1 ; H/ACA small nucleolar RNPs protein 1 
[Homo sapiens] ref|XP_003601.1| nucleolar protein 
family A, member 1 (H/ACA small nucleolar RNPs) 
[Homo sapiens] ref|XP_039016.1| nucleolar protein 
family A, member 1 (H/ACA small nucleolar RNPs) 
[Homo sapiens] ref|NP_127460.1| nucleolar protein 
family A, member 1 ; H/ACA small nucleolar RNPs 
protein 1 [Homo sapiens] emb|CAB76563.1| 
(AJ276003) GAR1 protein [Homo sapiens] 
gb|AAH03413.1|AAH03413 (BC003413) nucleolar 
protein family A, member 1 (H/ACA small nucleolar 
RNPs) [Homo sapiens] 


4 


28 


14150082 


gi|14150082|ref|NP_l 15691. 1| hypothetical protein 
MGC4399 [Homo sapiens] gb|AAH04991.1|AAH04991 
(BC004991) Similar to RIKEN cDNA 5730438N18 
gene [Homo sapiens] 


0.001 


29 


12861366 


gi|12861366|dbj|BAB32182.1| (AK020701) putative 
[Mus musculus] 


5E-42 


32 


14739173 


gi| 14739 1 73 |refjXP_03753 1 . 1 1 59395 [Homo sapiens] 


3E-23 


36 


7513572 


gi|75 13572|pir||S72438 phosphatidylserine 
decarboxylase (EC 4.1.1.65) precursor, mitochondrial - 
Chinese hamster 


e-112 


37 


7488381 


gi|7488381|pir||T08408 transcription factor homolog 
Fl 8B3 .150- Arabidopsis thaliana emb|CAB429 1 6. 1 1 
(AL049862) transcription factor-like protein 
[Arabidopsis thaliana] 


8.1 


38 


12849752 


gi|12849752|dbj|BAB28466.1| (AK012782) putative 
[Mus musculus] 


5E-89 


39 


14715574 


gi|14715574|dbj|BAB62041.1| (AB065434) PRPK 
mutant 1 [Homo sapiens] 


1E-88 


40 


12832845 


gi|12832845|dbj|BAB22281.1| (AK002682) putative 
[Mus musculus] 


4E-51 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


43 


12832845 


gi|12832845|dbj|BAB22281.1| (AK002682) putative 
[Mus musculus] 


6E-45 


47 


1 1558040 


gi| 1 1558040|emb|CAC 1 7800. 1 1 (AJ245890) 
hypothetical protein [Polymyxa betae] 


7.6 


48 


8052306 


gi|8052306|emb|CAB92239.1| (AJ242721) AIP37 
protein [Mus musculus] 


1.3 


49 


13812380 


gi| 138 12380|ref[NP_l 13498. 1 j N-myristoyltransferase 
[Guillardiatheta] emb|CAC27067.1| (AJ010592) N- 
myristoyltransferase [Guillardia theta] 


7 


51 


8885549 


gi|8885549|dbj|BAA97479.1| (AB025604) telomere 
repeat-binding protein [Arabidopsis thaliana] 


1.5 


52 


14727244 


gi|14727244|ref|XP_035596.1| hypothetical protein 
AF140225 [Homo sapiens] reflXP_035597.1| 49482 
[Homo sapiens] gb|AAH07829. 1|AAH07829 
(BC007829) Similar to hypothetical protein AF140225 
[Homo sapiens] 


6E-58 


57 


14737646 


gi|14737646|re£|XP_038048.1| similar to APICAL 
ENDOSOMAL GLYCOPROTEIN PRECURSOR (R. 
norvegicus) [Homo sapiens] 


0.003 


58 


13811699 


gi| 138 1 1 699|reffNP_109668. 1 1 zinc finger protein 313; 
zinc finger protein 228 [Mus musculus] 
gb|AAG01141.1|AF282919_l (AF282919) Zfp228 
[Mus musculus] 


0.00002 


59 


14042394 


gi|14042394|dbj|BAB55226.1| (AK027604) unnamed 
protein product [Homo sapiens] 


e-112 


60 


14769036 


gi|14769036|refjXP_036091.1| 19140 [Homo sapiens] 


9E-97 


61 


13648339 


gi|13648339|reflXP_003184.2| zinc finger protein 
ANC_2H01 [Homo sapiens] 


0.0006 


64 


13638702 


gi|13638702|ref]XP_015971.1| hypothetical protein 
FU23384 [Homo sapiens] reflXP_048858.1| 
hypothetical protein FLJ23384 [Homo sapiens] 


2E-86 


65 


14150082 


gi| 141 5 0082|ref|NP_l 1 5 69 1 . 1 1 hypothetical protein 
MGC4399 [Homo sapiens] gb|AAH04991.1|AAH04991 
(BC004991) Similar to RIKEN cDNA 5730438N18 
gene [Homo sapiens] 


0.00003 


67 


14727244 


gi|14727244|ref|XP_035596.1| hypothetical protein 
AF140225 [Homo sapiens] reflXP_035597.1| 49482 
[Homo sapiens] gb|AAH07829. 1 |AAH07829 
(BC007829) Similar to hypothetical protein AF 140225 
[Homo sapiens] 


2E-38 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


68 


14727244 


gi|14727244|ref)XP_035596.1| hypothetical protein 
AF140225 [Homo sapiens] reflXP_035597.1| 49482 
[Homo sapiens] gb|AAH07829.1|AAH07829 
(BC007829) Similar to hypothetical protein AF 140225 
[Homo sapiens] 


6E-60 


69 


14715574 


gi|14715574|dbj|BAB62041.1| (AB065434) PRPK 
mutant 1 [Homo sapiens] 


2E-98 


70 


7491888 


gi|7491888|pir||T41272 hypothetical protein 
SPCC297.05 - fission yeast (Schizosaccharomyces 
pombe) emb|CAB40785. 1| (AL049609) hypothetical 
protein [Schizosaccharomyces pombe] 


9.3 


71 


12843000 


gi|12843000|dbj|BAB25820.1| (AK008666) putative 
[Mus museums] 


1E-86 


72 


13542310 


gi| 135423 10|ref|NP_l 1 1998. 1| Uncharacterized 
conserved protein [Thermoplasma volcanium] 


6.6 


73 


14728906 


gi|14728906|refiXP_003327.3| hypothetical protein 
FLJ10858 [Homo sapiens] 


7E-98 


74 


14749106 


gi|14749106|ref|XP_031512.1| 14599 [Homo sapiens] 
dbj|BAB15298.1| (AK025963) unnamed protein product 
[Homo sapiens] 


1E-26 


76 


1176169 


gi|l 176169|sp|P28636|YHBE_ECOLI 
HYPOTHETICAL 35.0 KDA PROTEIN IN DACB- 
RPMA INTERGENIC REGION pir||B65109 
hypothetical 35.0K protein (dacB-rpmA intergenic 
region) - Escherichia coli gb|AAA57985.1| (U 18997) 
ORFJ321 [Escherichia coli] gb|AAC76216.1| 
(AE000399) orf, hypothetical protein [Escherichia coli 
K12] gb|AAG58318.1|AE005547_4 (AE005547) orf, 
hypothetical protein [Escherichia coli 0157:H7 
EDL933] dbj|BAB37486.1| (AP002564) hypothetical 
protein [Escherichia coli 0157:H7] 


0.23 


77 


13161357 


gi|13161357|dbj|BAB32948.1| (AP002908) hypothetical 
protein-similar to Arabidopsis thaliana F1P2. 140 
[Oryza sativa] 


6.3 


78 


8922806 


gi|8922806|ref[NP_060761.1| hypothetical protein 
FLJ10986 [Homo sapiens] dbj|BAA91940.1| 
(AK001848) unnamed protein product [Homo sapiens] 
gb|AAH00610.1|AAH00610 (BC000610) hypothetical 
protein FLJ 10986 [Homo sapiens] 


5E-83 


82 


14603356 


gi|14603356|gb|AAH10136.1|AAH10136(BC010136) 
Unknown (protein for MGC:19513) [Homo sapiens] 


1E-74 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


83 


12804667 


gi|12804667|gb|AAH01758.1|AAH01758 (BC001758) 
hypothetical protein FLJ20758 [Homo sapiens] 


0.000000005 


90 


9795608 


gi|9795608|gb|AAF98426. 1|AC021044_5 (AC021044) 
Unknown protein [Arabidopsis thaliana] 


0.7 


91 


13631937 


gi|13631937|ref|XP 001412.31 metal-reeulatorv 
transcription factor 1 [Homo sapiens] 


0.0004 


92 


14574479 


g i|14574479|gb|AAF60869.2| (AC024856) Hypothetical 
protein Y71G10AR.2 [Caenorhabditis elegans] 


0.15 


93 


14970562 


gi|14970562|emb|CAC44371.1| (AJ292465) WDR9 
protein, form A [Homo sapiens] 


2E-16 


98 


12855751 


gi|12855751|dbj|BAB30445.1| (AK016814) putative 
[Mus musculus] 


3.1 


102 


11498284 


eill 1498284lreflNP 069S 10 11 aH^nvlatA L-in^cp f**rtv\ 
&*i 1 a^-?«a-o^|iwj.|i^x uu7Jiv;. i| ducnyiaie Kinase (auK) 

[Archaeoglobus fiilgidus] sp|029581|KAD ARCFU 

ADENYLATE KINASE (ATP-AMP 

TRANSPHOSPHORYLASE) oirllD69334 adenvlate 

kinase (EC 2.7.4.3) - Archaeoglobus fiilgidus 

gb|AAB90565. 1| (AE001058) adenylate kinase (adk) 

Archaeoglobus fiilgidus] 


4.00E-13 


104 


9910248 


gi|9910248|ref|NP_064579.1| GL004 protein [Homo 
sapiens] gb|AAF86949.1|AF226049_l (AF226049) 
GL004 [Homo sapiens] 


1 00E-85 


105 


7302191 


gi|7302191|gb|AAF57287.1| (AE003784) CG7856 gene 
product [Drosophila melanogaster] 


1 00E+00 

A ,V/V/J—i ' \J\J 


106 


12005513 


gil 120055 13|gb|AAG44486 1IAF246239 1 fAF246239^ 
AD030 [Homo sapiens] gb|AAG44658.1|AF258660 1 
(AF258660) AD033 [Homo sapiens] 


2 00E-73 


108 


13874435 


gi|13874435|dbj|BAB46923.1| (AB000782) cerebral 
protein- 10 [Homo sapiens] 


4 00E-04 


109 


14783375 


gi|14783375|reflXP_045533. 1| protein predicted by 
clone 23627 [Homo sapiens] 


1.60E-01 


110 


J 

7229532 


gi|7229532|gb|AAF42865 . 1 |AF225924_1 (AF225924) 
staufen [Drosophila virilis] 


3.30E+00 


112 


i 

14775931 


gi|14775931|ref|XP_049935.1| hypothetical protein 
FLJ 14950 [Homo sapiens] 


3.00E-77 


114 


1 

10140758 


?i|10140758|gb|AAG13589.1|AC051633_5 

[AC05 1633) putative ubiquitin protein [Oryza sativa] 


2.10E+00 
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N 


DESCRIPTION 
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115 


421058 


gi|421058|pir||S34255 hypothetical protein la - 
Escherichia coli emb|CAA51036.1| (X72295) ORFla 
[Escherichia coli] 


6.50E+00 




116 


14775931 


gi| 1477593 l|ref|XP_049935.1| hypothetical protein 
FLJ14950 [Homo sapiens] 


2.00E-74 




117 


14781316 


gi|14781316|ref|XP_032533.1| hypothetical protein 
FLJ12787 [Homo sapiens] 


e-101 




118 


13161938 


gi|13161938|emb|CAC32991.1| (AJ309861) putative 
protein kinase WNK4 [Homo sapiens] 


1.00E-30 








gi|7662028|ref|NP_055557.1| KIAA0255 gene product 
[Homo sapiens] ref)XP_009540.1| KIAA0255 gene 
product [Homo sapiens] sp|Q92544|T9S4_HUMAN 

TRANSMEMBRANE 9 SUPERFAMILY PROTEIN 




a 






MEMBER 4 dbj|BAA 13385.1| (D87444) Similar to 




W 

rll 


119 


7662028 


S.cerevisiae EMP70 protein precursor (S25 1 10) [Homo 
sapiens] emb|CAB75607.2| (AL049539) dJ836N17.2 
(KIAA0255 protein) [Homo sapiens] 


7.00E-14 




120 


10179324 


gi|10179324|dbj|BAB13674.1| (AB04 1351) type IV 
collagen alpha 6 chain [Mus musculus] 


7.9 


%? * 

ca 


122 


7160119 


gi|71601 19|emb|CAB76306. 1| (AL158057) putative 
acetyltransferase (fragment). [Streptomyces coelicolor 
A3(2)] 


0.1 


r ■ 

Cm 


123 


14732244 


gi|14732244|reflXP_039960.1| PC3-96 protein [Homo 
sapiens] ref|XP_039961.1| PC3-96 protein [Homo 

sapiens] pir||T46276 hypothetical protein 
DKFZp564Ml 178.1 - human emb|CAB70781.1| 
(AL 1375 15) hypothetical protein [Homo sapiens] 
gb|AAG35611.1|AF202092_l (AF202092) PC3-96 
[Homo sapiens] 


9E-81 




124 


7020475 


gi|7020475|dbj|BAA91 144. 1| (AK000407) unnamed 
protein product [Homo sapiens] 


2.6 




125 


14775931 


gi|14775931|ref]XP_049935.1| hypothetical protein 
FLJ14950 [Homo sapiens] 


2E-72 




127 


226135 


gi|226135|prfl|141 1303A GABA receptor alpha2 [Bos 
taurus] 


0.046 




136 


4995818 


gi|4995818|emb|CAB44313.1| (AJ131899) proline rich 
synapse associated protein 1 [Rattus norvegicus] 


0.00003 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


138 


13357869 


gi|13357869|ref]NP_078143. 1| unique hypothetical 
[Ureaplasma urealyticum] pir||D82907 hypothetical 
protein UU309 [imported] - Ureaplasma urealyticum 
gb|AAF30718.1|AE002128_6 (AE002128) unique 
hypothetical [Ureaplasma urealyticum] 


2.7 


139 


1076802 


gi|1076802|pir||S49915 extensin-like protein - maize 
emb|CAA84230.1| (Z34465) extensin-like protein [Zea 
mays] prf|[21 1 1476A extensin-like domain [Zea mays] 


1.9 


140 


11466208 


gi|l 1466208|refINP_066531.1| NADH dehydrogenase 

subunit 4 [Naegleria gruberi] 
gb|AAG17809.1|AF288092_34 (AF288092) NADH 
dehydrogenase subunit 4 [Naegleria gruberi] 


6.6 


145 


7490290 


gi|7490290|pir||T38644 conserved hypothetical protein 
SPAC323.07c - fission yeast (Schizosaccharomyces 
pombe) emb|CAB534 10.1| (AL109988) conserved 
hypothetical protein; UPF0013 [Schizosaccharomyces 
pombe] 


0.78 


147 


5813770 


gi|5813770|gb|AAD52006.1|AF017304_l (AF017304) 
FMVIB [Morone saxatilis] 


6.8 


149 


2497311 


gi|24973 1 l|sp|P55803|MOG_BOVIN MYELIN- 
OLIGODENDROCYTE GLYCOPROTEIN 
PRECURSOR pir||A47712 myelin/oligodendrocyte 
glycoprotein precursor - bovine 


0.33 


152 


609342 


gi|609342|gb|AAA58698.1| (U04946) nucleophosmin- 
anaplastic lymphoma kinase fusion protein [Homo 
sapiens] 


2.7 


153 


7481152 


gi|7481 152|pir||T36795 probable penicillin acylase - 
Streptomyces coelicolor emb|CAB46792.1| (AL09681 1) 
putative penicillin acylase [Streptomyces coelicolor 
A3(2)] 


8 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


154 


12229694 


gi|12229694|sp|O93830|BET2_CANAL TYPE II 
PROTEINS GERANYLGERANYLTRANSFERASE 
BETA SUB UN IT (TYPE II PROTFTIM OFR ANVI - 
GERANYLTRANSFERASE BETA SUB UNIT) 
(GGTASE-II-BETA) (PGGT) (YPT1/SEC4 
PROTEINS GERANYLGERANYLTRANSFERASE 
BETA SUBUNIT) dbj|BAA35193. 1| (AB021 171) Beta 
subunit of geranylgeranyl transferase type2 [Candida 
albicans] 


4.7 


156 


9630058 


gi|9630058|reflNP 046276 11 unknown TOrevia 
pseudotsugata single capsid nuclear polyhedrosis virus] 
sp|O10359|Y120_NPVOP HYPOTHETICAL 9.3 KD 
PROTEIN (ORF120) pir||T10389 hypothetical protein 
120 - Orgyia pseudotsugata nuclear polyhedrosis virus 
gb|AAC591 19.1| (U75930) unknown [Orgyia 
pseudotsugata single capsid nuclear polyhedrosis virus] 


8.5 


157 


9965966 


gi|9965966|gb|AAG10219.1|AF294433_l (AF294433) 
coat protein [Alfalfa mosaic virus] 


7 


158 


7296047 


gi|7296047|gb|AAF5 1343.1] (AE003585) CG17711 
gene product [Drosophila melanogaster] 


4.6 


161 


13507856 


gi|13507856lrefINP 109805 11 ribosomal nrotein I 20 
Mycoplasma pneumoniae] sp|P78023|RL20_MYCPN 
SOS RIBOSOMAL PROTEIN L20 pir||S73363 
ribosomal protein L20 - Mycoplasma pneumoniae 
(strain ATCC 29342) gb|AAG34734.1|AE000004_3 
(AE000004) ribosomal protein L20 [Mycoplasma 
pneumoniae] 


0.86 


162 


8393641 


gi|8393641|reflNP 058889.11 kynurenine 
aminotransferase II [Rattus norvegicus] 
emb|CAA90507.1| (Z50144) kynurenine/alpha- 
aminoadipate aminotransferase [Rattus norvegicus] 


4.2 


169 


1515448 


gi|1515448|gb|AAC45559.1| (U63096) Description: 
pBF4 gene involved in Bacteroides spp. conjugal 
transfer [Bacteroides fragilis] 


2.6 


175 


422832 


gi|422832|pir||B46629 mucin 6, gastric (3-repeat clone) - 
human (fragment) gb|AAB61945. 1| (L07518) mucin 
Homo sapiens] 


3.2 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


176 


7504075 


gi|7504075|pir||T22584 hypothetical protein F53F4.6 - 
Caenorhabditis elegans emb|CAB01215.1| (Z77663) 
F53F4.6 [Caenorhabditis elegans] 


6.6 


178 


13385538 


gi|13385538|reflNP_080316.1| RIKEN cDNA 
2810036K01 gene [Mus musculus] dbj|BAB28520.1| 
(AK0 12865) putative [Mus musculus] 


2E-48 


180 


902377 


gi|902377|gb|AAA82981.1| (U18059) polyprotein 
[pestivirus type 1] 


0.53 


181 


13421499 


gi|13421499|gb|AAK22335.1| (AE005708) conserved 
hypothetical protein [Caulobacter crescentus] 


6.2 


187 


10803574 


gi|10803574|ref]NP_045972.1| GvpN [Halobacterium 
sp. NRC-1] sp|Q9HI16|GVNl_HALNl GVPN 
PROTEIN 1 pir||A47053 gas-vesicle operon protein 
gvpN [imported] - Halobacterium salinarum pir||T08244 
gas-vesicle operon protein gvpN - Halobacterium sp. 
(strain NRC-1) plasmid pNRClOO gb|AAC8281 1.1| 
(AF016485) GvpN [Halobacterium sp. NRC-1] 
gb|AAD 15044.1| (L03361) gas vesicle protein [Plasmid 
pNRClOO] gb|AAG20728.1| (AE005142) GvpN 
protein, cluster A; GvpNl [Halobacterium sp. NRC-1] 


7.40E-01 


192 


7662532 


gi|7662532|reflNP_054838.1| PRO0195 protein [Homo 
sapiens] gb|AAF24025.1|AF090901_l (AF090901) 
PRO0195 [Homo sapiens] 


2.60E-01 


194 


5360226 


gi|5360226|dbj|BAA36472.1| (AB015177) FO-ATPase 
subunit 6 [Beta vulgaris] 


2.3 


196 


3024921 


gi|3024921|sp|Q583 12|Y902_METJA 
HYPOTHETICAL PROTEIN MJ0902 pir||F64412 

hypothetical protein MJ0902 - Methanococcus 
jannaschii gb|AAB98907. 1 1 (U67533) M. jannaschii 
predicted coding region MJ0902 [Methanococcus 
jannaschii] 


6.5 


198 


7522108 


gi|7522108|pir||T29097 pro-pol-dUTPase polyprotein - 
murine endogenous retrovirus ERV-L (fragment) 
emb|C AA73251.il (Y12713) protease; reverse 
transcriptase; RNaseH; integrase; dUTPase; Pro-Pol- 
dUTPase polyprotein [Mus musculus] 


1.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


199 


9955592 


gi|9955592|emb|CAC00653.1| (AJ292519) N-acylamino 
acid racemase [Amycolatopsis orientalis subsp. lurida] 


4.5 


203 


4097932 


gi|4097932|gb|AAD 10321.1| (U72240) unknown 
[Choristoneura fumiferana nucleopolyhedrovirus] 


6.5 


205 


11252252 


gi|11252252|pir||D82079 pyruvate dehydrogenase, El 
component VC2414 [imported] - Vibrio cholerae (group 
Ol strain N16961)gb|AAF95557.1| (AE004311) 
pyruvate dehydrogenase, El component [Vibrio 
cholerae] 


3.5 


206 


12963481 


gi|12963481|ref]NP 061927 11 hvDothetical orotein 
MGC5560; hypothetical protein [Homo sapiens] 
dbj|BAB 14342.1| (AK022978) unnamed protein product 
[Homo sapiens] 


3E-73 


207 


14737646 


gi|14737646|ref|XP_038048. 1| similar to APICAL 
ENDOSOMAL GLYCOPROTEIN PRECURSOR (R. 
norvegicus) [Homo sapiens] 


4 


209 


14742770 


gi|14742770|ref|XP_039393.1| KIAA1550 protein 
[Homo sapiens] 


8.4 


213 


12643499 


gi|12643499|sp|P89202|RRPO_SHMV RNA- 
DIRECTED RNA POLYMERASE (186 KDA 
PROTEIN) [CONTAINS: 
METHYLTRANSFERASE/RNA HELICASE 
(MT/HEL) (128 KDA PROTEIN)] 


6.5 


223 


9629141 


gi|9629141|reflNP_044299. 1| putative transport protein 

SovHean cfilnrnttr mnttlp virncl 

sp|P15631|VMP_SOCMV MOVEMENT PROTEIN 
(CELL-TO-CELL TRANSPORT PROTEIN) (ORF 
IA) pir||JS0379 hypothetical 35.5K protein - soybean 
chlorotic mottle virus emb|CAA33833.1| (X15828) 
putative transport protein [Soybean chlorotic mottle 
virus] 


4.3 


226 


7494266 


gi|7494266|pir||T18485 hypothetical protein C0840w - 
malaria parasite (Plasmodium falciparum) 
emb|CABl 1124.11 (Z98551) putative P-type ATPase 
Plasmodium falciparum] 


2.3 


228 


14042283 


gi|14042283|dbj|BAB55184.1| (AK027535) unnamed 
protein product [Homo sapiens] 


2E-30 


229 


14744326 


gi|14744326|ref|XP_044812.1| NIMA (never in mitosis 
gene a)-related kinase 6 [Homo sapiens] 


4E-13 


232 


] 

98014 


gi|98014|pir||Sl 1 148 amiA protein - Streptococcus 
pneumoniae 


10 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


233 


10178317 


gi|10178317|emb|CAC08403 11 (AL121928) 
bA18I14.5.1 (novel protein isoform 1) [Homo sapiens] 


7 


234 


630444 


gi|630444|pir||S43955 probable NADH dehydrogenase 
(ubiquinone) (EC 1.6.5.3) chain 3, kinetoplast - 
Trypanosoma brucei mitochondrion (fragment) 
gb|AAA20887.1| (L26251) CR5 [Trypanosoma brucei] 


6.2 


239 


7505826 


gi|7505826|pir||T25816 hypothetical protein K12D9.2 - 
Caenorhabditis elegans 


1.4 


240 


7434912 


gi|74349 12|pir||H7 1 934 phosphatidylglycerophosphate 
synthase - Helicobacter pylori (strain J99) 
gb|AAD05990.1| (AE001475) 
PHOSPHATIDYLGLYCEROPHOSPHATE 
SYNTHASE [Helicobacter pylori J99] 


5.8 


243 


6136155 


gi|6 1 36 1 55|sp|O70546|UTX_MOUSE 
UBIQUITOUSLY TRANSCRIBED X 
CHROMOSOME TETRATRICOPEPTIDE REPFAT 
PROTEIN (UBIQUITOUSLY TRANSCRIBED TPR 
PROTEIN ON THE X CHROMOSOME) 
emb|CAA05692.1| (AJ002730) UTX [Mus musculus] 


8.90E+00 


246 


2143962 


gi|2143962|pir||I59422 rsec8 - rat (fragment) 
gb|AAC52265.1| (U32498) rsec8 [Rattus norvegicus] 


1E-93 


250 


6136155 


gi|6 1 36 1 55|sp|O70546|UTX_MOUSE 
UBIQUITOUSLY TRANSCRIBED X 
CHROMOSOME TETRATRICOPEPTIDE REPEAT 
PROTEIN (UBIQUITOUSLY TRANSCRIBED TPR 
PROTEIN ON THE X CHROMOSOME) 
emb|CAA05692.1| (AJ002730) UTX [Mus musculus] 


7.1 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins^ 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


255 


12737603 


gi|12737603|ref|XP_006839.2| activin A receptor type 11 
like 1 [Homo sapiens] ref]XP_050707.1| activin A 
receptor type Il-like 1 [Homo sapiens] 
sp|P37023|KIR3_HUMAN SERINE/THREONINE- 
PROTEIN KINASE RECEPTOR R3 PRECURSOR 
(SKR3) fACTIVTN RFfFPTDR-T TK"F ktntace i\ 
(ALK-1) (TGF-B SUPERFAMILY RECEPTOR TYPE 
I) (TSR-I) pir||A4943 1 activin/TGF-beta-like type I 
receptor - human eb|AAA16160 11 (LI 70751 TGF-h 
superfamily receptor type I [Homo saniensl 
gb|AAB61900. 1| (U77713) activin receptor like kinase 1 
[Homo sapiens] 


7.50E+00 


265 


2133974 


gi|2133974|pir||S68425 SoxPl protein - rainbow trout 
dbj|BAAl 1868.11 (D83256) SoxPl [Oncorhynchus 
mykiss] 


0.18 


266 


9838427 


gi|9838427|ref|NP 064041. 1| orf2 14 [Beta vulgaris] 


2.2 


268 


5731737 


gi|5731737|dbj|BAA83337.1| (AB021878) similar to 
yeast sodium/proton exchanger [Oryza sativa] 


3.3 


271 


13642312 


gi|13642312|ref|XP_002895.2| parathyroid hormone 
receptor 1 [Homo sapiens 1 reflXP 033742 1 1 
parathyroid hormone receptor 1 [Homo sapiens] 
ref|XP_033743J| parathyroid hormone receptor 1 
Homo sapiens] 


9 


272 


11545138 


gi|11545138|emb|CAC08390.2| (AL121886) 
dJ1028D15.1 (continued from dJ138B7.1 in 
Em:Z98752) [Homo sapiens] 


0.0000008 


275 


6323677 


ei!6323677lreflNP 0 1 ^748 1 1 Vmrfnd™ 

Saccharomyces cerevisiae] sp|Q05131|YMS4_YEAST 
HYPOTHETICAL 48 4 KD PROTEIN IN ARP9- 
IMP2 INTERGENIC REGION pir||S53951 probable 
membrane protein YMR034c - yeast (Saccharomyces 
cerevisiae) emb|CAA89150.1| (Z49213) unknown 

Saccharomyces cerevisiae] 


6.6 


277 


7297043 


gi|7297043|gb|AAF523 12.1| (AE003612) CG13992 
gene product [Drosophila melanogaster] 


8.1 


282 


1616595 


gi|1616595|emb|CAA41710.1| (X58907) steroid 21- 
monooxygenase [Homo sapiens] 


6.8 


283 


14089610 


gi|14089610|emb|CAC13370.1| (AL445563) unknown; 
predicted coding region [Mycoplasma pulmonis] 


2.1 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins^ 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


284 


139809 


gi|139809|sp|P27571|XIST_MOUSE X INACTIVE 
SPECIFIC TRANSCRIPT PROTEIN pir||S 15433 

hvDOthetical nrntpin - mnncp *»tnViir^ A AA 1 07ft 1 1 

(X59289) ORF [Mus musculus] prf||1711440A xist 
gene [Mus musculus] 


0.18 


285 


11355576 


gi|l 1355576|pir||T4433 1 hypothetical protein wblD 

rimnortftHl - Vihrin r4ir»1f*rsip> HKtIRA b.l'XfSX 1 ^ 1 1 

(AB012957) probable NADH dehydrogenase [Vibrio 
cholerae] 


0.23 


297 


14389443 


oil 143 89443 IreflNP 1 1 6776 1 1 TsJADR rf^hvAmn^nwo 
±-T ^ o^t*tjh^l|i^ jt iiu/ /u.ij i>/vL/n Qcnycirogenase 

subunit 4 [Rana nigromaculata] dbj|BAB58996.1| 

(AB043889) NADH dehydrogenase subunit 4 [Rana 

nigromaculata] 


9.2 


300 


4493987 


gi|4493987|emb|CAB39046.1| (AL034559) hypothetical 
protein, PFC 1 030w [Plasmodium falciparum] 


2.1 


306 


7494200 


gi|7494200|pir||T18434 hypothetical protein C0375c - 

maiana paras ue ^lasmoaium falciparum J 

emb|CAB 1 1 1 1 1 . 1 1 (Z98547) predicted using hexExon; 

^yir\i^j r j. lu ^rrL/Uj /Jt^, nomoiogue oi i^.eiegans 
T08A1 1.2 protein, len: 1387 aa; Similarity to C.elegans 
T08A11.2 protein. C.elegans T08A11.2 protein 
(WP:T08A1 1 .2) BLAST Score: 2901, sum P(2) = 0.0; 
66% identity in 839 aa > 


1.3 


308 


10728064 


ei|10728064leblAAF50455 21 (AE003556* CG7060 
gene product [Drosophila melanogaster] 


4.7 


317 


9631682 


gi|9631682|ref|NP_048461.1| all3L [Paramecium 
uuiacuici v^mvji viid vii u» ij pirj| 1 1 nypouieticai 
protein al 13L - Chlorella virus PBCV-1 
gb|AAC96481.1| (U42580) al 13L [Paramecium 
5ursaria Chlorella virus 1] 


7.7 


318 


4733986 


gi|4733986|gb|AAD28666. 1|AC007209_2 (AC007209) 
hypothetical protein [Arabidopsis thaliana] 


5.3OE+O0 


319 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61 .9K brush border protein precursor - rabbit 
emb|CAA78302.1| (Z 12840) protein of unknown 
function [Oryctolagus cuniculus] 


3.00E-16 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins^ 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


322 


1 1466552 


gi| 1 1466552|ref]NP_04480 1 . 1 1 NADH dehydrogenase, 
subunit 4 rReclinomonas americanal nirllS7R 1 NADH 

dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 4 - 
Reclinomonas americana (ATCC 50394) mitochondrion 
gb|AADl 1916.1| (AF007261) NADH dehydrogenase, 
subunit 4 [Reclinomonas americana] 


9.1 


323 


13603897 


gi|13603897|gb|AAK3 1986. 1|AF295594_1 (AF295594) 
SKTxl precursor [Buthus martensii] 


9.3 


325 


1 1497767 


gi|l 1497767|ref|NP_068989. 1| hypothetical protein 
[Archaeoglobus fulgidus] sp|O30087|Y150_ARCFU 
HYPOTHETICAL PROTEIN AF0150 pir||F69268 
hypothetical protein AF0150 - Archaeoglobus fulgidus 
gb|AAB9 1087.1| (AE001096) A. fulgidus predicted 
coding region AF0150 [Archaeoglobus fulgidus] 


7.1 


332 


7506939 


gi|7506939|pir||T24356 hypothetical protein T02D1.3 - 
Caenorhabditis elegans emb|CAB05908. 1| (Z833 19) 
T02D1.3 [Caenorhabditis elegans] 


3.9 


336 


7510675 


gi|7510675|pir||T29689 hypothetical protein ZC266.2 - 
Caenorhabditis elegans gb|AAA96101 . 1| (U53 140) 
ZC266.2 gene product [Caenorhabditis elegans] 


2.4 


337 


12697923 


gi|12697923|dbj|BAB21780.1| (AB051476) KIAA1689 
protein [Homo sapiens] 


3E-21 


338 


630465 


gi|630465|pir||S47042 protein kinase (EC 2.7. 1.37) cdc2- 
related 1 - malaria parasite (Plasmodium falciparum) 


5.7 


340 


7488318 


gi|74883 18|pir||T01 191 RNA-directed DNA polymerase 
lomolog F21E10.5 - Arabidopsis thaliana 
gb|AAC 13599.1| (AF058914) similar to reverse 
transcriptase (Pfam: transcript_fact.hmm, score: 72.31) 
Arabidopsis thaliana] 


5 


341 


5804818 


gi|5804818|emb|CAB52872.1| (AL021712) putative 
protein [Arabidopsis thaliana] emb|CAB79171.1| 
[AL161556) putative protein [Arabidopsis thaliana] 


5 


343 


14579376 


gi| 14579376|gb|AAK69253. 1|AF336309_48 
(AF336309) SpyB [Yersinia enterocolitica] 


8.2 



448 





Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


344 


11284162 


gi|l 12841 62|pir||H8 1077 hypothetical protein NMB1490 
[imported] - Neisseria meningitidis (group B strain 
MD58) gb|AAF4 1846.1| (AE002498) hypothetical 
protein [Neisseria meningitidis MC58] 


3 


347 


11361512 


gi|11361512|pir||H82781 hypothetical protein XF0626 
[imported] - Xylella fastidiosa (strain 9a5c) 
gb|AAF83436.1|AE003908_4 (AE003908) hypothetical 
protein [Xylella fastidiosa 9a5c] 


7.6 


348 


7497054 


gi|7497054|pir||T 19739 hypothetical protein C35A5.4 - 
Caenorhabditis elegans emb|CAA94905.l| (Z71 185) 
contains similarity to Pfam domain: PF00635 (MSP 
(Major sperm protein) domain), Score=66.4, E-value=2e 
16, N=l [Caenorhabditis elegans] 


0.064 


349 


6531601 


gi|6531601|gb|AAF15500.1|(AF167672) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


1.6 


352 


12858471 


gi|12858471|dbj|BAB31327.1| (AK018655) putative 
[Mus musculus] 


3.9 


353 


2128157 


gi|2128157|pir||C64319 hypothetical protein MJ0154 - 
Methanococcus jannaschii 


0.98 


355 


14195575 


gi|14195575|sp|P581 14|YV78_CAUCR 
HYPOTHETICAL PROTEIN CC3178 
gb|AAK25 140. 1 1 (AE00598 1) pirin-related protein 
[Caulobacter crescentus] 


4.20E+00 


359 


7468435 


gi|7468435|pir||B72015 metalloproteinase, insulinase 
family CP0903 [imported] - Chlamydophila pneumoniae 
(strains CWL029 and AR39) gb|AAD19093.1| 
(AE001675) Insulinase family/Protease III 
[Chlamydophila pneumoniae CWL029] 
gb|AAF38689.1| (AE002249) metalloprotease, 
insulinase family [Chlamydophila pneumoniae AR39] 
dbj|BAA99165.1| (AP002548) insulinase 
family/protease III [Chlamydophila pneumoniae J138] 


4.1 


360 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61.9K brush border protein precursor - rabbit 
emb|CAA78302. 1| (Z12840) protein of unknown 
function [Oryctolagus cuniculus] 


8E-45 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


363 


12859685 


gi|12859685|dbj|BAB31736.1| (AK019464) putative 
[Mus musculus] 


6.1 


365 


11055982 


gi|l 1055982|reflNP_067633.1| PR domain containing 
13; PR-domain containing protein 13 [Homo sapiens] 
ref|XP_01 1460. 1| PR domain containing 13 [Homo 
sapiens] gb|AAG13448.1| (AY004253) PR-domain 
containing protein 13 [Homo sapiens] 


3.4 


368 


3915305 


gi|3915305|sp|Q96597|VP5_AHSV9 OUTER CAPSID 
PROTEIN VP5 gb|AAB 17570.1| (U74489) VP5 
[African horse sickness virus] 


0.63 


370 


11466986 


gi|11466986|ref]NP_04 1893.1| Orf635 [Euglena 
gracilis] sp|P31916|YCX3_EUGGR HYPOTHETICAL 
78.0 KD PROTEIN IN PSBC INTRON 2 (ORF635) 
emb|CAA50080.1| (X70810) Orf635 [Euglena gracilis] 


1.7 


373 


14766918 


gi|14766918|ref]XP_034599.1| acid sphingomyelinase- 
like phosphodiesterase [Homo sapiens] 


2E-99 


375 


5732610 


gi|5732610|gb|AAD49190.1|AF115423_2 (AF1 15423) 
core protein [Hepatitis B virus] 


3.4 


377 


2707661 


gi|2707661|gb|AAB94699.1| (AF029934) IgM heavy 
chain VDJ region [Oryctolagus cuniculus] 


4.7 


379 


7516819 


gi|7516819|pir||C72580 hypothetical protein APE1923 - 
Aeropyrum pernix (strain Kl) dbj|BAA80928.1| 
(AP000062) 105aa long hypothetical protein 
Aeropyrum pernix] 


6.80E+00 


380 


3859670 


gi|3859670|emb|CAA22008.1[ (AL033502) hypothetical 
membrane protein [Candida albicans] 


9.90E+00 


382 


14784562 


gi|14784562|ref]XP_040415.1| mutL (E. coli) homolog 3 
Homo sapiens] gb|AAF23905. 1| (AF195658) DNA 
mismatch repair protein [Homo sapiens] 


2E-25 


383 


14194519 


gi| 1 4 1 945 1 9|sp|Q9K920|CMGB_BACHD COMG 
OPERON PROTEIN 2 HOMOLOG dbj|BAB06550.1| 
(AP001516) DNA transport machinery [Bacillus 
lalodurans] 


2.7 


386 


7509624 


gi|7509624|pir||T26676 hypothetical protein Y38F1A.1 - 
Caenorhabditis elegans emb|CAA21628.1| (AL032639) 
predicted using Genefinder-cDNA EST 
EMBL:AW057282 comes from this gene 
Caenorhabditis elegans] 


9.4 



450 





Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


388 


7508443 


cil7508443loirllT25281 hvootherical nrntpin 1 1 
Caenorhabditis elegans emb|CAB04829.1| (Z82052) 
contains similarity to Pfam domain: PF01827 (Domain 
of unknown function), Score=130.6, E-value=4.6e-36, 
N=l [Caenorhabditis elegans] 


3.4 


389 


7295832 


gi|7295832|gb|AAF5 1133.1J (AE003580) CG3347gene 
product [Drosophila melanogaster] 


0.032 


390 


7510359 


gi|7510359|pir||T27283 hypothetical protein Y64G10A.f 
- Caenorhabditis elegans 


8.2 


395 


6679381 


gi|6679381|ref|NP_032903.1| plasminogen [Mus 
musculus] sp|P20918|PLMN_MOUSE 
PLASMINOGEN PRECURSOR [CONTAINS: 
ANGIOSTATIN] pir||PLMS plasmin (EC 3.4.21.7) 
precursor - mouse gb|AAA50168.1| (J04766) 
plasminogen [Mus musculus] 


2.2 


396 


8479518 


gi|84795 1 8|sp|Q89853|VGP_EBORS STRUCTURAL 
GLYCOPROTEIN PRECURSOR (VIRION SPTKF 
GLYCOPROTEIN) [CONTAINS: GP1; GP2] 
gb|AAC54889.1| (U23416) virion spike glycoprotein 
[Ebola virus] gb|AAC54891.1| (U23417) virion spike 
glycoprotein [Ebola virus] 


3.1 


403 


14133247 


gi|14133247|dbj|BAA86564.2| (AB033076) KIAA1250 
protein [Homo sapiens] 


6E-19 


412 


3293234 


gi|3293234|gb|AAC25913.1| (U96413) T-DNA 
oncoprotein [Agrobacterium tumefaciens] 


9.1 


415 


7023033 


gi|7023033|dbj|BAA91809.1| (AK001649) unnamed 
protein product [Homo sapiens] 


9E-85 


416 


12514130 


gi| 125 14 130|gb|AAG55434. 1|AE005285_1 (AE005285) 
putative oxidoreductase [Escherichia coli 0157:H7 
EDL933] dbj|BAB34455.1| (AP002554) putative 
oxidoreductase [Escherichia coli 0157:H7] 


9.40E-01 


418 


4514359 


gi|4514359|dbj|BAA75394.1| (ABO 13377) ComGB 
Bacillus halodurans] 


2.6 


419 


7510324 


gi|75 10324|pir||T27250 hypothetical protein Y5F2A.3 - 
Caenorhabditis elegans emb|CAA21648.1| (AL032641) 
Y5F2A.3 [Caenorhabditis elegans] 


1.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


421 


6981358 


gi|6981358|refINP_037137.1| phosphoinositide 3-kinase 
p85 (other splicing variants: p55 and p50) [Rattus 
nnrveiririisl <jnl063787IP85A RAT 
PHOSPHATIDYLINOSITOL 3-KINASE 
REGULATORY ALPHA SUB UNIT (PI3 -KINASE 
P85-ALPHA SUBUNIT) (PTDINS-3 -KINASE P85- 
ALPHA) (PI3K) dbj|BAA18932.1| (D64045) 
phosphatidylinositol 3-kinase p85 alpha subunit [Rattus 
norvegicus] 


2.4 


423 


6576738 


gi|6576738|dbj|BAA88337.1| (AB005891) ORF2 
[Platemys spixii] 


3.4 


424 


6739553 


gi|6739553|gb|AAF27299.1| (AF146429) DeltaC [Danio 
rerio] 


3.9 


425 


13816283 


gi|13816283|gb|AAK43020.1| (AE006883) Sulfate 
adenylyltransferase (sat) [Sulfolobus solfataricus] 


3.6 


428 


1 1034774 


gi|l 1034774|gb|AAG27071.1|AF105225_7 (AF030414) 
NifQ [Gluconacetobacter diazotrophicus] 


3.6 


433 


13364676 


gi|13364676|dbj|BAB38622.1| (AP002568) hypothetical 
protein [Escherichia coli 0157:H7] 


5.8 


434 


12845784 


gi|12845784|dbj|BAB26898.1| (AK010380) putative 
[Mus musculus] 


0.075 


435 


12513917 


gi| 1 25 1 39 1 7|gb|AAG55266. 1 |AE005270_2 (AE005270) 
Zl 121 gene product [Escherichia coli 0157:H7 
EDL933] gb|AAG55675.1|AE005306_4 (AE005306) 
Z1560 gene product [Escherichia coli 0157:H7 
EDL933] dbj|BAB34723.1| (AP002554) putative 
membrane protein [Escherichia coli 0157:H7] 


8.3 


436 


7661654 


gi|7661654|ref[NP_056444.1| DKFZP566J153 protein 
[Homo sapiens] emb|CAB43677.1| (AL050369) 
hypothetical protein [Homo sapiens] 


0.42 


437 


1150678 


gi|1150678|emb|CAA50971.1| (X72086) ORF20R; 
B21R in citation [3] [Variola virus] 


5.10E+00 


438 


8928456 


gi|8928456|sp|05 1039|Y006_BORBU 
HYPOTHETICAL PROTEIN BB0006 pir||F70100 
conserved hypothetical integral membrane protein 
BB0006 - Lyme disease spirochete gb|AAC66397.1| 
(AE001 1 15) conserved hypothetical integral membrane 
protein [Borrelia burgdorferi] 


4.2 


440 


7297273 


gi|7297273|gb|AAF52536.1| (AE003618) Myo28Bl 
gene product [Drosophila melanogaster] 


3.3 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


448 


7495690 


gi|7495690|pir||T19090 hypothetical protein C08F1 1.3 - 
Caenorhabditis elegans emb|CAB05674.1| (Z83216) 
C08F1 1.3 [Caenorhabditis elegans] 


4.9 


449 


10438579 


gi|10438579|dbj|BAB15281.1| (AK025916) unnamed 
protein product [Homo sapiens] 


3.7 


451 


112200 


gi|l 12200|pir||A32868 prolactin receptor - rat 
(fragments) 


8.6 


452 


114987 


gi|l 14987|sp|P17885|BIMA_EMENI BIMA PROTEIN 
pir||A53256 nuclear protein bimA - Emericella nidulans 
emb|CAA41959.1| (X59269) bimA [Emericella 
nidulans] 


6.60E+00 


457 


4885039 


gi|4885039|gb|AAD3 1932.1|U00058_6 (U00058) 
contains similarity to DNAJ domains (Pfam: PF00226, 
Score=44.9, E=1.8e-09, N=l) [Caenorhabditis elegans] 


4.3 


459 


4505121 


gi|4505 121|ref|NP_003916. 1| methyl-CpG binding 
domain protein 4; 3,N(4)-ethenocytosine glycosylase; 
G/T mismatch glycosylase; G/U mismatch glycosylase; 

{'r/S-fliinrrviirapil mi^matph olvp-ncvlnQP witVi HinhaQir* 

Y_J/ -J llUUiUUldvll llllollld.Uv'lI f^lj ICLjks Willi ClLJlIdolv/ 

kinetics [Homo sapiens] gb|AAC68879.1| (AF072250) 
methyl-CpG binding protein MBD4 [Homo sapiens] 
gb|AAD22195.1|AF114784_l (AF114784) methyl-CpG 
binding endonuclease [Homo sapiens] gb|AAD50374.1| 
(AF 120999) methyl-CpG binding protein 4 [Homo 
sapiens] 


2.1 


463 


12644455 


gi| 12644455|sp|Q6075 1 |IGlR_MOUSE INSULIN- 
LIKE GROWTH FACTOR I RECEPTOR 
PRECURSOR gb|AAC12782.1| (AF056187) insulin- 
like growth factor I receptor; IGF-I receptor [Mus 
musculus] 


3.1 


465 


1 1465398 


mlll465398lreflNP 045211 11 unknown* N-acetvl- 
glutamate-gamma-semialdehyde dehydrogenase 
[Cyanidium caldarium] sp|Q9TLQ8|HIS5 CYACA 
AMIDOTRANSFERASE H1SH 

gb|AAF12883.1|AF022186_5 (AF022186) unknown; N- 
acetyl-glutamate-gamma-semialdehyde dehydrogenase 
[Cyanidium caldarium] 


5 


466 


5761329 


gi|5761329|dbj|BAA83473.1| (AB004819) cysteine 
endopeptidase [Oryza sativa] 


5.70E+00 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


468 


5306243 


gi|5306243|gb|AAD41976. 1|AC006438_8 (AC006438) 
hypothetical protein [Arabidopsis thaliana] 


4.2 


470 


7304025 


gi|7304025|gb|AAF59067.1| (AE003836) CG8639 gene 
product [Drosophila melanogaster] 


3.20E-01 


473 


7303724 


oil7 r ?n^7'?4li»hlAAF58773 11 fAFOO^R^Q^ Inla orne 

product [alt 2] [Drosophila melanogaster] 
gb|AAF58774. 1 1 (AE003829) lola gene product [alt 3] 

[Drosophila melanogaster] gb|AAF5 8775 . 1 1 
(AE003829) lola gene product [alt 4] [Drosophila 
melanogaster] gb|AAF58776.1| (AE003829) lola gene 
product [alt 5] [Drosophila melanogaster] 


5.8 


476 


1098549 


gi|1098549|gb|AAA82594.1| (U25703) immunoglobulin 
light chain F class [Ictalurus punctatus] 


7 


477 


4262231 


gi|4262231|gb|AAD 14524. 1| (AC006200) hypothetical 
protein [Arabidopsis thaliana] 


3.8 


478 


5834894 


gi|5834894|ref|NPJ)06964.1|ND5_10021 NADH 

Uvll yUl Ugvllaov OUUUXUl — ' y\*sCL\sli\Jl IICIL/UILIO v/lv/gdllo J 

sp|P24896|NU5M_CAEEL NADH-UBIQUINONE 
OXIDOREDUCTASE CHAIN 5 pir||S26037 NADH 
dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 5 - 
Caenorhabditis elegans mitochondrion 
emb|CAA38 162. 1| (X54252) ND5 protein (AA 1 - 527) 
[Caenorhabditis elegans] 


0.23 


480 


14776608 


gi|14776608|ref|XP 033588.1] BAI1 -associated protein 
3 [Homo sapiens] 


8.6 


482 


7448960 


gi|7448960|pir||D72417 conserved hypothetical protein - 

Thermotoga maritima (strain MSB8) 
gb|AAD35211.1|AE001697_ll (AE001697) conserved 
hypothetical protein [Thermotoga maritima] 


9.1 


484 


7497960 


gi|7497960|pir||T15840 hypothetical protein C54G7.3 - 
Caenorhabditis eleeans eblAAA81392 11 (TJ40410) 
coded for by C. elegans cDNA yk9el0.5; coded for by 
C. elegans cDNA yk9el0.3; multiple regions of 
similarity to EGF-like repeats and cysteine-rich repeats 
[Caenorhabditis elegans] 


5 


487 


14720884 


gi|14720884|ref|XP_032 180.1| hypothetical protein 
DKFZp434A171 [Homo sapiens] 


7.7 


490 


3201900 


gi|3201900|gb|AAC19365.1| (AF067420) SNC73 
protein [Homo sapiens] 


0.28 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


493 


12018149 


gi| 120 1 8 149|gb|AAG4542 1 . 1 |AF309495_1 (AF309495) 
gamete-specific hydroxyproline-rich glycoprotein a2 
[Chlamydomonas reinhardtii] 


3.7 


494 


467634 


gi|467634|emb|CAA82856.1| (Z29969) HLY4 
[Entamoeba histolytica] 


8.9 


498 


14726914 


gi|14726914|refjXP_037105.1| KIAA0622 protein; 
Drosophila 'multiple asters' (Mast)-like homolog 1 
[Homo sapiens] 


6.8 


499 


11692583 


gi|l 1692583|gb|AAG39888.1|AF282303_l (AF282303) 
odorant receptor M34 [Mus musculus] 


9.5 


502 


7303064 


gi|7303064|gb|AAF58 132.1| (AE003810) CG12960 
gene product [Drosophila melanogaster] 


4.9 


504 


13701254 


gi| 13701 254|dbj|BAB42549. 1| (AP003 133) 
ORFID:SA1289~hypothetical protein, similar to 
bifunctional biotin ligase/biotin operon repressor 
[Staphylococcus aureus subsp. aureus N315] 
dbj|BAB57618.1| (AP003362) hypothetical protein 
[Staphylococcus aureus subsp. aureus Mu50] 


5.9 


505 


137073 


gi|137073|sp|P17086|UREl_PROMI UREASE ALPHA 
SUBUNIT (UREA AMIDOHYDROLASE) 
pir||D43719 urease (EC 3.5.1.5) 62K chain - Proteus 
mirabilis gb|AAA25669.1| (M31834) urease subunit C 
[Proteus mirabilis] 


8.5 


512 


14548038 


gi|14548038|sp|Q9D952|EVPL_MOUSE 
ENVOPLAKIN (P210) (210 KDA CORNIFIED 
ENVELOPE PRECURSOR) emb|CAC38864.2| 

(AJ309317) envoplakin [Mus musculus] 


4.2 


516 


14031018 


gi| 1403 101 8|gb|AAK50523.1| (AY029684) NADH 
dehydrogenase F [Stenotaphrum secundatum] 


4.6 


518 


14600836 


gi|14600836|refjNP_147359.1| hypothetical protein 
[Aeropyrum pemix] pir||D72647 hypothetical protein 
APE0610 - Aeropyrum pernix (strain Kl) 
dbj|BAA79580. 1| (AP000060) 376aa long hypothetical 
protein [Aeropyrum pernix] 


2.8 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


519 


4930062 


gi|4930062|pdb|lMAA|D Chain D, Mouse 
Acetylcholinesterase Catalytic Domain, Glycosylated 
Protein odbl 1MAAIA Chain A Mon<!e 
Acetylcholinesterase Catalytic Domain, Glycosylated 
Protein pdb|lMAA|C Chain C, Mouse 
Acetylcholinesterase Catalytic Domain, Glycosylated 
Protein pdb|lMAA|B Chain B, Mouse 
Acetylcholinesterase Catalytic Domain, Glycosylated 
Protein 


8.8 


522 


9757603 


gi|9757603|dbj|BAB08147.1| (AB030877) maturase 
[Lilium mackliniae] 


0.62 


526 


7445887 


g i|7445887|pir||T07052 probable potassium channel 
protein SKT2 - potato emb|CAA70870. 1| (Y09699) 
putative inward rectifying potassium channel [Solanum 
tuberosum] 


2.1 


528 


2897832 


gi|2897832|dbj|BAA24910.1| (AB000134) cytochrome 
oxidase subunit I [Prorocentrum micans] 


9.8 


537 


7446379 


gi|7446379|pir||T14039 protein kinase (EC 2.7.1.37), 
myotonic dystrophy-associated - rat gb|AAC02941.1| 
(AF021935) mytonic dystrophy kinase-related Cdc42- 
binding kinase [Rattus norvegicus] 


8E-52 


544 


14779648 


gi|14779648|ref|XP_027140.1| hypothetical protein 
FLJ23239 [Homo sapiens] 


6.3 


546 


7292503 


gi|7292503|gb|AAF47906.1| (AE003481) CG15023 
gene product [Drosophila melanogaster] 


6.3 


548 


4758758 


gi|4758758|ref|NP_004529.1| nucleosome assembly 
protein 1-like 3 [Homo sapiens] 
sp|Q99457|NPL3_HUMAN NUCLEOSOME 
ASSEMBLY PROTEIN 1-LIKE 3 dbj|BAA08904.1| 
(D50370) nucleosome assembly protein [Homo sapiens] 


1.6 


554 


14530412 


gi|14530412|emb|CAC42291.1| (Z69360) cDNA EST 
EMBL:U52071 comes from this gene [Caenorhabditis 
elegans] 


7 


559 


6680788 


gi|6680788|ref|NP_03 1576.1| Bloom syndrome protein 
lomolog (human) [Mus musculus] 
sp|O88700|BLM_MOUSE BLOOM'S SYNDROME 
PROTEIN HOMOLOG emb|CAB10933.1| (Z98263) 
BLM protein [Mus musculus] 


4.8 



456 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


560 


3294495 


gi|3294495|gb|AAC25822.1| (AF038608) contains 
similarity to Mus musculus tumor susceptibility protein 
TSG101 (GB:U52945) [Caenorhabditis elegans] 


0.78 


561 


6319273 


gi|63 19273|reflNP_009356. 1| pre-tRNA processing; 
Pta.lt) rSaccharomvces cerevisiael 
sp|Q01329|PTAl_YEAST PTA1 PROTEIN 
pir||S3 1299 pre-tRNA processing protein PTA1 - yeast 
(Saccharomyces cerevisiae) gb|AAA34919.1| (M95673) 
ptal [Saccharomyces cerevisiae] gb|AAC04988.1| 
(U12980) Ptalp: Pre-tRNA processing involved protein 
[Saccharomyces cerevisiae] 


9.9 


567 


5817732 


gi|5817732|gb|AAD52875.1|AF142703_l (AF142703) 
maturase-like protein [Ophrestia radicosa] 


6.6 


569 


126296 


gi|126296|sp|P08548|LINl_NYCCO LINE-1 
REVERSE TRANSCRIPTASE HOMOLOG 
prf||1207289B reverse transcriptase related protein 
[Nycticebus coucang] 


0.00000002 


572 


13816099 


gi|13816099|gb|AAK42875.1| (AE006870) Hypothetical 
protein [Sulfolobus solfataricus] 


2.1 


574 


12513361 


gi| 125 1 336 1 |gb|AAG54834. 1 |AE005228_1 (AE005228) 
putative glutaminase [Escherichia coli 0157:H7 
EDL933] dbj|BAB33961.1| (AP002552) putative 
glutaminase [Escherichia coli 0157:H7] 


3.3 


575 


11350453 


gi|11350453|pir||B82965 hypothetical protein PA5456 
[imported] - Pseudomonas aeruginosa (strain PAOl) 
gb|AAG0884 1 . 1 |AE00495 8_1 1 (AE00495 8) 
hypothetical protein [Pseudomonas aeruginosa] 


9 


584 


1174945 


gi|l 174945|sp|P43 109|VEXB_SALTI VI 
POLYSACCHARIDE EXPORT INNER- 
MEMBRANE PROTEIN VEXB pir||G36892 Vi 
polysaccharide capsule transporter VexB - Salmonella 
typhi dbj|BAA03 197.1| (D14156) Wzm protein 
[Salmonella typhi] 


4.5 


586 


4566616 


gi|4566616|gb|AAD23408.1|AF113531_l (AF1 13531) 
follicle-stimulating hormone receptor precursor 
[Coturnix coturnix] 


2 


589 


14746756 


gi|14746756|ref|XP_039 102. 1| similar to hypothetical 
protein FLJ20378 (H. sapiens) [Homo sapiens] 


0.007 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


590 


7429823 


gi|7429823|pir||D69991 conserved hypothetical protein 
yteU - Bacillus subtilis gb|AAC00275.1| (AF008220) 
YteU [Bacillus subtilis] emb|CAB14987.1| (Z99119) 
similar to hypothetical proteins from B. subtilis 
[Bacillus subtilis] 


0.49 


592 


11877309 


gi|11877309|emb|CAC19023.1| (AJ278707) ORF 4 
[Neisseria meningitidis phage 2 120] 


9.2 


594 


14193715 


gi|14193715|gb|AAK56102.1|AF332074_l (AF332074) 
peroxisome proliferator-activated receptor binding 
protein [Mus museums] 


0.000002 


595 


14625344 


gi|14625344|gb|AAK71419.1|U80842_9(U80842) 
Hypothetical protein ZC239.19 [Caenorhabditis 
elegans] 


1.9 


596 


1107614 


gi|1107614|emb|CAA62696.1| (X91351) viral infectivity 
factor protein [Human immunodeficiency virus type 1] 


0.008 


599 


6324561 


gi|6324561|ref|NP_014630.1| required to degrade 
misfolded ER lumenal and integral membrane proteins; 
Hrdlp [Saccharomyces cerevisiae] pir||S66695 probable 
membrane protein YOLO 13c - yeast (Saccharomyces 
cerevisiae) emb|CAA99012.1| (Z74755) ORF YOL013c 
[Saccharomyces cerevisiae] 


6.7 


600 


7494878 


gi|7494878|pir||T33047 hypothetical protein B0344.2 - 
Caenorhabditis elegans gb|AAC16982.1| (AF067209) 
B0344.2 gene product [Caenorhabditis elegans] 


0.94 


601 


11347199 


1 1471 QQIr»irllP81 IS 1 nrnhahlp ITTYP-N- 

gi|llj*f / iy7jpir||i^ou j i proodoic uur'ii 
acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate- 
-D-alanyl-D-alanine ligase (EC 6.3.2.15) Cj0795c 
[imported] - Campylobacter jejuni (strain NCTC 1 1 168) 
emb|CAB73060.1| (AL139076) putative UDP-N- 
acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate 
- ligase [Campylobacter jejuni] 


0.55 


604 


1293067 


gi|1293067|gb|AAB06014.1| (U50958) drosophila seven 
in-absentia gene product homolog [Mus musculus] 


7.9 
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NO 
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N 
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P VALUE 


606 


5031975 


gi|5031975|ref]NP_005875.1| p21-activated kinase 4; 
protein kinase related to S. cerevisiae STE20, effector 
for Cdc42Hs [Homo sapiens] ref]XP_04 1 095. 1| protein 
kinase related to S. cerevisiae STE20, effector for 
Cdc42Hs [Homo sapiens] ref|XP_041093.1| protein 
kinase related to S. cerevisiae STE20, effector for 
Cdc42Hs [Homo sapiens] sp|O96013|PAK4_HUMAN 
SERINE/THREONINE-PROTEIN KINASE PAK 4 
(P21 -ACTIVATED KINASE 4) (PAK-4) 
gb|AAD01210.1| (AF005046) serine/threonine kinase 
[Homo sapiens] emb|CAA09820.1| (AJ011855) PAK4 
protein [Homo sapiens] 


1.1 


608 


9280285 


gi|9280285|dbj|BAB01686.1| (AB046104) unnamed 
protein product [Macaca fascicularis] 


8.7 


609 


5869819 


gi|5869819|emb|CAB55576.1| (AJ249395) NADH- 
ubiquinone oxidoreductase subunit 1 [Globodera 
pallida] 


0.84 


611 


12045283 


gi|12045283|ref|NP_073094.1| conserved hypothetical 
protein [Mycoplasma genitalium] 
sp|P47662|Y423_MYCGE HYPOTHETICAL 
PROTEIN MG423 pir||G64246 conserved hypothetical 
protein MG423 - Mycoplasma genitalium 
gb|AAC71647.1| (U39724) conserved hypothetical 
protein [Mycoplasma genitalium] 


2.8 


612 


11289932 


gi|11289932|pir||T50247 probable helicase [imported] - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB71840.1| (AL138666) putative helicase 
[Schizosaccharomyces pombe] 


4.5 


613 


6119709 


gi|6119709|emb|CAB59566.1| (AJ249986) C3G protein 
[Rattus norvegicus] 


1.6 


620 


7144507 


gi|7144507|gb|AAA58585.2| (U12823) hemolysin 
[Acanthamoeba polyphaga] 


8.1 


622 


15011757 


gi|15011757|gb|AAB04582.3| (U64603) Hypothetical 
protein C09B7.1 [Caenorhabditis elegans] 


7 


625 


8923324 


gi|8923324|ref|NP_060247.1| hypothetical protein 
FLJ20345 [Homo sapiens] ref|XP_008161.3| 
hypothetical protein FLJ20345 [Homo sapiens] 
dbj|BAA91 105. 1| (AK000352) unnamed protein product 
[Homo sapiens] 


0.004 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


626 


6960319 


gi|69603 19|gb|AAD43326.2|AF155 156_1 (AF155 156) 
adaptor-related protein complex AP-4 epsilon subunit 
[Homo sapiens] 


0.00008 


628 


5739387 


gi|5739387|gb|AAD50450.1|AF169388_l (AF169388) 
alpha 4 collagen IV [Mus musculus] 


5.5 


631 


3510234 


gi|3510234|gb|AAC33487.1| (AC005581) R31237_l, 
partial CDS [Homo sapiens] 


8.6 


632 


802150 


gi|802150|gb|AAB32775.1| (S75037) pancreatic 
peptidylglycine alpha-amidating monooxygenase, 
PAM=membrane-bound isoform {alternatively spliced, 
clone PAM-3, transmembrane domain (Ba region)} 
[human, islet cell tumor cell line QGP-1, Peptide Partial, 
971 aa] [Homo sapiens] 


1E-13 


636 


2105238 


gi|2105238|gb|AAB57954.1| (U86905) similar to variola 
and vaccinia E9L [Molluscum contagiosum virus 
subtype 1] 


1.5 


641 


7576305 


gi|7576305|emb|CAB88003.1| (AJ277410) NADH 
dehydrogenase subunit 1 [Taenia hydatigena] 


9.3 


643 


12231395 


gi|12231395|gb|AAG49078.1|AF216121_l (AF216121) 
phytochrome B [Gyminda tonduzii] 


7 


647 


281178 


gi|281 1 78|pir|j JQ 1 580 major surface antigen - hepatitis 
B virus (subtype adw4q-, strains CNTS-38 and Fou) 
emb|CAA53344.1| (X75658) surface antigen [Hepatitis 
B virus] 


6.7 


652 


7503145 


ei|7503145lDirllT16315 hvDothetical Drotein F41C3 6 - 
Caenorhabditis elegans gb|AAC46813.1| (U23521) 
asked finisher to look for frameshift because of 
discrepancy between prediction and EST CEESW76F; 
looks like there should be a splice from 21751 to 21706. 
But no problem was found in the area [Caenorhabditis 
elegans] 


6.9 


653 


14773502 


gi|14773502|ref]XP_043252.1| PCAF associated factor 
65 alpha [Homo sapiens] gb|AAH08785.1|AAH08785 
(BC008785) PCAF associated factor 65 alpha [Homo 
sapiens] 


3E-10 


658 


14732840 


gi|14732840|ref]XP_0341 10. 1| hypothetical protein 
XP_0341 10 [Homo sapiens] 


1.8 


659 


13442965 


gi|13442965|gb|AAK26242.1|AF247132_l (AF247132) 
putative chromatin remodeling factor [Mus musculus] 


4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


660 


4512671 


gi|4512671|gb|AAD21725.1| (AC006931) En/Spm-like 
transposon protein [Arabidopsis thaliana] 
gb|AAD33868.1|AF141375_l (AF141375) protodermal 
factor 1 [Arabidopsis thaliana] 

gb|AAD33869.1|AF141376_l (AF141376) protodermal 
factor 1 [Arabidopsis thaliana] 


6.6 


663 


13385330 


gi|13385330|ref|NP_080127.1| RIKEN cDNA 
1700010114 gene [Mus musculus] dbj|BAB24262.1| 
(AK005830) putative [Mus musculus] 


0.33 


665 


14325583 


gi|14325583|dbj|BAB60486.1| (AP000996) unknown 
product [Thermoplasma volcanium] 


5.4 


666 


14779941 


gi|14779941|ref|XP_007847.3| hypothetical protein from 
clone 24796 [Homo sapiens] 


0.00000002 


670 


14330407 


gi|14330407|emb|CAC41079.1| (AJ308518) P2X5 
receptor [Gallus gallus] 


3.3 


672 


13991595 


gi|13991595|gb|AAK51427.1| (AF355796) BRCA1 
[Echymipera kalubu] 


6.9 


681 


14728540 


gi|14728540|ref|XP_048261.1| similar to ring finger 
protein 23; RING-B box-coiled coil-B30.2 (M. 
musculus) [Homo sapiens] 


0.026 


683 


14740222 


gi|14740222|reflXP_049918.1| similar to hypothetical 
protein MGC 10940 (H. sapiens) [Homo sapiens] 


1E-86 


684 


11359451 


gi|l 135945 l|pir||T5 1040 hypothetical protein 
B 15120. 100 [imported] - Neurospora crassa 
emb|CAB97464.1| (AL389900) conserved hypothetical 
protein [Neurospora crassa] 


9.2 


685 


14735526 


gi|14735526|ref|XP_001804.4| guanylate binding protein 
2, interferon-inducible [Homo sapiens] 


2.3 


687 


7510353 


gi|75 10353|pir||T27275 hypothetical protein Y63D3A.9 - 
Caenorhabditis elegans emb|CAA21711.1| (AL032652) 
contains similarity to Pfam domain: PF00646 (F-box 
domain.), Score=43.2, E-value=1.9e-09, N=l 
[Caenorhabditis elegans] 


6.8 


697 


14773502 


gi|14773502|ref|XP_043252.1| PCAF associated factor 
65 alpha [Homo sapiens] gb|AAH08785.1|AAH08785 
(BC008785) PCAF associated factor 65 alpha [Homo 
sapiens] 


3E-10 


700 


7498998 


gi|7498998|pir||T16057 hypothetical protein F13D1 1.2 - 
Caenorhabditis elegans 


0.27 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


703 


4493893 


gi|4493893|emb|CAB39002.1| (AL034558) predicted 
using hexExon; MAL3P2.15 (PFC0230c), Hypothetical 
protein len: 3979 aa [Plasmodium falciparum] 


9.2 


706 


7243081 


gi|7243081|dbj|BAA92588.1| (AB037771) KIAA1350 
protein [Homo sapiens] 


1E-79 


707 


7496384 


gi|7496384|pir||T33266 hypothetical protein C24B9.10 - 
Caenorhabditis elegans gb|AAC 19256. 1| (AF068709) 
Hypothetical protein C24B9. 10 [Caenorhabditis 
elegans] 


5.4 


708 


7492007 


gi|7492007|pir||T41680 hypothetical protein 
SPCP1E1 1.01c - fission yeast (Schizosaccharomyces 
pombe) emb|CAB54860.1| (AL1 17183) hypothetical 
protein [Schizosaccharomyces pombe] 


4.1 


709 


14758507 


gi|14758507|ref|XP_044768.1j similar to death receptor 
6 (H. sapiens) [Homo sapiens] 


0.031 


710 


6513925 


gi|6513925|gb|AAF14829.1|AC01 1664_11 (AC01 1664) 
unknown protein [Arabidopsis thaliana] 


0.83 


713 


6434714 


gi|6434714|emb|CAB6 1161.1| (AL132973) putative 
membrane protein. [Streptomyces coelicolor A3 (2)] 


1.2 


714 


8810476 


gi|88 10476|gb|AAF80 137.1 |AC024 1 74_19 (AC024 1 74) 
Contains similarity to a hypothetical protein F24K9. 13 
gi|6006885 from Arabidopsis thaliana gb|AC008153 
and contains multiple PPR PF|01535 repeats 


0.22 


719 


9626839 


gi|9626839|reflNP_04 1109.1| ORF 18 [ictalurid 
herpesvirus 1] sp|Q00120|VG18_HSVIl 
HYPOTHETICAL GENE 18 PROTEIN pir||A36788 
hypothetical protein ORF 18 - ictalurid herpesvirus 1 
(strain auburn 1) gb|AAA88121.1| (M75136) ORF 18 
[ictalurid herpesvirus 1] 


9.2 


723 


15004903 


gi| 1 5 004903 |gb| AAK77203 . 1 |AC006622_3 
(AC006622) Hypothetical protein C52D10. 12 
[Caenorhabditis elegans] 


2.4 


724 


6677663 


gi|6677663|ref]NP_033047.1| retinoic acid induced 1 
Mus museums] pir||T30250 GT1 protein - mouse 
dbj|BAA06 184.1| (D29801) Unknown [Mus musculus] 
prf]|2123391A GT1 gene [Mus musculus] 


0.12 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


729 


2343183 


gi|2343183|gb|AAB67715.1| (AFO 13243) zinc finger 
transcription factor [Lytechinus variegatus] 


0.021 


730 


7465334 


gi|7465334|pir||B71980 proline dehydrogenase (EC 
1.5.99.8) / l-pyrroline-5-carboxylate dehydrogenase 
(EC 1 .5. 1 . 12) putA [similarity] - Helicobacter pylori 
(strain J99) gb|AAD05632.1| (AE001444) 
Proline/pyrroline-5 -carboxylate dehydrogenase 
[Helicobacter pylori J99] 


4.9 


733 


9631436 


gi|9631436|ref]NP_048287.1| ORF MSV216 SCGgene 
family protein [Melanoplus sanguinipes 
entomopoxvirus] pir||T28377 ORF MSV216 SCG gene 
family protein - Melanoplus sanguinipes 
entomopoxvirus gb|AAC97749.1| (AF063866) ORF 
MSV216 SCG gene family protein [Melanoplus 
sanguinipes entomopoxvirus] 


0.52 


734 


13813168 


gi|13813168|gb|AAK40402.1| (AE006645) Hypothetical 
protein [Sulfolobus solfataricus] 


2.7 


736 


14776978 


gi|14776978|ref|XP_033086.1| hypothetical protein 
XP_033086 [Homo sapiens] 


5.4 


737 


7294863 


gi|7294863|gb|AAF50194.1| (AE003550) CG6718 gene 
product [Drosophila melanogaster] 


1.9 


745 


7465334 


gi|7465334|pir||B71980 proline dehydrogenase (EC 
1.5.99.8) / l-pyrroline-5-carboxylate dehydrogenase 
(EC 1.5.1.12) putA [similarity] - Helicobacter pylori 
(strain J99) gb|AAD05632.1| (AE001444) 
Proline/pyrroline-5 -carboxylate dehydrogenase 
[Helicobacter pylori J99] 


3.9 


750 


5759216 


gi|5759216|gb|AAD51030.1|AF172399_l (AF172399) 
p75 neurotrophin receptor a-1 [Xenopus laevis] 
gb|AAD51031.1|AF172400_l (AF172400) p75 
neurotrophin receptor a-2 [Xenopus laevis] 


3.9 


751 


6518511 


gi|651851 l|dbj|BAA87907.1| (AB021222) ketosynthase 
[Streptomyces rochei] 


6.6 


754 


6679777 


gi|6679777|ref|NP_032029.1| fibroblast growth factor 
15 [Mus musculus] sp|035622|FGFF_MOUSE 
FIBROBLAST GROWTH FACTOR-15 
PRECURSOR (FGF-15) gb|AAB63 197. 1| (AF007268) 
fibroblast growth factor [Mus musculus] 
dbj|BAB30961.1| (AK017829) putative [Mus musculus] 


7.7 
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SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


755 


7498881 


gi|7498881|pir||T20748 hypothetical protein F11A5.2 - 
Caenorhabditis elegans emb|CAB07355.1| (Z92830) 
contains similarity to Pfam domain: PF01604 (7TM 
chemoreceptor), Score=428.7, E-value=1.7e-125, N=l 
[Caenorhabditis elegans] 


6.4 


756 


1378135 


gi|1378135|gb|AAB02606.1| (U58466) putative 
fibroblast growth factor receptor 1-like protein; one of 
three possible open reading frames in the sequence 
[Rattus norvegicus] 


5.1 


758 


7444901 


g i|7444901|pir||B71630 hypothetical protein RP702 - 
Rickettsia prowazekii emb|CAA15 138. 1| (AJ235273) 
unknown [Rickettsia prowazekii] 


2.2 


759 


12854427 


gi|12854427|dbj|BAB30025.1| (AK015906) putative 
[Mus musculus] 


6.6 


763 


4185892 


gi|4 1 85 892|emb|CAA2 1831.1| (AL033125) 1- 
evidence=predicted by content- 1- 
method=genefinder;084~ 1 -evidence_end~2- 
evidence=predicted by match~2- 
match_accession=SWISS-PROT:P38205~2- 
match_description=HYPOTHETICAL 77.9 KD 
PROTEIN IN RRN10-MCM2 INTERGENIC 
REGION.~2-match_species=SACCHA> 


9.1 


766 


14725855 


gi|14725855|ref]XP_038865. 1| similar to hypothetical 
protein PR02822 (H. sapiens) [Homo sapiens] 


9.2 


770 


7582302 


gi|7582302|gb|AAF64271 . 1|AF208857_1 (AF208857) 
BM-015 [Homo sapiens] 


8E-13 


772 


6594283 


gi|6594283|dbj|BAA884 19.1| (ABO 166 15) hydrophobic 
transmembrane protein [Staphylococcus aureus] 


1.7 


783 


12597803 


gi| 1 2597803|gb|AAG60 115.1 |AC073 1 78_26 
(AC073 178) hypothetical protein [Arabidopsis thaliana] 


0.62 


784 


8777303 


gi|8777303|dbj|BAA96893.1| (AB018112) serine 
carboxypeptidase [Arabidopsis thaliana] 


1.2 


785 


10045258 


gi|10045258|emb|CAC07930.1| (AJ228487) cytochrome 
c oxidase subunit 3 [Acanthogammarus godlewskii] 


1.5 


786 


1794167 


gi) 1 794 1 67|dbj|BAA 1 12 1 7. 1 1 (D78137) unnamed 
protein product [Vibrio parahaemolyticus] 


9.7 


790 


12835698 


gi|12835698|dbj|BAB23329.1| (AK004489) putative 
[Mus musculus] 


8.1 



464 





Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


N 
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P VALUE 


791 


131761 


™li 1 1 mi I«*^II>1 1 AQAIOAV XTFTTPP nTTTNJATF 
5 1 /Ol|Sp|r 1 lOJO|l^/Vi _!NillJ\^Iv V^UlxNrv m 

PERMEASE (QUINATE TRANSPORTER) 
pir||G3 1277 quinate transport protein - Neurospora 

^rabbet ^ICIlUAll VC awi|UCliyc/ t/iiiLJ|v^.rvzT._J.£. i x | x-t\j\j-j / 

quinate transporter [Neurospora crassa] 


1.7 


794 


11640576 


gi|11640576|gb|AAG39285.1|AF113214_l (AF1 13214) 
MSTP034 [Homo sapiens] 


6.3 


797 


5306243 


gi|5306243|gb|AAD41976.1|AC006438_8 (AC006438) 
lypothetical protein [Arabidopsis thaliana] 


3.4 


798 


7511787 


gi|7511787|pir||T29089 alpha-mannosidase (EC 3.2.1.-), 
class II - fall armyworm gb|AAB62719.1| (AF005034) 
alpha-mannosidase II [Spodoptera frugiperda] 


7 


800 


3170793 


gi|3 1 70793 |gb| AAC 1 820 1 . 1 1 (AF062 1 65) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


8.7 


802 


11466458 


gi|11466458|rerlNP_038161.1| NADH dehydrogenase 
subunit 4L [Chrysodidymus synuroideus] 
gb|AAlj;0 l| (Uj'tOjj) iN/vL/ri aenyurogena&e 
subunit 4L [Chrysodidymus synuroideus] 
gb|AAF36927.1|AF222718_l (AF222718) NADH 
dehydrogenase subunit 4L [Chrysodidymus 
synuroideus] 


4.2 


803 


14779941 


ml 1 477QQ4. 1 1rpfllTP 007847 31 hvnothetical orotein from 
clone 24796 [Homo sapiens] 


0.00000002 


804 


13430868 


gi 1343UoOo|rei|rNr^ u/ /uuo.ij nypouicxicai proLciii 
MGC2615 [Homo sapiens] rer]XP_015759.1| 
xiypoincxicai proicm ivivj v>z.u i *j \l ivjuiu oa^iwiioj 
gb|AAH01656.1|AAH01656 (BC001656) Unknown 
(protein for MGC:2615) [Homo sapiens] 


4E-21 


808 




gi|3293234|gb|AAC25913.1| (U96413) T-DNA 

nnmnrotein f Affrobacterium tumefaciensl 


7.9 


810 


7299169 


gi|7299169|gb|AAF54367.1| (AE003682) Fps85D gene 
product [alt 1] [Drosophila melanogaster] 


5.6 


814 


11361627 


gill 1361627|pir||H82736 hypothetical protein XF0992 

[imported] - Xylella fastidiosa (strain 9a5c) 
gb|AAF83802.1|AE003937_6 (AE003937) hypothetical 
protein [Xylella fastidiosa 9a5c] 


0.43 


816 


10803160 


gi|10803160|emb|CAC13088.1| (AL445503) putative 
two component system response regulator [Streptomyces 
coelicolor] 


i 

6.4 
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SEQID 
NO 
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XT 

IN 




P VAIIJE 


819 




gi|2613090|gb|AAB84279.1| (AF030317) OmpR 

rroieus vuigansj 


1 7 


821 


2494232 


gi|2494232|sp|Q628 14|E2F5_RAT TRANSCRIPTION 

^ALlUivljZrj ^EZr-jJ gu|/\/\iJUU loU. 1 j ^ujioooj 

E2F-5 [Rattus norvegicus] 


7.8 


822 


l*f /Zy / jZ 


gi|14729752|ref|XP_036388.1| hypothetical protein 


6.7 


825 


7024427 


gi|7024427|emb|CAA87594.2| (Z47547) 
Pwi^rf234.1;Mpo^rf244;Angio==orfx-no ATG start 
codon [Chondrus crispus] 


8 


833 


1 1 A0 1 $ 1 1 


gi|11691811|emb|CAC11114.1|(AL121893) 
bA189K21.6 (Sec23 (S. cerevisiae) homolog B) [Homo 


5.6 


834 


117704 


gi|117704|sp|P18246|CXAl_BOVIN GAP JUNCTION 

ALPHA- 1 PROTEIN (CONNEXIN 43) (CX43) 
(VASCULAR SMOOTH MUSCLE CONNEXIN 43) 
pir||A36623 gap junction protein Cx43 - bovine 
gb|AAA30459.1| (J05535) vascular smooth muscle 
connexin43 [Bos taurus] 


9.2 


835 


1 1 /uouo 


gi|1170606|sp|P43188|KADC_MAIZE ADENYLATE 
KINASE, CHLOROPLAST (ATP-AMP 
TRANSPHOSPHORYLASE) pir||S45634 adenylate 

kinase (EC 2.7.4.3), chloroplast - maize pdb|lZAK|A 
Chain A, Adenylate Kinase From Maize In Complex 
With The Inhibitor Pl,P5-Bis(Adenosine-5'- 
)pentaphospnate (Ap5a) pab|lZAjv|D LJiain t$, 
Adenylate Kinase From Maize In Complex With The 
Inhibitor P l,P5-Bis(Adenosine-5'-)pentaphosphate 

( AnSs^ 

(Apoaj 


0.0000001 


836 


7661750 


gi|7661750|ref|NP_054866.1| HSPC047 protein [Homo 
sapiens] gb|AAF29019.1|AF161532_l (AF161532) 
HSPC047 [Homo sapiens] 


5E-28 


838 


13815530 


gi|13815530|gb|AAK42398.1| (AE006828) Transport 
protein, hypothetical [Sulfolobus solfataricus] 


5 


839 


14771844 


gi| 1477 1 844|reflXP_045585 . 1 1 hypothetical protein 
FLJ1 1085 [Homo sapiens] emb|CAB91047.2| 
(AL109935) dJ1022P6.2 (KIAA1434) [Homo sapiens] 


0.089 


843 


4007786 


gi|4007786|emb|CAA51374.1| (X72850) acetyl CoA 
acetyltransferase [Sphingomonas sp.] 


4.6 
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N 
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847 


9968471 


gi|9968471|emb|CAC06698.1| (AJ275988) Kruppel like 
factor [Mus musculus] 


6.2 


84S 


7499805 


gi|7499805|pir||T21337 hypothetical protein F25D7.1 - 

i^aenornaDQius eiegans emu|v^/VDU i ovo. 1 1 yju i oh- ioj 
cDNA EST yk60g6.3 comes from this gene~cDNA EST 
yk60g6.5 comes from this gene-cDNA EST 
vlcl 56H10 *? comes* from thi^ £?ene — cDNA EST 
ykl56bl0.5 comes from this gene~cDNA EST 
yk399f2.3 comes from this gene-cDNA EST yk399f2.5 
comes from this gene~cDN> 


7.1 


849 


6581093 


gi|6581093|gb|AAF18453.1|AF205599_l (AF205599) 
transposase-like protein [Mus musculus] 


4.4 


853 


13651256 


gi|13651256|reflXP_015436.1| chloride channel 4 

[Homo sapiens] ref|XP_045756.1| chloride channel 4 
rHnmn stanien*;! reflXP 045757 1 1 chloride channel 4 
[Homo sapiens] dbj|BAA77327.1| (ABO 19432) chloride 
channel protein 4 [Homo sapiens] 
gb|AAD50981.1|AF170492_l (AF170492) chloride 
channel CLC4 [Homo sapiens] 


2.3 


854 


336159 


gi|336159|gb|AAA46774.1| (M90520) polymerase 
protein [Woodchuck hepatitis B virus] 


3.2 


861 


8979124 


gi|8979124|dbj|BAA98959.1| (AP002547) CT651 
hypothetical protein [Chlamydophila pneumoniae J 138] 


0.76 


863 


4758712 


gi|4758712|ref|NP_004659.1| alpha-glucosidase; brush 
uorucr nyuxuidoc [numo bdpicnbj 
sp|043451|MGA_HUMAN MALTASE- 
GLUCOAMYLASE, INTESTINAL [INCLUDES: 
MALTASE fALPHA-GLUCOSIDASEV 
GLUCOAMYLASE (GLUCAN 1,4-ALPHA- 
GLUCOSIDASE)] gb|AAC39568.1| (AF016833) 
maltase-glucoamylase [Homo sapiens] 


0.1 


864 


13812050 


gi|13812050|ref]NP_l 13184. 1| hypothetical protein 
[Guillardia theta] gb|AAK39752. 1|AF08303 1_109 
(AF083031) hypothetical protein [Guillardia theta] 


4.3 


867 


10726408 


gi|10726408|gb|AAF54394.2| (AE003683) CG9381 
gene product [Drosophila melanogaster] 


5.5 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


870 


7299899 


gi|7299899|gb|AAF55074.1| (AE003705) CG7987 gene 
product [alt 1] [Drosophila melanogaster] 
gb|AAF55075.1| (AE003705) CG7987 gene product [alt 
2] [Drosophila melanogaster] 


9.7 


872 


6979325 


gi|6979325|gb|AAF34418. 1|AF172282_7 (AF 172282) 
putative phosphatidylinositol 4-kinase [Oryza sativa] 


4.8 


873 


13699930 


gi|13699930|dbj|BAB41229.1| (AP003129) conserved 
hypothetical protein [Staphylococcus aureus subsp. 
aureus N315] dbj|BAB56175.1| (AP003358) conserved 
hypothetical protein [Staphylococcus aureus subsp. 
aureus Mu50] 


3.7 


877 


6841256 


gi|6841256|gb|AAF28981.1|AF161421_l (AF161421) 
HSPC303 [Homo sapiens] 


7E-64 


879 


2459878 


gi|2459878|gb|AAC40459.1| (AF005734) glycoprotein 
precursor [Marburg virus] 


2.5 


880 


14759292 


gi|14759292|refjXP_006700.2| checkpoint with forkhead 
and ring finger domains [Homo sapiens] 
gb|AAF91084.1|AF170724_l (AF170724) cell cycle 
checkpoint protein CHFR [Homo sapiens] 


0.000000002 


886 


141164 


gi|141164|sp|P19297|YORM_TTVl HYPOTHETICAL 
38.6 KD PROTEIN emb|CAA32993.1| (X14855) URF 
(352 AA) [Thermoproteus tenax virus 1] 


2.2 


889 


1755061 


gijl755061|gb|AAB50394.1| (U62737) photosystem I 
subunit XI [Synechococcus sp. PCC 7942] 


3.6 


890 


7464739 


gi|7464739|pir||G71920 hypothetical protein jhp05 18 - 
Helicobacter pylori (strain J99) gb|AAD06099. 1| 
(AE001485) putative [Helicobacter pylori J99] 


7.6 


891 


14141161 


gi| 14 14 1 1 6 1 |ref]NP_004492.2| heterogeneous nuclear 
ribonucleoprotein U, isoform b; hnRNP U protein; 
scaffold attachment factor A; pi 20 nuclear protein 
[Homo sapiens] gb|AAH03367.1|AAH03367 
(BC003367) heterogeneous nuclear ribonucleoprotein U 
(scaffold attachment factor A) [Homo sapiens] 
gb|AAH0362 1 . 1 |AAH0362 1 (BC00362 1) heterogeneous 
nuclear ribonucleoprotein U (scaffold attachment factor 
A) [Homo sapiens] 


6.9 


892 


7649887 


gi|7649887|dbj|BAA94165.1| (AP000422) tail fiber 
protein [Escherichia coli 0157:H7] 


0.63 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


901 


7496871 


gi|749687 1 |pir||Tl 9650 hypothetical protein C32H1 1.7 - 
Caenorhabditis elegans emb|CAB05141.1| (Z82260) 
cDNA EST yk576f6.3 comes from this gene 
[Caenorhabditis elegans] 


4.6 


905 


13400109 


gi|13400109|gb|AAK21974.1| (U77931) rRNA 
promoter binding protein [Rattus norvegicus] 


0.003 


918 


13235458 


gi|13235458|emb|CAC33751.1| (AJ293322) ProP2 
protein [Rickettsia typhi] 


4.1 


919 


7292165 


gi|7292165|gb|AAF47577.1| (AE003472) CG12023 
gene product [Drosophila melanogaster] 


0.25 


920 


12831207 


gi| 1283 1207|ref|NP_075579. 1 1 gamnia-aminobutyric 
acid (GABA-A) receptor, subunit epsilon [Rattus 

norvegicus] gb|AAF70383.1|AF189262_l (AF189262) 
GABA-A receptor epsilon-like subunit [Rattus 
norvegicus] 


4.5 


926 


10765285 


gi| 1 0765285 |gb|AAG2297 L 1 1 AF 1 8343 1_1 (AF 1 8343 1 ) 
inhibitor of apoptosis protein 2 [Rattus norvegicus] 


4.2 


927 


2832268 


gi|2832268|gb|AAC15477.1| (AF043233) Caco-2 
oligopeptide transporter [Homo sapiens] 


5.1 


928 


6179898 


gi|6179898|gb|AAF05702.1|AF190129_l (AF190129) 
Na+/K+/2C1- cotransporter [Callinectes sapidus] 


6.1 


931 


12644035 


gi|12644035|sp|Q9Z8Ml|IF2_CHLPN 
TRANSLATION INITIATION FACTOR IF-2 

X IVTii ^ L/Uil. X IV/i ^ XI ^ X X XI & X X V/ X ^ X I. ✓ X V^X*. XX X* 

pir||E81576 translation initiation factor 2 CP0440 
[imported] - Chlamydophila pneumoniae (strain AR39) 
gb|AAF38279.1| (AE002205) translation initiation 
factor 2 [Chlamydophila pneumoniae AR39] 
dbj|BAA98527.1| (AP002546) initiation factor-2 
[Chlamydophila pneumoniae J 138] 


0.91 


932 


10956362 


gi|10956362|reflNP_052811.1| pXOl-115 [Bacillus 
anthracis] pir||C59105 hypothetical protein pXOl-1 15 - 
Bacillus anthracis virulence plasmid pXOl 
gb|AAD32419.1|AAD32419 (AF065404) pXOl-1 15 
[Bacillus anthracis] gb|AAD41347.1|AF150965_l 
(AF 150965) resolvase X [Bacillus anthracis] 


1.9 


933 


13814214 


gi|13814214|gb|AAK41294.1| (AE006722) Conserved 
hypothetical protein [Sulfolobus solfataricus] 


0.99 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


940 


6320195 


gi|6320195|reflNP_010275. 1 1 Ydl009cp 
[Saccharomyces cerevisiae] pir||S52510 probable 
membrane protein YDL009c - yeast (Saccharomyces 
cerevisiae) emb|CAA88350.1| (Z48432) unknown 
[Saccharomyces cerevisiae] emb|CAA98568.1| 
(Z74059) ORF YDL009c [Saccharomyces cerevisiae] 


7 


942 


10181130 


gi|10181130|ref|NP_065586.1| acid sphingomyelinase- 
like phosphodiesterase 3a [Mus musculus] 
emb|CAA69329.1| (Y08135) acid sphingomyelinase-like 
phosphodiesterase [Mus musculus] 


6E-84 


943 


13124718 


gi| 1 3 1 247 1 8|sp|P54358|DPOD_DROME DNA 
POLYMERASE DELTA CATALYTIC SUBUNIT 
gb|AAF49555.1| (AE003529) DNApol-delta gene 
product [Drosophila melanogaster] 


2.8 


946 


2258300 


gi|2258300|gb|AAB63269.1| (AF001783) AgrC 
[Staphylococcus aureus] 


1.9 


947 


7460037 


gi|7460037|pir||T 13431 hypothetical protein T17A13.20 
Arabidopsis thaliana emb|CAB79678.1| (AL161574) 
hypothetical protein [Arabidopsis thaliana] 


3.4 


948 


7494291 


gi|749429 1 |pir]|E7 1616 hypothetical protein PFB0365w 
malaria parasite (Plasmodium falciparum) 
gb|AAC7 1862.1| (AE001390) hypothetical protein 
[Plasmodium falciparum] 


1.9 


950 


1707627 


gi|1707627|emb|CAA96381.1| (Z71701) cytochrome 
oxidase subunit III [Euhadra herklotsi] 


9.1 


952 


6739602 


gi|6739602|gb|AAF27330.1| (AF178534) talin [Homo 
sapiens] 


1E-14 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


954 


7498956 


gi|7498956|pir||T20804 hypothetical protein F12F6.1 - 
Caenorhabditis elegans emb|CAA97791.1| (Z73425) 
cDNA EST yk30a8.3 comes from this gene-cDNA EST 
ykl2dl 1 .5 comes from this gene-cDNA EST yk30a8.5 

comes from this gene-cDNA EST ykl 17g9.3 comes 
from this gene-cDNA EST ykl 13hl 1 .5 comes from this 

gene-cDNA EST ykl 17g9.5 comes from this 
gene~c \j in emu|v^/\/w / /yo. ij \l, / jnzoj r^o i 
yk30a8.3 comes from this gene-cDNA EST ykl2dl 1.5 
comes from this gene-cDNA EST yk30a8.5 comes from 
this gene-cDNA EST ykl 17g9.3 comes from this 
gene-cDNA EST ykl 13hl 1 .5 comes from this 
gene-cDNA EST ykl 17g9.5 comes from this 
gene~cDNA> 


6.6 


955 


9631417 


gi|9631417|ref|NP_048325.1| ORF MSV254 leucine 
rich repeat gene family protein, similar to Amsacta 
moorei entomopoxvirus Q3 ORF SW:P28854 
[JYieianopius sanguinipes eniomupoxviruoj piiniz-otu 
ORF MSV254 leucine rich repeat gene family protein - 

Melanoplus sanguinipes entomopoxvirus 
**hlAAC97730 11 fAF063866^ ORF MSV254 leucine 
rich repeat gene family protein, similar to Amsacta 
moorei entomopoxvirus Q3 ORF SW:P28854 
[Melanoplus sanguinipes entomopoxvirus] 


2.1 


956 


7160126 


gi|7160126|emb|CAB76312.1| (AL158060) putative 
membrane protein. [Streptomyces coelicolor A3(2)] 


0.25 


959 


1203965 


gi|1203965|gb|AAA89173.1| (L42379) bone-derived 
growth factor [Homo sapiens] 


1.4 


960 


585147 


gi|585147|sp|P38094|FLUG_EMENI FLUG PROTEIN 

pir||A53186 fluG protein - Emericella nidulans 
gb|AAC37414.1| (L27817) FluG [Emericella nidulans] 


5.4 


963 


11352695 


gi|l 1352695 |pir||D83270 transcription-repair coupling 
protein Mfd P A3 002 [imported] - Pseudomonas 
aeruginosa (strain PAOl) 
gb|AAG06390.1|AE004725_3 (AE004725) 
transcription-repair coupling protein Mfd [Pseudomonas 
aeruginosa] 


1.3 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


N 


DESCRIPTION 


P VALUE 


964 


119777 


gi| 1 1 9777|sp|P 1 629 1 |FA9_SHEEP COAGULATION 
FACTOR IX (CHRISTMAS FACTOR) pir||I47078 
coagulation factor IXa (EC 3.4.21.22) - sheep 
rtraompntl ehlAAA31520 11 (M26233) factor IX TOvis 

aries] 


8.2 


965 


7473507 


#»il'7/i'7'3^n'7lr*i*-IIP"7^^il7 i^rriVinHlf* olvpncvl transferase - 

Deinococcus radiodurans (strain Rl) 
t?hlAAF12451 1IAE001863 76 (AE001863) 
glycosyltransferase, putative [Deinococcus radiodurans] 


0.24 


967 


2494130 


gi|2494130|gb|AAB80639.1| (AC002376) Contains 
similarity to Glycine SRC2 (gb|AB000130). 
[Arabidopsis thaliana] 


5.2 


972 


10728660 


ml10728660lffblAAF52603 21 (AE003620) CG8683 
gene product [Drosophila melanogaster] 


6E-12 


980 


7500037 


gi|7500037|pir||T34063 chromosome segregation protein 
smcl F28B3.7 [similarity] - Caenorhabditis elegans 
gb|AAK21378. 1| (AF003 136) contains similarity to 
ATP synthase subunit B [Caenorhabditis elegans] 


0.68 


981 


7298917 


gi|7298917|gb|AAF54122.1| (AE003675) CG10272 
gene product [Drosophila melanogaster] 


5.9 


983 


1170115 


c%\\ \ 1 701 1 SknlP464^0lfrTTl MANSE 
GLUTATHIONE S-TRANSFERASE 1 (GST CLASS- 
THETA) gb|AAA92880.1| (L32091) glutathione S- 
transferase [Manduca sexta] 


7.1 


985 


10864490 


oil10864490leblAAG24203 11 (AF022981) Hypothetical 
protein W03F9. 6 [Caenorhabditis elegans] 


5.3 


987 


* 


gi|105400|pir||A35648 B-cell adhesion protein CD22 
alpha splice form precursor - human emb|CAA36988. 1| 
(X52785) CD22 antigen [Homo sapiens] prf||1608208A 
R <v»11 antmen DC22 [Homo saniensl 


6 


989 


14750376 


gi|14750376|reflXP_012394.3| myosin IXA [Homo 
sapiens] 


3.9 


990 


6324560 


gi|6324560|reflNP_014629. 1| Yol013w-ap 
[Saccharomyces cerevisiae] pir||S78736 protein 
YOL013w-a - yeast (Saccharomyces cerevisiae) delta 
remnant 


9.3 


991 


7290503 


gi|7290503|gb|AAF45956.1| (AE003431) CG3527 gene 
product [Drosophila melanogaster] 


7.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


994 


9964623 


gi|9964623|reflNP_064753.1| RP DNApol [Roseophage 
SIOl] gb|AAG02598.1|AF189021_17 (AF189021) 
Roseophage SIOl complete genome 


3.5 


996 


2769538 


gi|2769538|emb|CAA11217.1| (AJ223279) voltage- 
sensitive sodium channel [Plutella xylostella] 


7.6 


998 


9628506 


oil9628506lreflNP 043384 11 reeulatorv nrotein E2 
"Human papillomavirus type 29] 
sp|P50772|VE2_HPV29 REGULATORY PROTEIN 
E2 gb|AAA79432.1| (U31784) regulatory protein E2 
[Human papillomavirus type 29] 


6.2 


999 


6566147 


gi|6566147|dbj|BAA04745.2| (D21203) large Forked 
protein [Drosophila melanogaster] 


0.085 


1002 


11346920 


gi|l 1346920|pir||H8 1 390 probable integral membrane 
protein Cj 0461c [imported] - Campylobacter jejuni 
(strain NCTC 11168) emb|CAB75099.1| (AL139075) 
putative integral membrane protein [Campylobacter 
jejuni] 


1.5 


1005 


9294038 


gi|929403 8|dbj|BAB0 1995.1] (AB020746) 
gene_id:MOB24.1 -unknown protein [Arabidopsis 
thaliana] 


9.8 


1006 


4505067 


gi|4505067|ref|NP_002349.1| MAD2-like 1; mitotic 
arrest deficient, yeast, homolog-like 1 [Homo sapiens] 
sp|Q13257|MD21_HUMAN MITOTIC SPINDLE 
ASSEMBLY CHECKPOINT PROTEIN MAD2A 
(MAD2-LIKE 1) (HSMAD2) pir||G01942 mitotic 
feedback control protein Madp2 homolog - human 
gb|AAC52060.1| (U31278) mitotic feedback control 
protein Madp2 homolog [Homo sapiens] 
gb|AAC50781.1| (U65410) Mad2 [Homo sapiens] 
emb|CAA03943.1| (AJ000186) MAD2 [Homo sapiens] 
gb|AAH00356.1|AAH00356 (BC000356) MAD2 
(mitotic arrest deficient, yeast, homolog)-like 1 [Homo 
sapiens] gb|AAH05945.1|AAH05945 (BC005945) 
MAD2 (mitotic arrest deficient, yeast, homolog)-like 1 
[Homo sapiens] gb|AAK38174.1| (AF202273) MAD2- 
like protein 1 [Homo sapiens] 


0.000005 



473 






Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1010 


13994140 


gi|13994140|ref]NP 038928. 1| antigen p97 (melanoma 
associated) identified by monoclonal antibodies 133.2 
and 96.5 [Mus musculus] dbj|BAA86655.1| 
(AB024336) membrane-bound transferrin-like protein 
p97 [Mus musculus] dbj|BAB41139.1| (AB047799) 
membrane-bound transferrin-like protein [Mus 
musculus] 


4.1 


1011 


11465559 


gi|11465559|ref|NP_045049.1| unknown [Cyanidium 
caldarium] gb|AAF12997.1|AF022186_169 
(AF022186) unknown [Cyanidium caldarium] 


1.8 


1012 


13421288 


gi|13421288|gb|AAK22158.1| (AE005691) TonB- 
dependent receptor [Caulobacter crescentus] 


0.65 


1013 


6978481 


gi|6978481|ref|NP_036905.1| a-kinase anchoring protein 
[Rattus norvegicus] sp|Q62924|AKl 1 RAT A KINASE 
ANCHOR PROTEIN 1 1 (PROTEIN KINASE A 
ANCHORING PROTEIN 1 1) (PRKA1 1) (A KINASE 
ANCHOR PROTEIN 220 KDA) (AKAP 220) 
pir||T42732 A-kinase anchoring protein AKAP 220 - rat 
gb|AAB06559.1| (U48288) AKAP 220 [Rattus 
norvegicus] 


3 


1015 


7498137 


gi|7498137|pir||T31694 hypothetical protein D 1065.2 - 
Caenorhabditis elegans 


7.4 


1017 


9944230 


gi|9944230|emb|CAC05416.1| (AJ400866) membrane 
tyrosine phosphatase [Bos indicus] 


7 


1023 


14520862 


gi|14520862|ref|NP_126337.1| hypothetical protein 
[Pyrococcus abyssi] pir||G75106 hypothetical protein 
PAB0444 - Pyrococcus abyssi (strain Orsay) 
emb|CAB49568.1| (AJ248285) hypothetical protein 
[Pyrococcus abyssi] 


9.7 


1025 


11290247 


gi|l 1290247|pir||D82039 conserved hypothetical protein 
VC2740 [imported] - Vibrio cholerae (group Ol strain 
N16961) gb|AAF95879.1| (AE004339) conserved 
hypothetical protein [Vibrio cholerae] 


8.2 


1027 


12484269 


gi|12484269|gb|AAG54048.1|AF224607_4 (AF224607) 
NADH dehydrogenase subunit 4 [Propithecus verreauxi 
deckeni] 


7 


1028 


14329703 


gi|14329703|emb|CAC40662.1| (AJ292926) anaerobic 
(class III) ribonucleotide reductase large subunit chain 
[Staphylococcus aureus] 


1.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1037 


2636683 


gi|2636683|gb|AAC06264.1| (U66333) pol 
[Schistosoma mansoni] 


5.2 


1039 


6016240 


gl|OU lOZ^U|Sp|vU<£ !UU|rlV^r 1 V_/YCI11j 11M 1 HVJIV/VLj 

MEMBRANE PROTEIN HOP-1 pir||T15 184 presenilin 

presenilin-beta homolog - Caenorhabditis elegans 
gb|AAB52948.1| (AF000265) Hypothetical protein 
C18E3.8 [Caenorhabditis elegans] gb|AAB84394.1| 

(AF021905) presenilin [Caenorhabditis elegans] 


8.5 


1040 


10047173 


gi|10047173|dbj|BAB13380.1| (AB046774) KIAA1554 
protein [Homo sapiens] 


8.4 


1043 


14089928 


gi|14089928|emb|CAC13687.1| (AL445564) 
HEMOLYSIN C [Mycoplasma pulmonis] 


4.8 


1045 


7469269 


gi|7469269|pir||S77245 bioY protein - Synechocystis sp. 
(strain PCC 6803) dbj|BAA17579.1| (D90907) BioY 
protein [Synechocystis sp. PCC 6803] 


5.6 


1046 


10140780 


gi|10140780|gb|AAG13610.1|AC078840_l 
(AC078840) hypothetical protein [Oryza sativa] 


2.2 


1050 


7496756 


gi|7496756|pir||T19593 hypothetical protein C31A1 1.3 - 

Caenorhabditis elegans emb|CAB05685.1| (Z83218) 
rnntriin^ ^imilaritv to Pfam domain* PF01 838 fDomain 
of unknown function), Score=506.2, E-value=5.4e-154, 
N— 2 [Caenorhabditis elegans] 


4.1 


1052 


7500420 


gi|7500420|pir||T32834 hypothetical protein F33H12.1 - 
Caenorhabditis eleeans eblAAB95002 11 CAF040649^ 
Hypothetical protein F33H12.1 [Caenorhabditis 
elegans] 


4 


1054 


14736828 


gi|14736828|re£lXP_032481.1| hypothetical protein 
XP_03248 1 [Homo sapiens] 


5.5 


1056 


7290674 


gi|7290674|gb|AAF46122.1| (AE003436) CG4320 gene 
product [Drosophila melanogaster] 


4.7 


1060 


14970757 


gi|14970757|emb|CAC44464.1| (AJ313506) CtxX 
protein [Salmonella typhimurium] 


5.5 


1062 


4115497 


gi|4 1 1 5497|dbj|BAA3 639 1 . 1 1 (AB010426) AL1 like 
protein [Phytoplasma sp.] 


3.4 


1064 


111816 


gi| 1 1 1 8 16|pir||S2 1348 probable pol polyprotein-related 
protein 4 - rat emb|CAA37647.1| (X53581) ORF4 
[Rattus norvegicus] 


6.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1067 


14149940 


gi|14149940|ref|NP_l 15610.1| hypothetical protein 
FLJ23059 [Homo sapiens] dbj|BAB15536.1| 
(AK026712) unnamed protein product [Homo sapiens] 


8E-34 


1072 


6321761 


gi|6321761|ref|NP_01 1837.1) Yhl026cp 
[Saccharomyces cerevisiae] sp|P38740|YHC6_YEAST 
HYPOTHETICAL 30.7 KD PROTEIN IN RIM1- 
SNF6 INTERGENIC REGION PRECURSOR 
pir||S48942 hypothetical protein YHL026c - yeast 
(Saccharomyces cerevisiae) gb|AAB65062.1| (U11583) 
YHL026c gene product [Saccharomyces cerevisiae] 


0.26 


1076 


5932366 


gi|5932366|gb|AAD56919.1|AF180145_ll (AF180145) 
hypothetical protein; zml2orf5 [Zymomonas mobilis] 


7.2 


1084 


7106864 


gi|7106864|gb|AAF36157.1|AF151071_l (AF151071) 
HSPC237 [Homo sapiens] 


8.3 


1087 


1173846 


gi|1173846|gb|AAA86616.1| (U39455) envelope 
glycoprotein precursor [Crimean-Congo hemorrhagic 
fever virus] 


6.1 


1092 


87765 


gi|87765|pir||JU0033 hypothetical LI protein (third 

^7 1 lr II J ± Mr V 

intron of gene TS) - human prf||1510254A LI repetitive 
element ORF [Homo sapiens] 


2 


1096 


9506813 


gi|9506813|reflNP_062 184.1| Inositol polyphosphate-5- 
phosphatase [Rattus norvegicus] gb|AAB40610.1| 
(U55192) inositol polyphosphate 5' phosphatase Ship 
[Rattus norvegicus] 


3.3 


1101 


1 1496734 


gi|11496734|reflNP_04551Ll| B. burgdorferi predicted 
coding region BBH18 [Borrelia burgdorferi] pir||C70237 
hypothetical protein BBH18 - Lyme disease spirochete 
plasmid H/lp28-3 gb|AAC66022.1| (AE000784) B. 
burgdorferi predicted coding region BBH18 [Borrelia 
burgdorferi] 


7 


1102 


9630723 


gi|9630723|ref[NP_047269.1| putative virulence 
determinant Vir [Mycoplasma arthritidis bacteriophage 
MAV1] gb|AAC33779.1| (AF074945) putative 
virulence determinant Vir [Mycoplasma arthritidis 
bacteriophage MAV1] 


9.9 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 

N 


DESCRIPTION 


P VALUE 


1107 


3638957 


gi|3638957|gb|AAC36301.1| (AC004877) sco-spondin- 
miirin-like- similar to P98167 fPID*ffl71 1548V details 
of intron/exon structure uncertain [Homo sapiens] 


3.3 


1110 




gi|14725582|reflXP_002477.3| hypothetical protein 
FLJ 10829 IHomo saniensl 


0.85 


1112 


7298126 


gi|7298126|gb|AAF53364.1|(AE003642) 
BG:DS00180.10 gene product [Drosophila 
melanogaster] 


8.6 


1120 


15011489 


gi|1501 1489|gb|AAK77584. 1|AF396436_24 
(AF396436) heme maturase [Tetrahymena thermophila] 


0.077 


1121 


/ JUUOUD 


Caenorhabditis elegans emb|CAB04238.1| (Z81522) 

nrt*Aint(*A nQina frpnpfitiHpr rf^aenorriahditis eleffansl 


5.6 


1123 


170156 


gi|170156|gb|AAA73078.1| (M73688) [Sorghum bicolor 
endosperm tissue mRNA, complete CDS.], gene product 
prf|| 1 80833 1 A gamma kafirin [Sorghum bicolor] 


0.41 


1125 




gi|348951|gb|AAC78248.1| (M77194)gag [Rat 

lpiilrpmia vi nisi 


4.2 


1127 


14583262 


gi|14583262|ref|NP_127506.1| replicase ORFlab 
polyprotein [Equine arteritis virus] emb|CAA69187.2| 
(Y07862) replicase ORFlb polyprotein [Cloning vector 
pEAV030] emb|CAC42775.2| (X53459) replicase 
ORFlb polyprotein [Equine arteritis virus] 


0.79 


1128 


6322140 


gi|6322140|ref|NP_012215.1| involved in filamentous 
growth; DfglOp [Saccharomyces cerevisiae] 
sp|P40526|YIE9_YEAST HYPOTHETICAL 30.3 KD 
PROTEIN IN RPL34B-SYG1 INTERGENIC 
REGION pir||S48430 probable membrane protein 
YIL049w - yeast (Saccharomyces cerevisiae) 
emb|CAA86173.1| (Z38060) orf, len: 253, CAI: 0.11 
[Saccharomyces cerevisiae] 


7.6 


1129 


14779404 


gi|14779404|ref|XP_008099.4| integrin alpha L 
precursor [Homo sapiens] 


0.004 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1136 


13928898 


mil IQTRRQRIrpfllMP 1 13837 11 killer cell lectin-like 
receptor subfamily G, member 1 [Rattus norvegicus] 
pir||I59421 mast cell function associated antigen - rat 
emb|CAA56208. 1| (X79812) mast cell function 
associated antigen [Rattus norvegicus] 
emb|CAA65829. 1| (X97191) MAFA protein [Rattus 
norvegicus] 


5 


1138 


11467057 


gi|l 1467057|ref|NP_042533.1| NADH dehydrogenase, 

ciiKunit A r Ar*5infVi5jmfv*H5i pacfpllfuriil 
SUDUnil H ^AiAJJllllallltJClJ** waoLviicuuij 

sp|Q37375|NU4M_ACACA NADH-UBIQUINONE 
OXIDOREDUCTASE CHAIN 4 pir||S53834 NADH 
dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 4 - 
Acanthamoeba castellanii mitochondrion 
gb|AADl 1826.1| (U12386) NADH dehydrogenase, 
subunit 4 [Acanthamoeba castellanii] 


0.63 


1140 


7331848 


gi|7331848|gb|AAF60536.1| (AC024772) contains 
similarity to Pfam family PF00569 (Zinc finger present 
in dystrophin, CBP/p300), score=30.4, E=4.3e-05, N=l 
[Caenorhabditis elegans] 


4.5 


1144 


12725042 


gi|12725042|gb|AAK06095:l|AE006429_13 
(AE006429) UNKNOWN PROTEIN [Lactococcus 
lactis subsp. lactis] 


0.82 


1146 


13122173 


gi|13122173|emb|CAC32349.1| (AL583945) putative 
bifunctional protein (histidine kinase and regulator) 
[Streptomyces coelicolor] 


1.3 


1147 


13925661 


gi| 1 392566 1 |gb| AAK49407. 1 |AF26 1233_1 (AF26 1 233) 
sodium/calcium exchanger protein [Mus musculus] 


5.3 


1149 


6552484 


gi|6552484|gb|AAF16411.1|AF038572_l (AF038572) 
jagged2 [Mus musculus] 


0.83 


1154 


13489284 


gi| 1 3489284|gb|AAF 16898.2|AF 1686 14_1 (AF 1686 14) 
HMG-box transcription factor Sox 17 [Danio rerio] 


9.8 


1155 


1170115 


gi|l 1701 15|sp|P46430|GTTl_MANSE 
GLUTATHIONE S-TRANSFERASE 1 (GST CLASS- 
THETA) gb|AAA92880.1| (L32091) glutathione S- 
transferase [Manduca sexta] 


7.8 


1159 


13377412 


gi| 1 33774 12|gb|AAK20674. 1 |AF3 16639_9 (AF3 16639) 
Wzy [Streptococcus pneumoniae] gb|AAK74527.1| 
(AE007347) hypothetical protein [Streptococcus 
pneumoniae] 


4.1 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1163 


7298868 


gi|7298868|gb|AAF54075.1| (AE003673) CGI 105 gene 
product [Drosophila melanogaster] 


5.5 


1164 


10178202 


gi|10178202|dbj|BAB11626.1|(AB016875) 
gene id:K9D7.13~unknown protein [Arabidopsis 
thaliana] 


9.3 


1166 


7494379 


gi|7494379|pir||C71610 probable membrane associated 
protein PFB0615c - malaria parasite (Plasmodium 
falciparum) gb|AAC71912.1| (AE001406) predicted 
membrane associated protein [Plasmodium falciparum] 


6.5 


1170 


12408633 


m\\')Aftfl&1'X\r(*1\biP (174.094 11 rvtnrhrnme c nxidase 

subunit 1 [Podospora anserina] 
sp|P20681|COXl_PODAN CYTOCHROME C 
OXIDASE POLYPEPTIDE I pir||A48327 cytochrome-c 
oxidase (EC 1.9.3.1) chain I - Podospora anserina 
mitochondrion emb|CAA38777.1| (X55026) cytochrome 
oxidase c [Podospora anserina] 


3.1 


1173 


2144233 


gi|2144233|pir||JC5010 nucleotide-binding protein F - 
Methanosarcina mazei emb|CAA62802.1| (X91502) 
ABC transporter [Methanosarcina mazei] 


6.9 


1174 


10581460 


gi|10581460|gb|AAG20195.1| (AE005096) Vng2034h 
[Halobacterium sp. NRC-1] 


0.15 


1176 


13652647 


gi|13652647|reflXP_007000.3| solute carrier family 6 
(neurotransmitter transporter, betaine/GABA), member 

12 [Homo sapiens] ref|XP_029976.1| solute carrier 
family 6 (neurotransmitter transporter, betaine/GABA), 
member 12 [Homo sapiens] 


9 


1181 


9631326 


mlQ6^ 1 ^?6lrHTNP 048 1 1 ! ORF MSV088 

iil|>70 J 1 J Oil Ciller UtO ± *j zs . 1 1 V/IU v v/oo 

hypothetical protein [Melanoplus sanguinipes 
entomonoxvirusl oirllT28249 ORF MSV088 
hypothetical protein - Melanoplus sanguinipes 
entomopoxvirus gb|AAC97639.1| (AF063866) ORF 
MSV088 hypothetical protein [Melanoplus sanguinipes 
entomopoxvirus] 


3.2 


1184 


13897920 


gi|13897920|gb|AAK48502.1|AF260966_l (AF260966) 
IL-8 receptor [Oncorhynchus mykiss] 


2.6 


1185 


12860694 


gi|12860694|dbj|BAB32022.1| (AK020187) putative 
[Mus musculus] 


0.86 


1187 


6671484 


gi|6671484|gb|AAC49301.2| (U32444) phytochrome F 
[Lycopersicon esculentum] 


3.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1191 


11358531 


gi| 1 1 35853 1 |pir||T5 1640 myb-related transcription 
factor MYB 19 [imported] - Arabidopsis thaliana 
(fragment) gb|AAC83590.1| (AF062868) putative 
transcription factor [Arabidopsis thaliana] 


0.95 


1192 


88462 


gi|88462|pir||A27307 proline-rich phosphoprotein (gene 
PRH1, Db allele) - human 


7.9 


1193 


7484909 


gi|7484909|pir||T06608 disease resistance protein 
homolog F16J13.80 - Arabidopsis thaliana 
emb|CAB40942.1| (AL049638) putative disease 
resistance protein (TMV N-like) [Arabidopsis thaliana] 
emb|CAB78244.1| (AL161533) putative disease 
resistance protein (TMV N-like) [Arabidopsis thaliana] 


3.4 


1195 


7492269 


gi|7492269|pirj|T39663 paired amphipathic helix, 
probable transcription regulator protein - fission yeast 

(Schizosaccharomyces pombe) emb|CAA2 1310.1] 
(AL031856) putative transcriptional rgulatory protein 
[Schizosaccharomyces pombe] 


4.1 


1197 


134437 


g i|134437|sp|P13823|SERA_PLAFG SERINE- 
REPEAT ANTIGEN PROTEIN PRECURSOR (P126) 

antigen precursor - malaria parasite (Plasmodium 
falcinarum> (strain FCR3) eblAAA29763 11 (J03993) 

serine repeat protein [Plasmodium falciparum] 
gb| AAA 16791.1| (J04000) serine-repeat antigen protein 
[Plasmodium falciparum] gb|AAA749 1 1 . 1 1 (U08 113) 
serine repeat antigen [Plasmodium falciparum] 


4.6 


1202 


12841678 


gi|12841678|dbj|BAB25308.1| (AK007856) putative 
[Mus musculus] 


2E-31 


1211 


586120 


gi|586120|sp|Q07283|TRHY_HUMAN 
TRICHOHYALIN pir||A45973 trichohyalin - human 
gb|AAA65582.1| (L09190) trichohyalin [Homo sapiens] 


9.8 


1214 


3122611 


gi|3 12261 l|sp|O18417|A70A_DROSE ACCESSORY 
GLAND-SPECIFIC PEPTIDE 70A PRECURSOR 
(PARAGONIAL PEPTIDE B) emb|CAA67791.1| 
(X99414) sex-peptide [Drosophila sechellia] 


6.1 


1215 


2190464 


gi|2190464|emb|CAB09537.1| (Z96107) Uncx4.1 [Mus 
musculus] 


7.6 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1216 


153930 


gi|153930|gb|AAA27047.1| (M23007) NADPH-sulfite 
reductase hemoprotein component [Salmonella 
typhimurium] 


1.3 


1217 


5091519 


gi|5091519|dbj|BAA78754.1| (AB023482) Hypothetical 
protein [Oryza sativa] 


0.95 


1219 


9759203 


gi|9759203|dbj|BAB09740.1| (AB015476) heat shock 
transcription factor HSF30-like protein [Arabidopsis 
thaliana] 


9.2 


1224 


1732073 


gi|1732073|gb|AAC50901.1| (U75308) TBP-associated 
factor [Homo sapiens] 


0.13 


1225 


7505421 


gi|7505421|pir||T23399 hypothetical protein K07C10.1 - 
Caenorhabditis elegans emb|CAA87375.1| (Z47074) 
similarity to transmembranous domains of the 
drosophila protein patched (Swiss Prot accession 
number PI 8502) [Caenorhabditis elegans] 


3.4 


1228 


140791 


gi| 1 4079 1 |sp|P 1 7369|YHR3_VACCV 
HYPOTHETICAL HOST RANGE 27.4 KDA 
PROTEIN pir||WZVZA3 27.4K Hindlll-C protein - 
vaccinia virus (strain WR) gb|AAA69594. 1| (M22812) 
unknown protein [Vaccinia virus] 


4.6 


1231 


6831569 


gi|683 1569|sp|O84098|IF2_CHLTR TRANSLATION 
INITIATION FACTOR IF-2 pir||H71558 probable 
translation initiation factor IF-2 - Chlamydia 
trachomatis (serotype D, strain UW3/Cx) 
gb|AAC67687.1| (AE001283) Initiation Factor-2 
[Chlamydia trachomatis] 


7.8 


1243 


12725042 


gi| 12725042|gb|AAK06095. 1|AE006429_13 
(AE006429) UNKNOWN PROTEIN [Lactococcus 
lactis subsp. lactis] 


0.91 


1246 


10178036 


gi| 1 0 1 78036|dbj |BAB 1 1 5 1 9 . 1 1 (AB005245) 
pectinesterase [Arabidopsis thaliana] 


0.28 


1249 


7494298 


gi|7494298|pir||A71613 hypothetical protein PFB0530c - 
malaria parasite (Plasmodium falciparum) 
gb| AAC71895.il (AE001400) hypothetical protein 
[Plasmodium falciparum] 


2.6 


1253 


14721018 


gi| 1 472 1 0 1 8|reflXP_05 1 562. 1 1 similar to agrin (H. 
sapiens) [Homo sapiens] 


3.5 


1255 


6175163 


gi|6175163|gb|AAF04889.1|AC011437_4(AC011437) 
unknown protein [Arabidopsis thaliana] 


8.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1264 


12320927 


gi| 1 2320927|gb|AAG50592. 1 |AC083 89 1_6 
(AC083891) ABC transporter, putative [Arabidopsis 
thaliana] 


9.5 


1265 


13375919 


gi|13375919|refINP_078940.1| hypothetical protein 
FLJ11838 [Homo sapiens] dbj|BAB13931.1| 
(AK021900) unnamed protein product [Homo sapiens] 


4.2 


1274 


9369395 


gi|9369395|gb|AAF87143. 1|AC002423_8 (AC002423) 
T23E23.16 [Arabidopsis thaliana] 


7.2 


1275 


7435789 


gi|7435789|pir||T06276benzothiadiazole-induced 
protein (clone WCI-4) - wheat gb|AAC49287.1| 
(U32430) thiol protease [Triticum aestivum] 


5.4 


1280 


7497192 


gi|7497 1 92|pir||Tl 9833 hypothetical protein C38D9.3 - 
Caenorhabditis elegans emb|CAB03949.1| (Z81481) 
C38D9.3 [Caenorhabditis elegans] 


0.15 


1284 


14739967 


gi|14739967|ref|XP_035107.1| v-raf murine sarcoma 
viral oncogene homolog Bl [Homo sapiens] 


1.9 


1289 


7463772 


gi|7463772|pir||F70103 signal peptidase I (lepB-1) 
homolog - Lyme disease spirochete gb|AAC66422. 1| 
(AE001 1 17) signal peptidase I (lepB-1) [Borrelia 
burgdorferi] 


0.22 


1292 


4580388 


gi|4580388|gb|AAD24366.1|AC007171 2 (AC007171) 
hypothetical protein [Arabidopsis thaliana] 


5.5 


1293 


10437002 


gi|10437002|dbj|BAB14954.1| (AK024666) unnamed 
protein product [Homo sapiens] 


0.000000002 


1296 


4493974 


gi|4493974|emb|CAB39033.1| (AL034559) hypothetical 
protein, PFC0930c [Plasmodium falciparum] 


3.2 


1301 


482030 


gi|482030|pir||S40544 hypothetical protein - Escherichia 
coli dbj|BAA01299.1| (D10483) IS1 hypothetical 
protein E-96(PIR:A04462) [Escherichia coli] 


8.3 


1306 


4836719 


gi|4836719|gb|AAD30537.1|AF133256_2(AF133256) 
envelope protein precursor [Friend mink cell focus- 
forming virus] 


8.6 


1307 


12843929 


gi|12843929|dbj|BAB26168.1| (AK009250) putative 
[Mus musculus] 


2.1 


1308 


10175999 


gi| 1 0 1 75 999|dbj|BAB07095 . 1 1 (AP00 1518) 
lipopolysaccharide biosynthesis [Bacillus halodurans] 


2.8 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1310 


14767733 


gi|14767733|reflXP_030859.1| similar to TGF(beta)- 
induced transcription factor 2 (H. sapiens) [Homo 
sapiens] 


1.5 


1311 


1351721 


gi| 1 3 5 1 72 1 |sp|Q 1 0 1 05 |YAQ5_SCHPO PUTATIVE 
TRANSLATIONAL ACTIVATOR C18G6.05C 
(GCN1 HOMOLOG) pir||T37919 GCN1 homolog - 
fission yeast (Schizosaccharomyces pombe) 
emb|CAA923 85.1| (Z68198) GCN1 homologue 
[Schizosaccharomyces pombe] 


0.48 


1312 


13376747 


gi|13376747|reflNP_079428.1| hypothetical protein 
FLJ 12660 [Homo sapiens] reflXP_017923.1| 
hypothetical protein FLJ 12660 [Homo sapiens] 
dbj|BAB14203. 1| (AK022722) unnamed protein product 
[Homo sapiens] 


0.000000003 


1315 


9453886 


gi|9453886|dbj|BAB03287.1| (AB045975) pro-alpha 1 
type V/XI collagen [Pagrus major] 


0.021 


1320 


11034630 


gi|l 1034630|dbj|BAB17154.1| (AP002868) hypothetical 
protein [Oryza sativa] dbj|BAB55491.1| (AP002541) 
hypothetical protein [Oryza sativa] 


0.49 


1322 


14727261 


gi|14727261|ref|XP_027313.1| hypothetical protein 
FLJ22351 [Homo sapiens] reflXP_027314.1| 
hypothetical protein FLJ22351 [Homo sapiens] 


0.077 


1327 


2789430 


gi|2789430|dbj|BAA24380.1| (D30612) repressor 
protein [Homo sapiens] 


0.97 


1328 


7497494 


gi|7497494|pir| |T 1 9963 hypothetical protein C46C2.4 - 
Caenorhabditis elegans emb|CAA92590.1| (Z68296) 
contains similarity to Pfam domain: PF00561 
(alpha/beta hydrolase fold), Score=30.8, E-value=le-05, 
N=l [Caenorhabditis elegans] 


0.2 


1329 


13129018 


gi| 1 3 1 290 1 8|ref]NP_076956. 1 1 hypothetical protein 
MGC3077 [Homo sapiens] ref|XP_030116.1| 
hypothetical protein MGC3077 [Homo sapiens] 
gb|AAC23790.1| (AC005154) similar to protein 
U28928 (PID:g861306) [Homo sapiens] 
gb|AAH00625.1|AAH00625 (BC000625) Unknown 
(protein for MGC:3077) [Homo sapiens] 


2E-89 


1330 


14647539 


gi|14647539|gb|AAK71916.1|AF332040_l (AF332040) 
growth hormone receptor [Jaculus jaculus] 


4.8 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ ID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1331 


1584488 


gi|1584488|prf||2123261AB chemosensory receptor 
[Caenorhabditis elegans] 


0.68 


1333 


2326324 


gi|2326324|emb|CAB 10936. 1| (Z98264) cytochrome c 
oxidase subunit III [Anabaena sp.] 


4.1 


1334 


7434912 


mV7 A*\AQ\ 9lr*irllH71 Q^zl r»liocoViatiH\/1<»l\7f*^*rnr*lincr»Vmtp 
gi| /*t jh? lZrjpir j|n. / ijjh piiu5»pnd.iiuyigiyt^iupiiuopii<tLc 

synthase - Helicobacter pylori (strain J99) 

gb|AAD05990.1| (AE001475) 

PHOSPHATIDYLGLYCEROPHOSPHATE 

SYNTHASE [Helicobacter pylori J99] 


5.9 


1337 


1335199 


gi|1335199|emb|CAA26919.1| (X03145) pot. ORF V 
[Homo sapiens] 


5.2 


1340 


7440285 


gi|7440285|pir||T07994 ribosomal protein S9 - 
Chlamydomonas reinhardtii chloroplast 
emblCAA74006 11 (Y13655) 30S ribosomal orotein S9 
[Chlamydomonas reinhardtii] 


3 


1341 


8923088 


gi|8923088|ref|NP_060127.1| hypothetical protein 

PT 190080 rHntnn csinipnci r^flVP C\A>f\f\~kf\ 1 1 

hypothetical protein FLJ20080 [Homo sapiens] 

HhilR A A90936 1 1 rAK000087 , i unnamed orotein oroduct 

[Homo sapiens] 


0.000000008 


1345 


6958206 


gi|695 8206|gb|AAF32493 . 1 |AF093 1 32_1 (AF093 1 32) 
Wpvin-like nrntease 1CEX1 TPneiimocvstis carinii f sd 
muris] 


2.3 


1346 


7496774 


ail74Q6774lnirllT32166 hvoothetical orotein C31B8 8 - 
Caenorhabditis elegans 


2 


1350 


9366789 


gi|9366789|emb|CAB95551.1| (AL359782) hypothetical 
protein, CHR1.313. [Trypanosoma brucei] 


2.4 


1352 


10175488 


gi|10175488|dbj|BAB06586.1| (AP001516) cation 
antiporter (Na+/Ca2-r-) [Bacillus halodurans] 


0.72 


1354 


1119230 


gi|1119230|dbj|BAA06595.1| (D31786) secretion 
protein Y [Acyrthosiphon kondoi endosymbiont] 


8.5 


1355 


2897812 


ml2897812ldbilBAA24894 11 (AB010996) G2-G1 
polyprotein precursor [tomato spotted wilt virus] 


8.9 


1357 


13111586 


gi| 13 1 1 1586|gb|AAK12388. 1|AF296094_1 (AF296094) 
polyprotein [Porcine teschovirus] 


9.2 


1359 


13881823 


gi|13881823|gb|AAK46426.1| (AE007064) hypothetical 
protein [Mycobacterium tuberculosis CDC 1551] 


1.3 


1368 


1362185 


gi|1362185|pir||S56686 histone H2B123 - wheat 
dbj|BAA07158.1| (D37944) protein H2B 123 [Triticum 
aestivum] 


9.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 




SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 




1369 


13812308 


oil 1 "38 1 9^nXlrpflNP 1 1 ^496 1 1 hvnnthptiral nrotein 
[Guillardiatheta] emb|CAC26995.1| (AJ0 10592) 
hypothetical protein [Guillardia theta] 


4.7 




1370 


1** 541 057 


gi|13541057|ref|NP_l 10745. 1| Permease (major 

far*ilitjitr»r Qiinprfamilv^ nTipnnnnla<5iTia volcaniuml 

dbj|BAB59369.1| (AP000991) unknown product 
fXhermonlasma volcaniuml 


7.6 




1371 


9964353 


gi|9964353|ref|NP_064821.1| AMV039 [Amsacta 
moorei entomopoxvirus] gb|AAG02745.1|AF250284_39 
fAF7S0284^ AMV039 r Amsacta moorei 
entomopoxvirus] 


5.3 




1373 


902377 


gi|902377|gb|AAA82981.1| (U18059) polyprotein 
[pestivirus type 1] 


0.41 




1374 


07JJ / lO 


oilRQS^74xlrlhilRAA98067 11 fAP000368^ 
gene_id:F6B6. l~pir||C71410~similar to unknown 
nrotein TArabidoDsis thalianal 


0.27 




1375 


11291752 


gi|l 1 29 1 752|pir||T4797 1 seven in absentia-like protein - 

ArahiHnn<;i<: thaliana emhlCAB71 109 11 CAL132959) 

seven in absentia-like protein [Arabidopsis thaliana] 


10 


T-r - 
f 8 ^ 


1379 


10177211 


gi|10177211|dbj|BAB10286.1| (AB026650) protein 
kinase [Arabidopsis thaliana] 


5 


1. - JV, 

Cn 


1384 


7332073 


gi|7332073|gb|AAF60760.1| (AC024810) Hypothetical 
protein Y54E 10 A. 1 [Caenorhabditis elegans] 


5.7 




1388 


1 1 D JOO l*t 


gi| 1 1 35 8 8 14|pir||T46 130 RNA polymerase III subunit- 

lilr^ nmtpin - A t*;*hiHnrwi<s thriliJina pmHlf^AR67010 1 

(AL 132967) RNA polymerase III subunit-like protein 
TArabidoosis thalianal 


2.2 




1389 


4506569 


gi|4506569|ref]NP_002932.1| roundabout (axon 
guidance receptor, Drosophila) homolog 1 [Homo 
sapiens] gb|AAC39575.1| (AF040990) roundabout 1 
[Homo sapiens] 


4E-17 




1393 


11761072 


gi|11761072|dbj|BAB19062.1| (AP002744) hypothetical 
protein [Oryza sativa] 


1.9 




1398 


461649 


gi|461649|sp|Q05004|BB61_RABIT BRUSH BORDER 
61.9 KD PROTEIN PRECURSOR pir||B45665 adult- 
specific 61.9K brush border protein precursor - rabbit 
emb|CAA78302.1| (Z12840) protein of unknown 
function [Oryctolagus cuniculus] 


8E-52 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1402 


478809 


gi|478809|pir||S29851 protein kinase 6 (EC 2.7.1.-) - 
soybean gb|AAA34002. 1| (M67449) protein kinase 
Glycine max] prf||1908223A protein kinase [Glycine 
max] 


9.3 


1406 


1517936 


gi|15 17936|gb|AAB07000. 1| (U52347) tachykinin-like 
receptor [Stomoxys calcitrans] 


1.1 


1407 


9977929 


gi|9977929|sp|Q050 1 3 |LIPA_NEIMB CAPSULE 
POLYSACCHARIDE MODIFICATION PROTEIN 
LIPA pir||D81240 capsule polysaccharide modification 
protein LipA NMB0082 [imported] - Neisseria 
meningitidis (group B strain MD58) gb|AAF40546.1| 
(AE002367) capsule polysaccharide modification 
protein LipA [Neisseria meningitidis MC58] 


9.3 


1412 


14773348 


gi|14773348|ref|XP_038450.1| 20849 [Homo sapiens] 


1E-48 


1413 


13537363 


gi|13537363|dbj|BAB40663.1| (AB051851) death 
receptor 3 [Homo sapiens] 


3.9 


1419 


7477083 


gi|7477083|pir||A70577 hypothetical protein Rv2133c - 
Mycobacterium tuberculosis (strain H37RV) 
emb|CAB08660.1| (Z95388) hypothetical protein 
Rv2133c [Mycobacterium tuberculosis] 
gb|AAK46475.1| (AE007067) conserved hypothetical 
protein [Mycobacterium tuberculosis CDC 1551] 


1.9 


1422 


7499021 


gi|7499021|pir||T20846 hypothetical protein F13E9.9 - 
Caenorhabditis elegans emb|CAA93411.1| (Z69383) 
F13E9.9 [Caenorhabditis elegans] 


4.1 


1424 


14723696 


gi|14723696|ref|XP_035744.1| hypothetical protein 
XP 035744 [Homo sapiens] 


1.5 


1425 


9961349 


gi|9961349|ref]NP_005500.2| Dmx-like 1 [Homo 
sapiens] 


0.94 


1426 


478302 


gi|478302|pir||JN0835 carbonate dehydratase (EC 
4.2. 1 . 1) I - chimpanzee 


1.6 


1427 


6649942 


gi|6649942|gb|AAF2 1 64 1 . 1 |AF032379_1 (AF032379) 
gonadotrophin releasing hormone receptor; GnRH-R 
[Trichosurus vulpecula] 


9.6 


1432 


6636500 


gi|6636500|gb|AAF20201 . 1|AF205791_1 (AF205791) , 
squalene synthase [Botryococcus braunii] 


1.8 


1434 


7504070 


gi|7504070|pir||T22586 hypothetical protein F53F4.14 - 
Caenorhabditis elegans 


0.05 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1437 


102177 


gi|102177|pir||S13141 hypothetical protein (ribosomal 
RNA repeat region) - Giardia lamblia 


1.2 


1439 


13357869 


gi|13357869|ref]NP_078143. 1| unique hypothetical 
[Ureaplasma urealyticum] pir||D82907 hypothetical 
protein UU309 [imported] - Ureaplasma urealyticum 
gb|AAF30718.1|AE002128_6 (AE002128) unique 
hypothetical [Ureaplasma urealyticum] 


5.4 


1451 


7468435 


gi|7468435|pir||B72015 metalloproteinase, insulinase 
family CP0903 [imported] - Chlamydophila pneumoniae 
(strains CWL029 and AR39) eblAAD19093 11 
(AE001675) Insulinase family/Protease III 
[Chlamydophila pneumoniae CWL029] 
gb|AAF38689.1| (AE002249) metalloprotease, 
insulinase family [Chlamydophila pneumoniae AR39] 
dbj|BAA99165.1| (AP002548) insulinase 
family/protease III [Chlamydophila pneumoniae J138] 


7.1 


1452 


9857712 


gi|9857712|gb|AAG00902.1|AF176776_l (AF176776) 
xyloglucan endotransglycosylase LeXET2 
[Lycopersicon esculentum] 


1.9 


1453 


4493974 


gi|4493974|emb|CAB39033.1| (AL034559) hypothetical 
protein, PFC0930c [Plasmodium falciparum] 


3.4 


1454 


4493900 


gi|4493900|emb|CAB39009.1| (AL034558) predicted 
using hexExon; MAL3P2.22 (PFC0265c) 5 Hypothetical 
protein, len: 637 aa [Plasmodium falciparum] 


6.1 


1456 


7503603 


gi|7503603|pir||T16375 hypothetical protein F46G1 1.1 - 
Caenorhabditis elegans gb|AAA81397.1| (U40412) 
Hypothetical protein F46G1 1 . 1 [Caenorhabditis 
elegans] 


2.1 


1462 


9437954 


gi|9437954lgb|AAF87502.1|AF250474 1 (AF250474) 
nucleoprotein [Influenza A virus (A/Duck/Hong 
Kong/P54/97(HllN9))] 


6.7 


1464 


2129239 


gi|2129239|pir||G64488 reverse gyrase (intein- 
containing) - Methanococcus jannaschii 
gb|AAB9953 1 . 1 1 (U67592) reverse gyrase, intein 
containing (rgy) [Methanococcus jannaschii] 


9.2 


1466 


12232439 


gi|12232439|ref|NP_073602.1| hypothetical protein 
FLJ11937 [Homo sapiens] dbj|BAB15124.1| 
(AK025392) unnamed protein product [Homo sapiens] 


6.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 

N 


DESCRIPTION 


P VALUE 


1468 


7243247 


gi|7243247|dbj|BAA92671.1| (AB037854) KIAA1433 
protein [Homo sapiens] 


2E-87 


1472 


12382242 


gi|12382242|gb|AAG53080. 1|AF263824_1 (AF263824) 
5'A2rel-related protein [Leishmania donovani] 


8.7 


1473 


13812383 


gi|13812383|ref|NP_113501.1| seryl-tRNA synthetase 
(serin-tRNA ligase) [Guillardia theta] 
emb|CAC27070.1| (AJQ 10592) seryl-tRNA synthetase 
(serin-tRNA ligase) [Guillardia theta] 


6.5 


1476 


6014998 


gi|60 1 4998|sp|O80 1 64|DPA5 BPR69 DNA 
POLYMERASE PROCESSIVITY COMPONENT 
(DNA POLYMERASE ACCESSORY PROTEIN 45) 
(GP45) pdb|lB77|A Chain A, Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide From Dna 
Polymerase pdb|lB77|B Chain B, Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide From Dna 
Polymerase pdb|lB77|C Chain C, Building A Replisome 
Structure From Interacting Pieces: A Sliding Clamp 
Complexed With An Interaction Peptide From Dna 
Polymerase pdb|lB8H|A Chain A, Sliding Clamp, Dna 
Polymerase pdb|lB8H|B Chain B, Sliding Clamp, Dna 
Polymerase pdb|lB8H|C Chain C, Sliding Clamp, Dna 
Polymerase gb|AAC39310.1| (AF039565) DNA 
polymerase processivity component [Bacteriophage 
RB69] 


6.9 


1479 


7706747 


gi|7706747|ref|NP_057263.1| transient receptor potential 
4 [Homo sapiens] sp|Q9UBN4|TRP4_HUMAN 
SHORT TRANSIENT RECEPTOR POTENTIAL 
CHANNEL 4 (TRPC4) (TRP-RELATED PROTEIN 4) 
(HTRP-4) (HTRP4) gb|AAD51736.1|AF175406_l 
(AF 175406) transient receptor potential 4 [Homo 
sapiens] gb|AAF22927.1|AF063822_l (AF063822) top- 
related protein 4 [Homo sapiens] 


1.7 


1480 


9757550 


gi|9757550|dbj|BAB08 163.1| (AB030831) SrtT 
[Streptococcus pyogenes] 


0.47 


1483 


13516917 


gi| 1 35 1 69 1 7|dbj |BAB403 3 8 . 1 1 (AB044076) hybrid 
sensor [Myxococcus xanthus] 


5 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1487 


5790208 


gi|5790208|dbj|BAA83536. 1| (AB03 1285) NADH 
dehydrogenase subunit 2 [Taenia saginata] 


2.7 


1489 


14754352 


gi|14754352|ref|XP_032294.1| hypothetical protein 
FLJ 10775 [Homo sapiens] 


0.0008 


1490 


3552028 


gi|3552028|gb|AAC64946.1| (AF087130) siderophore 
regulation protein [Neurospora crassa] 


7.2 


1493 


3913936 


gi|3913936|sp|Q43652|IP27_SOLTU PROTEINASE 
INHIBITOR TYPE II CM7 PRECURSOR pir||S43 105 
proteinase inhibitor II - potato emb|CAA55082.1| 
(X78275) proteinase inhibitor II [Solanum tuberosum] 


9.9 


1494 


6573777 


gi|6573777|gb|AAF17697. 1|AC009243_24 (AC009243) 
F28K19.17 [Arabidopsis thaliana] 


2.2 


1496 


13622425 


gi|13622425|gb|AAK34148.1| (AE006569) maltodextrin 
transport system permease [Streptococcus pyogenes Ml 
GAS] 


3.7 


1498 


9910266 


gi|9910266|reflNP_064627.1| kinesin-like protein 2 
[Homo sapiens] dbj|BAB03309.1| (AB035898) kinesin- 
like protein 2 [Homo sapiens] 


5E-19 


1503 


7294128 


gi|7294128|gb|AAF49482.1| (AE003527) CG4925 gene 
product [Drosophila melanogaster] 


3.6 


1506 


14089610 


gi|14089610|emb|CAC13370.1| (AL445563) unknown; 
predicted coding region [Mycoplasma pulmonis] 


1.5 


1507 


2565196 


gi|2565196|gb|AAB81938.1| (AF000381) non-functional 
folate binding protein [Homo sapiens] 


0.0000004 


1509 


14753935 


gi|14753935|reflXP_040892.1| hypothetical protein 
XP_040892 [Homo sapiens] 


3.3 


1510 


12545425 


gi|12545425|ref|NP_074975.1| hypothetical protein 
[Astasia longa] sp|P34776|YCY2_ASTLO 
HYPOTHETICAL 34.5 KDA PROTEIN IN RPS12- 
TRNP INTERGENIC REGION (ORF288) 
emb|CAC24586.1| (AJ294725) hypothetical protein 
[Astasia longa] 


7 


1511 


420215 


gi|420215|pir||B45878 hypothetical protein 2 - mouse 


7.1 


1512 


7512874 


gi|7512874|pir||T08792 hypothetical protein 
DKFZp586E1422.1 - human (fragment) 
emb|CAB43306.1| (AL050170) hypothetical protein 
[Homo sapiens] 


5.5 


1514 


221758 


gi|221758|dbj|BAA01683.1| (D10879) UL37 [human 
herpesvirus 1] 


0.18 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1517 


11466232 


gi|11466232|reflNP_062855.1| ORF9, contains 8 trans 
membrane regions, putative [Physarum polycephalum] 
dbj|BAB08089.1| (AB027295) ORF9, contains 8 trans 
membrane regions, putative [Physarum polycephalum] 


0.43 


1518 


182710 


gi|182710|gb|AAA52467.1| (M13918) fibronectin 
receptor alpha-subunit precursor [Homo sapiens] 


1.3 


1519 


1142976 


gi|l 142976|gb|AAC52392. 1| (U28769) odorant receptor 
[Mus musculus] prf||2207403C odorant receptor [Mus 
musculus] 


5.6 


1520 


6912446 


gi|6912446|ref]NP_036417.1| potassium voltage-gated 
channel,, subfamily H (eag-related), member 4; ether-a- 
go-go K(+) channel family member [Homo sapiens] 
dbj|BAA83592.1| (AB022698) BEC2 [Homo sapiens] 


8.4 


1522 


14601483 


gi|14601483|reflNP_148021.1| hypothetical protein 
[Aeropyrum pernix] pir||G72637 hypothetical protein 
APE1558 - Aeropyrum pernix (strain Kl) 
dbj|BAA80557.1| (AP000061) 279aa long hypothetical 
protein [Aeropyrum pernix] 


5.7 


1526 


12515306 


gi|12515306|gb|AAG56369.1|AE005365_3 (AE005365) 
orf, hypothetical protein [Escherichia coli 0157:H7 
EDL933] dbj|BAB35435.1| (AP002557) hypothetical 
protein [Escherichia coli 0157:H7] 


7.4 


1530 


2765672 


gi|2765672|emb|CAB068 19.1| (Z861 15) ArbX 
[Lactobacillus delbrueckii] 


7.9 


1532 


13472514 


gi| 134725 14|reflNP_10408 1 . 1| unknown protein 
[Mesorhizobium loti] dbj|BAB49867.1| (AP003000) 
unknown protein [Mesorhizobium loti] 


7.9 


1533 


3309522 


gi|3309522|gb|AAC26098.1| (U18292) unknown 
[Borrelia burgdorferi] 


2.1 


1535 


14520328 


gi|14520328|ref|NP_125803.1| activator 1, replication 
factor C, small subunit [Pyrococcus abyssi] pir||C75198 
activator 1, replication factor c, small chain PAB0068 - 
Pyrococcus abyssi (strain Orsay) emb|CAB49034.1| 
(AJ248283) activator 1, replication factor C, small 
subunit [Pyrococcus abyssi] 


0.41 


1536 


7229605 


gi|7229605|gb|AAF42902.1| (AF229961) NADH 
dehydrogenase subunit 1 [Taygetis andromede] 


6.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1537 


6681261 


gi|6681261|ref|NP_031926.1| ect2 oncogene [Mus 
musculus] sp|Q07139|ECT2_MOUSE ECT2 PROTEIN 
(ECT2 ONCOGENE) pir||S32372 transforming orotein 
(ect2) - mouse gb|AAA37536. 1| (LI 1316) ect2 [Mus 
musculus] prfl|191 1407 A oncogene ect2 [Mus 
musculus] 


6E-10 


1538 


12381848 


gi|12381848|emb|CAC24715.1| (AJ297319) glucoses- 
phosphate dehydrogenase-6-phosphogluconolactonase 
[Plasmodium berghei] 


7 


1540 


7688657 


gi|7688657|gb|AAF67469. 1|AF146760_1 (AF146760) 
septin 2-like cell division control protein [Homo sapiens 


9.7 


1541 


13021853 


gi| 1302 1 853|gb|AAKl 1 564. 1 |AF3 1 8500_1 (AF3 1 8500) 
ent-kaurenoic acid hydroxylase [Arabidopsis thaliana] 


0.091 


1543 


6900006 


gi|6900006|emb|CAB7 1294.1) (AJ251917) chorion 
protein sl8 [Ceratitis capitata] 


2.1 


1545 


1076445 


gi|1076445|pir||S53004 mitosis-specific cyclin CYC2 - 
rape gb|AAA5 1660.1] (L25406) cyclin [Brassica napus] 


4.3 


1548 


7293274 


gi|7293274|gb| AAF48655.1| (AE003503) CG9644 gene 
product [Drosophila melanogaster] 


0.46 


1549 


11358961 


gi|11358961|pir||T51243 Sell protein [imported] - rice 
(fragment) gb| AAC98091.1| (AF067401) Sell protein 
[Oryza sativa] 


3.9 


1550 


8978966 


gi|8978966|dbj|BAA98801.1| (AP002547) phenylalanyl 
tRNA synthetase beta [Chlamydophila pneumoniae 
J138] 


4.1 


1551 


14010341 


gi|14010341|gb|AAK51958.1|AF362013_l (AF362013) 
ATP synthase F0 subunit 6 [Halichondria sp. RFW- 
2001] 


8.2 


1553 


8923094 


gi|8923094|ref|NP_060130. 1| hypothetical protein 
FLJ20085 [Homo sapiens] ref|XP_009383.2| 
hypothetical protein FLJ20085 [Homo sapiens] 
dbj|BAA90939. 1| (AK000092) unnamed protein product 
[Homo sapiens] 


1.6 


1557 


14724725 


gi|14724725|reflXP_037600.1| hypothetical protein 
XP_037600 [Homo sapiens] 


4.2 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1561 


14758684 


gi|14758684lrefIXP 007211 41 retinnhlastnmn l 

(including osteosarcoma) [Homo sapiens] 
ref|XP_033825.1| retinoblastoma 1 (including 
osteosarcoma) [Homo sapiens] ref|XP_033826.1| 
retinoblastoma 1 (including osteosarcoma) [Homo 
sapiens] ref|XP_033827.1| retinoblastoma 1 (including 
osteosarcoma) [Homo sapiens] 


0.91 


1565 


9635387 


gi|9635387|ref|NP_059285.1| ORF137 [Xestia c-nigrum 
granulovirus] gb|AAF0525 1 . 1 |AF16222 1_137 
(AF162221) ORF137 [Xestia c-nigrum granulovirus] 


5.9 


1572 


1730077 


gi|1730077|sp|P18160|KYKl_DICDI NON- 
RECEPTOR TYROSINE KINASE SPORE LYSIS A 
(TYROSINE-PROTEIN KINASE 1) pir||T18276 non- 
receptor tyrosine kinase - slime mold (Dictyostelium 
discoideum) gb|AAB41 125. 1| (U32174) non-receptor 
tyrosine kinase [Dictyostelium discoideum] 


6 


1574 


9622133 


gi|9622 1 33 |gb|AAF89633 . 1 |AF 1 677 1 9_1 (AF 1 677 1 9) 
transmembrane leptin receptor [Sus scrofa] 


2.6 


1578 


139809 


gi|139809|sp|P27571|XIST MOUSE X INACTIVE 
SPECIFIC TRANSCRIPT PROTEIN pir||S 15433 
hypothetical protein - mouse emb|CAA41978. 1| 
(X59289) ORF [Mus musculus] prfl|171 1440A xist 
gene [Mus musculus] 


7.7 


1579 


8567792 


gi|8567792|gb|AAF76364.1| (AC013428) I-box binding 
factor, putative [Arabidopsis thaliana] 


5.7 


1581 


7638161 


gi|763 8 1 6 1 |gb| AAF65408 . 1 |AF23 83 1 2_1 (AF23 8312) 
putative serine-threonine protein kinase MkcB 
Dictyostelium discoideum] 


9.2 


1583 


12644495 


gi|12644495|sp|Q9Z7G7|EX5B_CHLPN 
EXODEOXYRIBONUCLEASE V BETA CHAIN 
dbj|BAA98945.1| (AP002547) exodeoxyribonuclease V, 
beta [Chlamydophila pneumoniae J 138] 


7.2 


1588 


1 
1 

11350667 « 


gi|11350667|pir||D83160 nitrite extrusion protein 1 
PA3877 [imported] - Pseudomonas aeruginosa (strain 
PAOl) emb|CAA75538.1| (Y15252) nitrate extrusion 
protein [Pseudomonas aeruginosa] 
gb|AAG07264.1|AE004805_2 (AE004805) nitrite 
extrusion protein 1 [Pseudomonas aeruginosa] 


9.7 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1589 


12644495 


gi| 12644495|sp|Q9Z7G7|EX5B_CHLPN 
EXODEOXYRIBONUCLEASE V BETA CHAIN 
dbj|BAA98945.1| (AP002547) exodeoxyribonuclease V, 
seta [Chlamydophila pneumoniae J 138] 


5.2 


1590 


305479 


gi|305479|gb|AAC37807.1| (L11473) envelope 
glycoprotein (Human immunodeficiency virus type 1] 


4.3 


1591 


9055244 


gi|9055244|ref|NP 061256. 11 huntington yeast partner C 

[Mus musculus] gb|AAD39464.1|AF135440_l 

(AF 135440) huntington yeast partner C [Mus musculus] 


0.8 


1593 


12666210 


gi| 1 26662 1 0|emb|C AC28083 . 1 1 (AL 138875) 
bA103J18.2 (novel protein) (Homo sapiens] 


4 


1596 


12855510 


gi| 128555 10|dbj|BAB30362.1| (AK016654) putative 
[Mus musculus] 


0.067 


1597 


133747 


gi|133747|sp|P09899|RS12_MICLU 30S RIBOSOMAL 
PROTEIN S12 pir||A26956 ribosomal protein S12 - 
Micrococcus luteus gb|AAA25317.1| (M17788) 
ribosomal protein S12 (gtg start codon) [Micrococcus 
luteus] 


9.9 


1600 


7292455 


gi|7292455|gb|AAF47859.1| (AE003480) CG15005 
gene product [Drosophila melanogaster] 


0.43 


1601 


14748674 


gi| 14748674|ref|XP_03 8133.1) Prader-Willi/Angelman 
syndrome-5 [Homo sapiens] 


8.9 


1602 


11278020 


gi|l 1278020|pir||H82215 serine transporter VC1301 
[imported] - Vibrio cholerae (group 01 strain N 16961) 
gb|AAF94460.1| (AE004210) serine transporter [Vibrio 
cholerae] 


7.8 


1607 


9757538 


gi|9757538|dbj|BAB08122.1| (AB030852) maturase 
[Lilium rubellum] 


6.6 


1618 


12249143 


gi|12249143|ref|NP_066194.2| NADH dehydrogenase 
subunit 5 [Schistosoma japonicum] gb|AAG13134.2| 
(AF2 15860) NADH dehydrogenase subunit 5 
[Schistosoma japonicum] 


5.9 


1620 


7495508 


gi|7495508|pir||T18993 hypothetical protein C06B8.1 - 
Caenorhabditis elegans emb|CAB03850.1| (Z81463) 
Similarity to C.elegans zinc finger proteins, contains 
similarity to Pfam domain: PF00104 (Ligand-binding 
domain of nuclear hormone receptor), Score— 13. 7, E- 
value=0.051 ? N=l [Caenorhabditis elegans] 


2.9 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1621 


7288570 


gi|7288570|gb|AAF45 1 70. 1 |AF2 1 8963_1 (AF2 1 8963) 
period [Drosophila miranda] 


4.3 


1622 


7707666 


gi|7707666|dbj|BAA95343.1| (AB027560) ATPase 
subunit 6 [Echinococcus vogeli] 


4.1 


1625 


4761646 


gi|4761646|gb|AAD29428. 1|AF139060_1 (AF139060) 
transmembrane cell adhesion receptor MUA-3 precursor 
[Caenorhabditis elegans] emb|CAA83226.2| (Z30974) 
contains similarity to Pfam domain: PF00057 (Low- 
density lipoprotein receptor domain class A), 
Score=52.9, E-value=2.2e-12, N=3, second half (former 
T20G5.3) contains similarity to Pfam domain: PF00008 
(EGF-like domain), Score=326.3, E-value=l> 
emb|CAC42345.1| (Z30423) contains similarity to Pfam 
domain: PF00057 (Low-density lipoprotein receptor 
domain class A), Score=52.9, E-value=2.2e-12, N=3, 
second half (former T20G5.3) contains similarity to 
Pfam domain: PF00008 (EGF-like domain), 
Score=326.3, E-value=l> 


1.7 


1626 


3582424 


gi|3582424|dbj|BAA33057.1| (AB017255) arginine 
kinase two-domain chain [Pseudocardium sachalinensis] 


4.5 


1628 


9758077 


gi|9758077|dbj|BAB08521.1| (AB009052) sucrose 
cleavage protein-like [Arabidopsis thaliana] 


6.2 


1629 


11610630 


gi|11610630|gb|AAG37436.1| (AY013711) cyclin E 
[Mustela vison] 


10 


1631 


4838093 


gi|4838093|gb|AAD30838.1| (AF103278) 
immunoglobulin heavy chain variable region [Homo 
sapiens] 


3.3 


1635 


12045265 


gi|12045265|ref]NP_073076.1| ATP synthase F0, 
subunit B (atpF) [Mycoplasma genitalium] 
sp|P47643|ATPF_MYCGE ATP SYNTHASE B 
CHAIN PRECURSOR pir||F64244 ATP synthase B 
chain (atpF) - Mycoplasma genitalium gb|AAC7 1 63 1 . 1 1 
(U39722) ATP synthase F0, subunit B (atpF) 
[Mycoplasma genitalium] 


8.5 


1636 


627406 


gi|627406|pir||A54849 collagen alpha l(Vn) chain 
precursor - human 


0.66 


1643 


2246540 


gi|2246540|gb|AAB62665.1| (U93872) glycoprotein M 
[Human herpesvirus 8] 


1.5 


1644 


2388576 


gi|2388576|gb|AAB7 1457.1| (AC000098) YUP8H12.17 
[Arabidopsis thaliana] 


4.7 
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SEQID 
NO 



ACCESS 
N 



1645 



1646 



1648 



1649 



1650 



1655 



1660 



1661 



1664 



1669 



Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



7490184 



gi|7490184|pir||T37997 carboxypeptidase y - fission 
yeast (Schizosaccharomyces pombe) pir||T43236 
carboxypeptidase C (EC 3.4.16.5) precursor [validated] 
fission yeast (Schizosaccharomyces pombe) 
emb|CAB10121.1| (Z97209) carboxypeptidase y 
Schizosaccharomyces pombe] dbj|BAA25568.1| 
(D86560) carboxypeptidase Y [Schizosaccharomyces 
pombe] 



7415597 



gi|74 15597|dbj|BAA93452. 1| (AB026494) 
acyltransferase homolog [Gentiana triflora] 



3541404 



gi|13541404|reflNP_l 1 1092.1| DNA helicase 
superfamily II) [Thermoplasma volcanium] 

dbj|BAB59714.1| (AP000992) DNA helicase 
Thermoplasma volcanium] 



6174902 



gi|6 1 74902|sp|Q24767|PER_DROYA PERIOD 
CIRCADIAN PROTEIN pir||S 17286 period clock 
protein - fruit fly (Drosophila yakuba) 
emb|CAA43439.1| (X61127) period [Drosophila 
yakuba] 



14318508 



5052950 



2988422 



14756108 



10047313 



5902048 



DESCRIPTION 



gi| 143 1 8508|ref]NP_l 1 664 1 . 1 1 Ieslp [Saccharomyces 
cerevisiae] sp|P43579|YFB3_YEAST 
HYPOTHETICAL 78.8 KD PROTEIN IN HSP12- 
HXT10 INTERGENIC REGION pir||S48316 probable 
membrane protein YFL013c - yeast (Saccharomyces 
cerevisiae) emb|CAA86347.1| (Z46255) orf, len: 692, 
CAI: 0.14 [Saccharomyces cerevisiae] dbj|BAA09225.1 
(D50617) YFL013C [Saccharomyces cerevisiae] 



gi|5052950|gb|AAD38784. 1|AF149422_1 (AF149422) 
unknown [Homo sapiens] 



gi|2988422|gb|AAC39776.1| (AF016903) agrin 
precursor [Homo sapiens] 



gi| 1 475 6 1 08|ref|XP_0298 83 . 1 1 EGF-like-domain, 
multiple 4 [Homo sapiens] 



gi|10047313|dbj|BAB13444.1| (AB046838) KIAA1618 
protein [Homo sapiens] 



gi|5902048|reflNP_008974.1| HTV-1 rev binding protein 
2; Rev interacting protein [Homo sapiens] pir||G02629 
Rev interacting protein Rip-1 - human gb|AAB00557.1 
(U55766) Rev interacting protein Rip-1 [Homo sapiens 



P VALUE 



0.009 



6.1 



7.5 



0.2 



8.8 



0.000000007 



3.4 



0.00001 



0.009 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1672 


13122465 


gi|13122465|gb|AAK12639.1|AF317654_l (AF317654) 
G protein-coupled receptor [Homo sapiens] 


2.4 


1673 


7296752 


gi|7296752|gb|AAF52030.1| (AE003603) CG12147 
gene product [Drosophila melanogaster] 


2.1 


1681 


14767950 


gi|14767950|ref|XP_040494. 1| nuclear RNA export 
factor 5 [Homo sapiens] emb|CAC20428.1| (AJ277654) 
nuclear RNA export factor 5 [Homo sapiens] 


6.9 


1684 


14740021 


gi|14740021|refIXP_033953.1| 32575 [Homo sapiens] 
reflXP_033954.1| 32576 [Homo sapiens] 
ref|XP_033955.1| 32577 [Homo sapiens] 


2.5 


1685 


13568988 


gi| 1 3568988|gb|AAK30843. 1 |AF25457 1_1 (AF25457 1) 
growuVdifferentiation factor 7 [Mus musculus] 


5.8 


1694 


106323 


gi|106323|pir||A34087 hypothetical protein (L1H 5' 
region) - human 


0.005 


1695 


7444442 


g i|7444442|pir||T02087 gag/pol polyprotein - maize 
retrotransposon Hopscotch gb|AAA57005.1| (U12626) 
copia-like retrotransposon Hopscotch polyprotein [Zea 
mays] 


0.49 


1696 


13569915 


gi|13569915|ref|NP_l 12205. 1| amnionless protein 
[Homo sapiens] gb|AAK28532.1|AF328788_l 
(AF328788) amnionless [Homo sapiens] 


4.4 


1697 


13812078 


gi| 1 3 8 1 2078|refjNP_l 1 32 1 5 . 1 1 hypothetical protein 
[Guillardia theta] gb|AAF2401 1 . 1|AF08303 1_8 
(AF08303 1) hypothetical protein [Guillardia theta] 


5.2 


1699 


462193 


gi|462193|sp|Q06003|GOLI_DROME GOLIATH 
PROTEIN (Gl PROTEIN) pir||JC1495 regulatory 
protein Gl - fruit fly (Drosophila melanogaster) 
gb|AAA285 82.1| (M97204) goliath protein [Drosophila 
melanogaster] 


9.3 


1702 


13376638 


gi|13376638|ref]NP_079359.1| hypothetical protein 
FLJ21128 [Homo sapiens] dbj|BAB15001.1| 
(AK024781) unnamed protein product [Homo sapiens] 


5E-49 


1705 


7445803 


gi|7445803|pir||JC5348 cdd4 protein - Clostridium 
difficile emb|CAA63569.1| (X92982) cdd4 [Clostridium 
difficile] 


0.28 


1706 


10175016 


gi| 1 0 1 750 1 6|dbj |BAB06 1 1 5 . 1 1 (AP00 1 5 1 5) 
BH2396~unknown [Bacillus halodurans] 


9 


1709 


3953531 


gi|3953531|dbj|BAA34722.1| (AB015856) ATF6 
Homo sapiens] 


9.1 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Prntmn*) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1712 


14329676 


gi|14329676|emb|CAC40651.1| (AJ242782) centaurin 
beta [Homo sapiens] 


0 00003 


1714 


10432382 


gi|10432382|emb|CAC10340.1| (AL139421) dJ717I23.1 
(novel protein similar to Xenopus laevis Sojo protein) 
[Homo sapiens] 


0.00006 


1717 


8247357 


gi|8247357|emb|CAB92957.1| (AJ401026) hvoothetical 
protein [Thermoanaerobacter thermohydrosulfuricus] 


1.4 


1719 


7487043 


gi|7487043|pir||T08926 hypothetical protein T15N24.50 
- Arabidopsis thaliana emb|CAB77061.1| (AL078465) 
putative protein [Arabidopsis thaliana] 
emb|CAB795 15. 1| (AL161565) putative protein 
[Arabidopsis thaliana] 


0.39 


1721 


13507765 


ffill350776SlreflNP 10Q714 11 citnilo*-^ crm™***. 

[Mycoplasma pneumoniae] sp|P75088(Y024_MYCPN 
PROBABLE GTP-BINDING PROTFTN MG094 
HOMOLOG (B01_ORF362) pir||S73454 probable GTP 
binding protein yyaF - Mycoplasma pneumoniae (strain 
ATCC 29342) gb|AAB95776.1| (AE000015) similar to 
GTPases [Mycoplasma pneumoniae] 


6 


1725 


11071788 


gi| 1 1 07 1 788|emb|CAC14632. 1| (AL449 144) 
hypothetical protein P2 14.26 [Leishmania major] 


9 


1727 


14325595 


gi|14325595|dbj|BAB60498.1| (AP000996) hypothetical 
protein [Thermoplasma volcanium] 


2.5 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Pmtf-inc) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1728 


6498442 


gi|6498442|dbj|BAA87845.1| (AP000815) ESTs 
AU081301(E20138),C99280(E10593) correspond to a 
region of the predicted gene.-Similar to Arabidopsis 
thaliana chromosome II BAC T26B15 genomic 
sequence,unknown protein. (AC004681) [Oryza sativa] 
dbj|BAB00648.2| (AP002804) ESTs 
AU08 1301 (E20 1 3 8),C99280(E 1 0593) correspond to a 
region of the predicted gene.-Similar to Arabidopsis 
thaliana chromosome II BAC T26B15 genomic 
sequence; unknown protein (AC004681) [Oryza sativa] 
dbj|BAB 17744.1| (AP002862) contains ESTs 
AU081301(E20138) C99280fE10593V~simi1ar tr> 
Arabidopsis thaliana chromosome 2, 
F12C20. 1 1 -unknown protein [Oryza sativa] 
dbj|BAB44118.1| (AP003103) contains ESTs 
AU081301(E20138) 5 C99280(E10593)~similarto 
Arabidopsis thaliana chromosome 2, 
F12C20. 1 1 -unknown protein [Oryza sativa] 


1.1 


1741 


11467083 


gi|l 1467083|ref]NP_042559.1| NADH dehydrogenase, 
su burnt 3 rAcanthamoeba castpllaniil 
sp|Q37382|NU3M_ACACA NADH-UBIQUINONE 
OXIDOREDUCTASE CHAIN 3 oirllS53860 NADH 
dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 3 - 
Acanthamoeba castellanii mitochondrion 
gb|AADl 1852.1| (U12386) NADH dehydrogenase, 
subunit 3 [Acanthamoeba castellanii] 


5 


1742 


12843826 


gi|12843826|dbj|BAB26128.1| (AK009187) putative 
[Mus musculus] 


0.00000002 


1743 


4928550 


gi|4928550|gb|AAD33637.1| (AF133876) DBL alpha 
jrotein [Plasmodium falciparum] 


6.2 


1744 


6503033 


gi|6503033|gb|AAF14557.1|AF176666_l (AF176666) 
F-box leucine-rich repeat protein 5 [Xenopus laevis] 


3.8 


i /HD 


1 

6562750 ] 


gi|6562750|emb|CAB62889.1| (AL035475) hypothetical 
protein, MAL4P2.48 [Plasmodium falciparum] 


8.9 


1750 


1 

7498998 < 


»i|7498998|pir||T16057 hypothetical protein F13D11.2 - 
Caenorhabditis elegans 


5.4 


1751 


6939792 


gi|6939792|dbj|BAA90658!l| (AB037920) HA-17 
[Clostridium botulinum] 


7.3 
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Table 3B Nearest Neighbor (BlastX vs. Non-ReHnnHant Pmhm.) 


SEQID 
NO 


ACCESS 
N 


DESCRIPTION 


P VALUE 


1754 


14195008 


gi|14195008|sp|O9JI55|PLEl CRIGR PI FCTTN 1 
(PLTN) (PCN) (300-KDA INTERMEDIATE 
FILAMENT-ASSOCIATED PROTEIN) (IFAP300) 
gb|AAF70372.1| (AF260753) plectin [Cricetulus 
griseus] 


1.6 


1756 


74519 


gi|745 19|pir||GNWVR4 structural polyprotein - rubella 
virus (strain Therien) 


4.4 


1760 


6324076 


gi|6324076|rer]NP_014146.1| Ynl253wp 
rSaccharomvces cerevi^iael ^nlPS^RS 1 IVTsJT"* vp a ct 
HYPOTHETICAL 47.2 KD PROTEIN IN SIP3- 
MRPL30 INTERGENIC REGION pir||S63226 
hypothetical protein YNL253w - yeast (Saccharomyces 
cerevisiae) emb|CAA65491.1| (X96722) ORF N0860 
[Saccharomyces cerevisiae] emb|CAA96160.1| 
(Z71529) ORF YNL253w [Saccharomyces cerevisiae] 


3.6 


1762 


1708082 


gi|1708082|sp|P50900|GUX2_CLOSR 
EXOGLUCANASE II PRECURSOR 
(EXOCELLOBIOHYDROLASE II) (1,4-BETA- 
CELLOBIOHYDROLASE II) (AVTCELASE II) 
emb|CAA93280.1| (Z69359) avicelase II [Clostridium 
stercorarium] 


1.9 


1765 


2224838 


gi|2224838|emb|CAA45388.1| (X63974) putative 
protein-tyrosine phosphatase [Rhodobacter capsulatus] 


5.2 


1768 


6513773 


gi|6513773|gb|AAF14748.1|AF197756_l (AF197756) 
maturase [Hedyosmum arborescens] 


0.63 


1775 


14732870 


gi|14732870|ref|XP_029018.1| hypothetical protein 
FLJ 10647 [Homo sapiens] ref|XP_001969.2| 
hypothetical protein FLJ 10647 [Homo sapiens] 


1.3 


1779 


7521942 


gi|7521942|pir||T29096 gag polyprotein - murine 
endogenous retrovirus ERV-L emb|CAA73250.1| 
(Y 127 13) Gag polyprotein [Mus musculus] 


0.071 


1 19 < 


7508408 ' 


gi|7508408|pir||T25251 hypothetical protein T24H10.4 - 
Caenorhabditis elegans emb|CAA90944.1| (Z54216) 
T24H10.4 [Caenorhabditis elegans] 


2.9 


1788 


1 
i 

I 

7460247 


gi|7460247|pir||B71612 hypothetical protein PFB0555c - 
nalaria parasite (Plasmodium falciparum) 
»b|AAC71900.1| (AE001402) hypothetical protein 
Plasmodium falciparum] 


6.4 
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Table 3B Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQID 
NO 


ACCESS 

N 


DESCRIPTION 


P VALUE 


1789 


4337102 


gi|4337 102|gb|AAD 1 8078. 1 |AAD 18078 (AF 129756) 
G6f [Homo sapiens] 


2E-24 


1792 


1723494 


eill723494lsDlO10413IYT")8R SrHPOVFUV 
HYPOTHETICAL 13.3 KD PROTEIN C1F3.08C IN 
CHROMOSOME I pir||T38079 very hypothetical 
protein SPAClF3.08c - fission yeast 
(Schizosaccharomyces pombe) emb|CAA94626.1| 
(Z70690) very hypothetical protein 
[Schizosaccharomyces pombe] 


7.2 


1793 


4589921 


gi|45 8992 1 |dbj|BAA76927. 1 1 (ABO 1 7 1 92) 
molybdopterin biosynthesis protein [Clostridium 
perfringens] 


4.2 


1799 


7494302 


gi|7494302|pir||E7161 1 hypothetical protein PFB0580w ■ 
malaria parasite (Plasmodium falciparum) 
gb|AAC71905.1| (AE001404) hypothetical protein 
[Plasmodium falciparum] 


3.3 


1800 


14318508 


gi|14318508|reflNP_l 16641. 1| Ieslp [Saccharomyces 
cerevisiael snlP43579[YFFn VFA^IT 
HYPOTHETICAL 78.8 KD PROTEIN IN HSP12- 
HXT10 INTERGENIC REGION DirllS483 16 orobable 
membrane protein YFL013c - yeast (Saccharomyces 
cerevisiae) emb|CAA86347.1| (Z46255) orf, len: 692, 
CAI: 0.14 [Saccharomyces cerevisiae] dbj|BAA09225.1| 
(D50617) YFL013C [Saccharomyces cerevisiae] 


6.6 


1802 


14325595 


gi|14325595|dbj|BAB60498.1| (AP000996) hypothetical 
protein [Thermoplasma volcanium] 


2 


1809 


7520399 


gi|7520399|pir||Tl 1689 NADH dehydrogenase 
(ubiquinone) (EC 1.6.5.3) chain 5 - Graphium sarpedon 
mitochondrion (fragment) dbj|BAA28187. 1| 
(AB013 147) NADH dehydrogenase subunit 5 
Graphium sarpedon] 


4.9 


1816 


423981 


gi|423981|pir||A46193 88K E-26-specific domain 
protein Pok - fruit fly (Drosophila melanogaster) 
dbj|BAA01080. 1| (D10228) Ets domain protein 
Drosophila melanogaster] 


0.23 


1818 


7463036 


gi|7463036|pir||C70177 beta-glucosidase homolog - 
Lyme disease spirochete gb|AAC66976.1| (AE001 163) 
beta-glucosidase, putative [Borrelia burgdorferi] 


8.4 


1819 


3851471 |i 


gi|3851471|gb|AAC72292.1| (AF037295) sulfonylurea 
receptor- 1 [Mus musculus] | 


3.4 



500 



